Open-source Replication of Decomposition of Language Models with Dictionary Learning

Artificial Intelligence

Date: March 2024

Fernando Ávalos

Fernando Ávalos

This project will replicate Anthropic’s paper on Dictionary Learning for feature extraction from Language Models [Bri+23], and explore potential extensions if possible.

⭐ Fernando was accepted to participate on the ML4Good Bootcamp during his mentorship in Carreras con Impacto.

This project will replicate Anthropic’s paper on Dictionary Learning for feature extraction from Language Models [Bri+23], and explore potential extensions if possible.

⭐ Fernando was accepted to participate on the ML4Good Bootcamp during his mentorship in Carreras con Impacto.