Fernando Avalos – Carreras con Impacto

Open-source Replication of Decomposition of Language Models with Dictionary Learning

Artificial Intelligence

Date: March 2024

Fernando Ávalos

This project will replicate Anthropic’s paper on Dictionary Learning for feature extraction from Language Models [Bri+23], and explore potential extensions if possible.

⭐ Fernando was accepted to participate on the ML4Good Bootcamp during his mentorship in Carreras con Impacto.

See English Version

Ver Versión en Español

This project will replicate Anthropic’s paper on Dictionary Learning for feature extraction from Language Models [Bri+23], and explore potential extensions if possible.

⭐ Fernando was accepted to participate on the ML4Good Bootcamp during his mentorship in Carreras con Impacto.

See English Version

Ver Versión en Español

Go Back to Projects