This project will replicate Anthropic’s paper on Dictionary Learning for feature extraction from Language Models [Bri+23], and explore potential extensions if possible.
⭐ Fernando was accepted to participate on the ML4Good Bootcamp during his mentorship in Carreras con Impacto.
This project will replicate Anthropic’s paper on Dictionary Learning for feature extraction from Language Models [Bri+23], and explore potential extensions if possible.
⭐ Fernando was accepted to participate on the ML4Good Bootcamp during his mentorship in Carreras con Impacto.