Sing Hieng Wong, Hassan Sajjad, A. B. Siddique
Implement sparse autoencoders (SAEs) for language-direction steering, so that users can control a model's output language at inference time without fine-tuning.
Suggested repo: lang-steer
"Inject language control vectors into any LLM at inference."
Estimated effort: 70h
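
A minimal sketch of the steering step, assuming the language direction is taken from one row of a trained SAE decoder and added to a hidden-state vector at inference. The names `DECODER`, `unit`, and `steer`, the toy values, and the scale `alpha` are hypothetical illustrations, not part of the original work. A real implementation would apply this to a model's hidden states at a chosen layer.

```python
import math

# Toy SAE decoder: one row per learned feature, each row a direction in
# the model's hidden space (hypothetical values, d_model = 3).
DECODER = [
    [1.0, 0.0, 0.0],
    [0.0, 3.0, 4.0],  # assume this feature tracks the target language
]


def unit(vec):
    """Return vec scaled to unit length."""
    norm = math.sqrt(sum(v * v for v in vec))
    if norm == 0.0:
        raise ValueError("direction must be non-zero")
    return [v / norm for v in vec]


def steer(hidden, direction, alpha):
    """Add alpha times the unit-normalized direction to a hidden state."""
    u = unit(direction)
    return [h + alpha * d for h, d in zip(hidden, u)]


# Push a toy hidden state along the assumed language feature.
hidden = [0.5, 0.0, 0.0]
steered = steer(hidden, DECODER[1], alpha=5.0)
```

Normalizing the direction first makes `alpha` the size of the shift in hidden-state units, which keeps the steering strength comparable across features whose decoder rows differ in norm.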