Bryan Sanchez
View original ↗Develop a plug-and-play adapter library that uncovers hidden factual knowledge in models suppressed by over-alignment. This is essential for transparency and unbiased analysis.
Suggested repo: debias-adapter
"Unlock the suppressed facts hiding inside your alignment-tuned models."
Estimated effort: 50h