Gianluca Guglielmo, Marc Masana
View original ↗Develop a plug-and-play PyTorch module for robust OOD detection using activation shifting. This is a critical safety component for production LLM deployments.
Suggested repo: activation-guard
"Detect when your model is confused by out-of-distribution inputs before it starts hallucinating."
Estimated effort: 45h