Dip Roy, Rajiv Misra, Sanjay Kumar Singh, Anisha Roy
View original ↗Create an early-warning signal monitor for LLM internal activations. Detect the 'hallucination-precursor' states using probe classifiers on middle layers.
Suggested repo: hallu-scan
"Detect hallucination before the first token is even generated."
Estimated effort: 70h