Divya Shyamal, Marta Knežević, Lan Tran, Chanakya Ekbote, Vijay Lingam, Paul Pu Liang
Create a lightweight, calibration-focused ranking module for Best-of-N LLM inference. Focus on simple, training-free scoring functions that outperform standard process reward models (PRMs) in low-compute settings.
Suggested repo: scatr
"Better LLM inference via smarter candidate selection."
Estimated effort: 25h
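
As a starting point for the ranking module described above, Best-of-N selection with a training-free scorer might be sketched as follows. This is a minimal illustration, not the authors' method: the `Candidate` type, the `mean_logprob_score` function, and the assumption that per-token log-probabilities are available from the generator are all hypothetical choices made for the example. Mean token log-probability is used only as a simple stand-in for a training-free score.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Candidate:
    """One sampled completion (hypothetical container for this sketch)."""
    text: str
    # Per-token log-probabilities reported by the generator (assumed available).
    token_logprobs: List[float]


def mean_logprob_score(candidate: Candidate) -> float:
    """Length-normalized log-likelihood, a simple training-free confidence score."""
    if not candidate.token_logprobs:
        return float("-inf")  # an empty completion should never win
    return sum(candidate.token_logprobs) / len(candidate.token_logprobs)


def best_of_n(
    candidates: List[Candidate],
    score_fn: Callable[[Candidate], float] = mean_logprob_score,
) -> Candidate:
    """Return the highest-scoring candidate under score_fn."""
    if not candidates:
        raise ValueError("best_of_n needs at least one candidate")
    return max(candidates, key=score_fn)


if __name__ == "__main__":
    samples = [
        Candidate("answer A", [-0.1, -0.2]),
        Candidate("answer B", [-1.0, -0.5, -0.3]),
    ]
    print(best_of_n(samples).text)
```

Keeping the scorer as a plain callable means other training-free scores, or a PRM baseline for comparison, can be swapped in without changing the selection logic.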