DeepReinforce Team, Xiaoya Li, Xiaofei Sun, Guoyin Wang, Songqiao Su, Chris Shum, Jiwei Li
Implement a multi-agent reinforcement learning setup built specifically for competitive programming environments. This pushes the state of the art for LLM coding agents beyond basic chat interfaces.
Suggested repo: GrandAgent
"A multi-agent coding framework that trains models to win competitive programming tournaments."
Estimated effort: 100h