Dat Tran, Douwe Kiela
View original ↗Develop a framework that standardizes compute budgets for comparing single-agent vs multi-agent systems. It will settle debates on whether MAS is actually 'smarter'.
Suggested repo: compute-equalizer
"Fair comparisons for LLM agent performance."
Estimated effort: 50h