arXiv · 3h ago

Single-Agent LLMs Outperform Multi-Agent Systems on Multi-Hop Reasoning Under Equal Thinking Token Budgets

Dat Tran, Douwe Kiela

Analysis

Viral velocity: low

Implementation gap: No

Novelty: 8/10

Category: paper

Topics: reasoning, inference

Opportunity Brief

Develop a framework that standardizes compute budgets, such as thinking tokens, when comparing single-agent and multi-agent systems (MAS). Holding the budget equal would help show whether MAS gains reflect better reasoning or simply more compute.

Suggested repo: compute-equalizer

"Fair comparisons for LLM agent performance."

Estimated effort: 50h
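
The budget standardization proposed in the brief could start from something like the sketch below. It is a minimal illustration, not the paper's method: the names `BudgetPlan`, `split_budget`, and `within_budget` are hypothetical, and the even split across agents is an assumption (a real harness might weight agents differently or only cap the total).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BudgetPlan:
    """Thinking-token caps for one system under comparison (hypothetical type)."""
    total_budget: int
    per_agent: tuple[int, ...]


def split_budget(total_budget: int, num_agents: int) -> BudgetPlan:
    """Divide a thinking-token budget as evenly as possible across agents.

    Assumption: an even split. A single-agent system is just num_agents=1.
    """
    if total_budget < 0:
        raise ValueError("total_budget must be non-negative")
    if num_agents < 1:
        raise ValueError("num_agents must be at least 1")
    base, extra = divmod(total_budget, num_agents)
    # The first `extra` agents get one additional token so the caps sum exactly.
    caps = tuple(base + (1 if i < extra else 0) for i in range(num_agents))
    return BudgetPlan(total_budget, caps)


def within_budget(plan: BudgetPlan, tokens_used: list[int]) -> bool:
    """Return True if the system's total thinking tokens fit the shared budget."""
    if len(tokens_used) != len(plan.per_agent):
        raise ValueError("expected one usage figure per agent")
    return sum(tokens_used) <= plan.total_budget


# Same total budget for both systems; only the number of agents differs.
single = split_budget(1000, 1)
multi = split_budget(1000, 3)
```

With this setup, a run is only counted in the comparison if `within_budget` holds for both systems, so neither side is scored with extra thinking tokens.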