arXiv, 9h ago

Improvisational Games as a Benchmark for Social Intelligence of AI Agents: The Case of Connections

Gaurav Rajesh Parikh, Angikar Ghosal

Analysis

Viral velocity: low

Implementation gap: YES

Novelty: 7/10

Category: paper

Topics: reasoning, benchmarking, agents

Opportunity Brief

Build a standardized benchmark suite for social intelligence around the structure of the 'Connections' game. An improvisational game offers a more nuanced test of agent reasoning than static QA datasets.

Suggested repo: social-bench

"Move beyond accuracy scores; test how your agents think, connect, and collaborate."

Estimated effort: 25h