arXiv · 1d ago

GeoAgentBench: A Dynamic Execution Benchmark for Tool-Augmented Agents in Spatial Analysis

Bo Yu, Cheng Yang, Dongyang Hou, Chengfu Liu, Jiayao Liu, Chi Wang, Zhiming Zhang, Haifeng Li, Wentao Yang

Analysis

Viral velocity: low

Implementation gap: YES

Novelty: 7/10

Category: paper

Topics

agents, tool, geospatial, benchmarking

Build an execution-aware benchmark for GIS-augmented agents. Focus on multi-step geospatial workflows where dynamic runtime feedback is required.
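As a rough illustration of what "execution-aware" could mean here, the sketch below runs an agent's tool calls one at a time and hands each result or error back to the agent before it picks the next call. It is not the paper's harness: the tools `buffer_point` and `area`, the `run_episode` loop, and the scripted agent are all hypothetical stand-ins, and a real benchmark would presumably wrap an actual GIS library instead of these toy functions.

```python
import math
from typing import Callable

# Hypothetical toy tools standing in for real GIS operations.
def buffer_point(x: float, y: float, radius: float) -> dict:
    if radius <= 0:
        raise ValueError("radius must be positive")
    return {"type": "circle", "center": (x, y), "radius": radius}

def area(shape: dict) -> float:
    return math.pi * shape["radius"] ** 2

TOOLS: dict[str, Callable] = {"buffer_point": buffer_point, "area": area}

def run_episode(agent, max_steps: int = 5):
    """Execute tool calls step by step, returning each result or error
    message to the agent before it chooses its next call."""
    feedback, trace = None, []
    for _ in range(max_steps):
        call = agent(feedback)
        if call is None:  # the agent signals that it is finished
            break
        name, kwargs = call
        try:
            feedback = {"ok": True, "value": TOOLS[name](**kwargs)}
        except Exception as exc:  # runtime errors become feedback, not crashes
            feedback = {"ok": False, "error": str(exc)}
        trace.append((name, feedback["ok"]))
    return feedback, trace

def scripted_agent(feedback):
    """Hypothetical policy: starts with an invalid radius, then repairs it."""
    if feedback is None:
        return ("buffer_point", {"x": 0.0, "y": 0.0, "radius": -1.0})
    if not feedback["ok"]:
        return ("buffer_point", {"x": 0.0, "y": 0.0, "radius": 2.0})
    if isinstance(feedback["value"], dict):
        return ("area", {"shape": feedback["value"]})
    return None

final, trace = run_episode(scripted_agent)
print(trace)
print(final)
```

The trace records which calls succeeded, so a scorer could credit recovery from a failed step as well as the final answer. A text-only benchmark cannot observe that.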

Suggested repo: geoEval

"Move beyond text-only benchmarks; evaluate spatial reasoning agents in live GIS environments."

Estimated effort: 70h