Pei-Fu Guo, Ya-An Tsai, Chun-Chia Hsu, Kai-Xin Chen, Yun-Da Tsai, Kai-Wei Chang, Nanyun Peng, Mi-Yen Yeh, Shou-De Lin
View original ↗Develop a benchmark tool that measures an LLM's ability to extract 'distributional knowledge'—inferring population-level trends from large corpora. Current RAG focuses on local facts; this addresses the broader analytical capacity.
Suggested repo: DistribBench
"Move beyond facts: test if your LLM actually understands trends."
Estimated effort: 40h