Zonghuan Xu, Xiang Zheng, Yutao Wu, Xingjun Ma
Create an open-source library that supports human-in-the-loop (HITL) calibration for LLM-based risk evaluators, helping developers replace pure LLM judges with hybrid human-AI scoring pipelines. A minimal sketch of such a calibration loop appears below this entry.
Suggested repo: truth-gauge
"Your LLM judge is biased—calibrate its risk assessments with real human feedback."
Estimated effort: 30h
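To make the idea concrete, here is a minimal sketch of one plausible calibration loop: fit a Platt-scaling map from raw LLM judge scores to human-grounded risk probabilities, then route ambiguous cases to human review. All names here (`calibrate_judge`, `HybridScorer`, the threshold values) are hypothetical illustrations, not an API of the proposed truth-gauge library.

```python
# Hypothetical sketch of HITL calibration for an LLM risk judge.
# Assumes raw judge scores in [0, 1] and binary human labels (1 = risky).
import numpy as np
from sklearn.linear_model import LogisticRegression


def calibrate_judge(raw_scores: np.ndarray, human_labels: np.ndarray) -> LogisticRegression:
    """Fit a Platt-scaling map from raw LLM judge scores to calibrated risk probabilities."""
    model = LogisticRegression()
    model.fit(raw_scores.reshape(-1, 1), human_labels)
    return model


class HybridScorer:
    """Trust the calibrated judge on clear cases; route ambiguous ones to a human."""

    def __init__(self, calibrator: LogisticRegression, low: float = 0.3, high: float = 0.7):
        self.calibrator = calibrator
        self.low, self.high = low, high  # ambiguity band that triggers human review

    def score(self, raw_score: float) -> dict:
        p_risk = self.calibrator.predict_proba([[raw_score]])[0, 1]
        return {"p_risk": float(p_risk), "needs_human": self.low < p_risk < self.high}


# Usage: calibrate on a small human-labeled set, then score new judge outputs.
rng = np.random.default_rng(0)
raw = rng.uniform(size=200)
labels = (raw + rng.normal(scale=0.2, size=200) > 0.6).astype(int)  # synthetic stand-in for human labels
scorer = HybridScorer(calibrate_judge(raw, labels))
print(scorer.score(0.55))  # e.g. {'p_risk': 0.4..., 'needs_human': True}
```

Platt scaling is just one option; the library could equally expose isotonic regression or per-category calibrators, and the review band could be tuned against the cost of human labels.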