Ziyi He, Yushi Feng, Shuangyu Yang, Yinghao Zhu, Xichen Zhang, Pak Chuen Patrick Tai, Hei Yuet Lo, Songying Wu, Weifa Yang, Lequan Yu
Create a unified evaluation harness for specialized clinical triage models. Use this to compare multimodal models (like LLaVA or GPT-4o) on hierarchical dental referral accuracy.
Suggested repo: dentaleval
"Benchmark your clinical agents against the first expert-annotated dental triage dataset."
Estimated effort: 20h
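
As a starting point, the scoring core of such a harness could look like the sketch below. It is a minimal illustration under stated assumptions, not the dataset's actual schema or metric: it assumes each referral label is a path from coarse to fine (for example a referral decision followed by an urgency tier), and every name in it (`Case`, `Predictor`, `hierarchical_accuracy`, `evaluate`, and the label strings in the usage note) is hypothetical. Each model is wrapped as a plain callable, so an adapter for LLaVA, GPT-4o, or any other model only has to return a label path.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence, Tuple

# Hypothetical label format: a path from coarse to fine,
# e.g. ("refer", "urgent"). The real dataset may use a different scheme.
Label = Tuple[str, ...]


@dataclass
class Case:
    """One triage case. Field names are assumptions for illustration."""
    case_id: str
    prompt: str
    image_path: str
    gold: Label


# A model adapter is any callable that maps a case to a predicted label path.
Predictor = Callable[[Case], Label]


def hierarchical_accuracy(
    gold: Sequence[Label], pred: Sequence[Label]
) -> Dict[str, float]:
    """Per-level prefix accuracy plus exact-path accuracy."""
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same length")
    if not gold:
        raise ValueError("no cases to score")

    scores: Dict[str, float] = {}
    depth = max(len(g) for g in gold)
    for level in range(1, depth + 1):
        # Only cases whose gold path reaches this level are scored at it.
        relevant = [(g, p) for g, p in zip(gold, pred) if len(g) >= level]
        hits = sum(1 for g, p in relevant if p[:level] == g[:level])
        scores[f"level_{level}"] = hits / len(relevant)

    # Exact match requires the full predicted path to equal the gold path.
    scores["exact"] = sum(g == p for g, p in zip(gold, pred)) / len(gold)
    return scores


def evaluate(
    models: Dict[str, Predictor], cases: List[Case]
) -> Dict[str, Dict[str, float]]:
    """Run every model on every case and score it against the gold labels."""
    gold = [case.gold for case in cases]
    return {
        name: hierarchical_accuracy(gold, [tuple(predict(c)) for c in cases])
        for name, predict in models.items()
    }
```

To use it, build a list of `Case` objects and pass `evaluate` a dictionary that maps model names to adapters, for instance `evaluate({"baseline": lambda c: ("refer", "urgent")}, cases)`. Scoring by prefix gives partial credit to a model that gets the coarse referral decision right but misses the finer tier, which is one common way to read "hierarchical" accuracy. Other definitions, such as distance in the label tree, would need a different scoring function.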