Channy Yun (윤석찬)
Create an automated benchmarking tool for open-weight models that compares Mistral Large 3 against local alternatives using standard industry evaluation suites. Focus on throughput-per-dollar cost analysis for self-hosted instances.
Suggested repo: bench-models-now
"Don't guess which model is cheaper; verify the performance per dollar yourself."
Estimated effort: 20h
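
As a rough illustration of the throughput-per-dollar metric the idea centers on, here is a minimal Python sketch. It assumes each benchmark run records the number of generated tokens, the wall-clock duration, and the hourly price of the host. The names `BenchmarkRun`, `tokens_per_dollar`, and `rank_by_value` are hypothetical and not part of any existing tool, and the figures in the usage example are placeholders, not measurements.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BenchmarkRun:
    """One measured run of a self-hosted model (hypothetical record layout)."""
    model: str
    output_tokens: int            # tokens generated during the run
    wall_seconds: float           # wall-clock duration of the run
    instance_usd_per_hour: float  # hourly price of the host instance


def tokens_per_second(run: BenchmarkRun) -> float:
    """Raw throughput of the run."""
    if run.wall_seconds <= 0:
        raise ValueError("wall_seconds must be positive")
    return run.output_tokens / run.wall_seconds


def tokens_per_dollar(run: BenchmarkRun) -> float:
    """Tokens generated per US dollar of instance time consumed by the run."""
    if run.wall_seconds <= 0 or run.instance_usd_per_hour <= 0:
        raise ValueError("duration and hourly price must be positive")
    run_cost_usd = run.instance_usd_per_hour * run.wall_seconds / 3600
    return run.output_tokens / run_cost_usd


def rank_by_value(runs: list[BenchmarkRun]) -> list[BenchmarkRun]:
    """Sort runs from most to fewest tokens per dollar."""
    return sorted(runs, key=tokens_per_dollar, reverse=True)


if __name__ == "__main__":
    # Placeholder numbers for illustration only, not real benchmark results.
    runs = [
        BenchmarkRun("model-a", 36_000, 60.0, 2.0),
        BenchmarkRun("model-b", 18_000, 60.0, 0.5),
    ]
    for run in rank_by_value(runs):
        print(f"{run.model}: {tokens_per_second(run):.0f} tok/s, "
              f"{tokens_per_dollar(run):,.0f} tok/$")
```

Ranking by tokens per dollar rather than raw tokens per second is the point of the exercise: a slower model on a cheaper instance can still come out ahead. A real tool would also need to hold quality constant, for example by reporting evaluation-suite scores alongside the cost figure.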