GreenGames
View original ↗Build a deployment tool that automatically optimizes and benches large models for consumer-grade RTX cards. This helps developers achieve maximum throughput without manual tuning.
Suggested repo: turbo-inference
"Get 200+ tokens/sec on your home PC."
Estimated effort: 60h