Yixian Shen, Chaoyao Shen, Jan Deen, George Floros, Andy Pimentel, Anuj Pathania
View original ↗Develop a lightweight scheduling library for LFM inference on many-core systems that accounts for thermal and cache locality. This is critical for deploying local models on hardware other than traditional GPUs.
Suggested repo: ThermalCore
"Optimized LFM inference that keeps your cores cool."
Estimated effort: 90h