Gabriel Afriat, Xiang Meng, Shibal Ibrahim, Hussein Hazimeh, Rahul Mazumder
View original ↗Develop an open-source framework that combines multiple objectives (Taylor approximation + layer-wise reconstruction) for one-shot pruning. This provides a more robust way to compress LLMs without retraining cycles.
Suggested repo: moonPrune
"One-shot model compression that actually keeps performance."
Estimated effort: 80h