Mohammad AL-Smadi
View original ↗Implement a modular pipeline for high-precision reasoning fine-tuning. This allows domain experts to inject specific rule-based knowledge into LLMs with minimal resource overhead.
Suggested repo: reason-qlora
"Inject complex legal/math reasoning into LLMs on a consumer GPU."
Estimated effort: 30h