Brian Hsu, Ozan Gökdemir, Carlo Siebenschuh, Bruce Parrello, Neil Getty, Thomas S. Brettin, Rick L. Stevens, Ian T. Foster, Nicholas Chia, Arvind Ramanathan
Create an open-source dataset pipeline that extracts reasoning-heavy Q&A pairs from the biology literature, then use the resulting dataset to fine-tune a small model for biological reasoning benchmarks.
Suggested repo: bio-reason-gen
"Train your models on actual scientific reasoning, not just textbook biology."
Estimated effort: 60h
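
One way the extraction stage of such a pipeline might start is sketched below. This is a minimal illustration under stated assumptions, not the authors' method: the cue-phrase list, the `QARecord` fields, the window size, and the score threshold are all hypothetical, and the question/answer drafting step is a placeholder where a real pipeline would presumably use a model or human annotators.

```python
import json
import re
from dataclasses import dataclass, asdict

# Hypothetical cue phrases that often mark an explicit inference in prose.
REASONING_CUES = (
    "because", "therefore", "suggests that",
    "consistent with", "as a result", "which indicates",
)


@dataclass
class QARecord:
    """One fine-tuning example; the field names are illustrative."""
    source_id: str
    context: str
    question: str
    answer: str


def split_sentences(text: str) -> list[str]:
    # Naive splitter: break after '.', '!' or '?' followed by whitespace.
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def reasoning_score(passage: str) -> int:
    # Count cue-phrase occurrences as a crude proxy for "reasoning-heavy".
    lowered = passage.lower()
    return sum(lowered.count(cue) for cue in REASONING_CUES)


def select_passages(text: str, window: int = 3, min_score: int = 2) -> list[str]:
    # Group sentences into non-overlapping windows and keep the dense ones.
    sentences = split_sentences(text)
    selected = []
    for start in range(0, len(sentences), window):
        passage = " ".join(sentences[start:start + window])
        if reasoning_score(passage) >= min_score:
            selected.append(passage)
    return selected


def draft_record(source_id: str, passage: str) -> QARecord:
    # Placeholder: a generic question, with the passage itself as the answer.
    return QARecord(
        source_id=source_id,
        context=passage,
        question="What conclusion does the passage draw, and on what evidence?",
        answer=passage,
    )


def to_jsonl(records: list[QARecord]) -> str:
    # One JSON object per line, a common layout for fine-tuning data.
    return "\n".join(json.dumps(asdict(r), ensure_ascii=False) for r in records)
```

Keyword counting is only a cheap first-pass filter. It would let a pipeline discard methods-style passages before spending effort on question generation, but it says nothing about whether the reasoning in a kept passage is sound.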