Qinghua Zhao, Xueling Gong, Xinyu Chen, Zhongfeng Kang, Xinlu Li
Create an automated diagnostics tool that maps supervised fine-tuning (SFT) progress and catastrophic-forgetting risk layer by layer during training. The tool would visualize where alignment occurs and where information is lost.
Suggested repo: sftWatcher
"Visualize where your model learns (and forgets) during fine-tuning."
Estimated effort: 20h
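
As a starting point, the layer-by-layer mapping described above could be approximated with a per-layer drift score between a base checkpoint and a fine-tuned one. The sketch below is only an illustration under stated assumptions: relative L2 weight change is a stand-in metric chosen here, not a method taken from the original work, and the names `layer_drift` and `render_bars` are hypothetical. Weights are passed as flat lists of floats so the example runs with the standard library alone; in a real tool they would more likely come from flattened model tensors.

```python
import math
from typing import Dict, Sequence


def layer_drift(
    base: Dict[str, Sequence[float]],
    tuned: Dict[str, Sequence[float]],
    eps: float = 1e-12,
) -> Dict[str, float]:
    """Relative L2 change per layer: ||tuned - base|| / ||base||.

    Assumption: weight drift is used as a rough proxy for "where the
    model changed"; it does not by itself prove learning or forgetting.
    """
    drift = {}
    for name, w0 in base.items():
        w1 = tuned[name]
        if len(w0) != len(w1):
            raise ValueError(f"shape mismatch for layer {name!r}")
        diff = math.sqrt(sum((a - b) ** 2 for a, b in zip(w0, w1)))
        norm = math.sqrt(sum(a * a for a in w0))
        # eps guards against division by zero for all-zero base weights.
        drift[name] = diff / max(norm, eps)
    return drift


def render_bars(drift: Dict[str, float], width: int = 20) -> str:
    """Render drift scores as a plain-text bar chart, scaled to the largest value."""
    peak = max(drift.values(), default=0.0)
    lines = []
    for name, value in drift.items():
        filled = round(width * value / peak) if peak > 0 else 0
        lines.append(f"{name:<10} {'#' * filled:<{width}} {value:.3f}")
    return "\n".join(lines)


if __name__ == "__main__":
    # Toy checkpoints: layer.0 is unchanged, layer.1 has moved.
    base = {"layer.0": [1.0, 0.0], "layer.1": [0.0, 2.0]}
    tuned = {"layer.0": [1.0, 0.0], "layer.1": [0.0, 1.0]}
    print(render_bars(layer_drift(base, tuned)))
```

Calling `layer_drift` on checkpoints saved at several training steps would give one row per step, which is the kind of data a layer-by-step heatmap could be drawn from. Detecting forgetting would need an additional signal, such as held-out task loss, which this sketch does not attempt.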