arXiv2d ago

A Layer-wise Analysis of Supervised Fine-Tuning

Qinghua Zhao, Xueling Gong, Xinyu Chen, Zhongfeng Kang, Xinlu Li

View original ↗

Analysis

Viral velocity

low

Implementation gapYES

Novelty5/10

Categorypaper

Topics

fine-tuninganalysistransformers

Opportunity Brief

Create an automated diagnostics tool that maps SFT progress and catastrophic forgetting risks layer-by-layer during training. This tool would visualize where alignment occurs and where information is lost.

Suggested repo: sftWatcher

"Visualize where your model learns (and forgets) during fine-tuning."

Estimated effort: 20h