Dingzhi Yu, Rui Pan, Yuxing Liu, Tong Zhang
Release a robust implementation of StoSignSGD as an optimizer plug-in for PyTorch to improve training stability on non-smooth model objectives.
Suggested repo: stosign-sgd
"Robust sign-based optimization for your foundation models."
Estimated effort: 40h