Lute Lillo, Nick Cheney
Create a modular RL trainer that supports 'plasticity-preserving' updates, preventing model interference in multi-task scenarios.
Suggested repo: plastic-rl
"Don't let your RL agents lose what they learned."
Estimated effort: 50h
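
As a starting point, here is a minimal sketch of what "modular" could mean for such a trainer: the update rule stays plain, and plasticity-preserving behavior is attached as pluggable hooks that run at task boundaries. The card does not say which technique to use, so this sketch swaps in shrink-and-perturb (scale weights toward zero, then add small Gaussian noise) purely as a stand-in. The names `ModularTrainer`, `PlasticityHook`, `shrink_and_perturb`, and `on_task_switch` are hypothetical, and the flat list of floats stands in for real network parameters.

```python
import random
from dataclasses import dataclass, field
from typing import Callable, List

Params = List[float]
# Hypothetical hook type: takes current parameters, returns adjusted parameters.
PlasticityHook = Callable[[Params], Params]


def shrink_and_perturb(shrink: float = 0.8, noise_std: float = 0.01,
                       seed: int = 0) -> PlasticityHook:
    """Build a hook that scales weights toward zero and adds Gaussian noise.

    Assumption: shrink-and-perturb is used here only as an example technique;
    the project description does not name a specific method.
    """
    rng = random.Random(seed)

    def hook(params: Params) -> Params:
        return [shrink * p + rng.gauss(0.0, noise_std) for p in params]

    return hook


@dataclass
class ModularTrainer:
    """Toy trainer: plain gradient steps plus pluggable task-boundary hooks."""
    params: Params
    lr: float = 0.1
    hooks: List[PlasticityHook] = field(default_factory=list)

    def step(self, grads: Params) -> None:
        # Plain gradient-descent update; a real RL loss would supply grads.
        self.params = [p - self.lr * g for p, g in zip(self.params, grads)]

    def on_task_switch(self) -> None:
        # Apply every registered hook, in order, when the task changes.
        for hook in self.hooks:
            self.params = hook(self.params)


if __name__ == "__main__":
    trainer = ModularTrainer(params=[1.0, -2.0], hooks=[shrink_and_perturb()])
    trainer.step([1.0, 1.0])   # one gradient step on the current task
    trainer.on_task_switch()   # hooks run before the next task starts
    print(trainer.params)
```

Keeping the hooks separate from `step` means other techniques (for example, periodic layer resets or regularization toward the initial weights) could be swapped in without touching the training loop, which is one plausible reading of the modularity the card asks for.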