Yuanhang Li
View original ↗Build a testbed to evaluate the stability of DRL agents under dynamic vs. static reward regimes. This helps developers verify if their agent rewards are causing instability.
Suggested repo: rl-stable
"Does your reward function cause instability? Find out with this testbench."
Estimated effort: 45h