Aadyot Bhatnagar, Peter M{\o}rch Groth, Ali Madani
View original ↗Build a library implementing Smooth Tchebysheff Scalarization for multi-objective offline RL. This helps researchers align models on multiple conflicting constraints without scalarization bias.
Suggested repo: ParetoRL
"Multi-objective alignment without the compromises of linear weighting."
Estimated effort: 40h