no code implementations • 22 Aug 2023 • Linjian Meng, Zhenxing Ge, Wenbin Li, Bo An, Yang Gao
Recent work proposes a Reward Transformation (RT) framework for Multiplicative Weights Update (MWU) that removes the uniqueness condition and achieves performance competitive with Optimistic MWU (OMWU).
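
For readers unfamiliar with the two baselines named above, the following is a minimal sketch of plain MWU and OMWU in a two-player zero-sum matrix game. It does not implement the RT framework. The game (matching pennies), the step size `eta`, the starting strategies, and all function names are illustrative assumptions, not taken from the paper. The sketch also assumes that the "uniqueness condition" refers to uniqueness of the Nash equilibrium, which matching pennies satisfies.

```python
import math

# Matching pennies: payoff to the row player (the column player gets the negative).
# The unique Nash equilibrium is (0.5, 0.5) for both players.
A = [[1.0, -1.0], [-1.0, 1.0]]


def row_payoffs(A, y):
    """Expected payoff of each row action against column strategy y, i.e. A y."""
    return [sum(A[i][j] * y[j] for j in range(len(y))) for i in range(len(A))]


def col_payoffs(A, x):
    """Expected row-player payoff under each column action, i.e. A^T x."""
    return [sum(A[i][j] * x[i] for i in range(len(x))) for j in range(len(A[0]))]


def mw_step(p, signal, eta):
    """One multiplicative-weights step: p_i <- p_i * exp(eta * signal_i), renormalized."""
    w = [pi * math.exp(eta * s) for pi, s in zip(p, signal)]
    total = sum(w)
    return [wi / total for wi in w]


def run(A, x, y, eta, steps, optimistic):
    """Simultaneous self-play; returns the last iterate (x, y)."""
    prev_gx, prev_gy = row_payoffs(A, y), col_payoffs(A, x)
    for _ in range(steps):
        gx, gy = row_payoffs(A, y), col_payoffs(A, x)
        if optimistic:
            # OMWU uses the prediction 2 * g_t - g_{t-1} in place of g_t.
            sx = [2 * g - pg for g, pg in zip(gx, prev_gx)]
            sy = [-(2 * g - pg) for g, pg in zip(gy, prev_gy)]
        else:
            sx = gx
            sy = [-g for g in gy]  # the column player minimizes the row payoff
        x, y = mw_step(x, sx, eta), mw_step(y, sy, eta)
        prev_gx, prev_gy = gx, gy
    return x, y


def dist_to_equilibrium(x, y):
    """Largest deviation of either player's first-action probability from 0.5."""
    return max(abs(x[0] - 0.5), abs(y[0] - 0.5))


x0, y0 = [0.7, 0.3], [0.4, 0.6]  # arbitrary interior starting point
mwu_dist = dist_to_equilibrium(*run(A, x0, y0, eta=0.1, steps=3000, optimistic=False))
omwu_dist = dist_to_equilibrium(*run(A, x0, y0, eta=0.1, steps=3000, optimistic=True))
print("MWU  last-iterate distance:", mwu_dist)
print("OMWU last-iterate distance:", omwu_dist)
```

In this toy game the last iterate of MWU stays away from the equilibrium, while OMWU's last iterate approaches it. This is only an illustration of the baselines under the assumptions above and says nothing about games with multiple equilibria, which is the setting the RT framework targets.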