1 code implementation • 3 May 2024 • Olivier Jeunen, Jatin Mandav, Ivan Potapov, Nakul Agarwal, Sourabh Vaid, Wenzhe Shi, Aleksei Ustimenko
We frame this as a decision-making task, where the scalarisation weights are actions taken to maximise an overall North Star reward (e. g. long-term user retention or growth).