no code implementations • 4 Oct 2022 • Edoardo Cetin, Benjamin Chamberlain, Michael Bronstein, Jonathan J Hunt
We propose a new class of deep reinforcement learning (RL) algorithms that model latent representations in hyperbolic space.
1 code implementation • 12 Aug 2022 • Zhendong Wang, Jonathan J Hunt, Mingyuan Zhou
In our approach, we learn an action-value function and we add a term maximizing action-values into the training loss of the conditional diffusion model, which results in a loss that seeks optimal actions that are near the behavior policy.
no code implementations • 17 Feb 2022 • Conor O'Brien, Huasen Wu, Shaodan Zhai, Dalin Guo, Wenzhe Shi, Jonathan J Hunt
In this work we focus on mobile push notifications, where the long term effects of recommender system decisions can be particularly strong.
no code implementations • 29 Jan 2022 • Conor O'Brien, Arvind Thiagarajan, Sourav Das, Rafael Barreto, Chetan Verma, Tim Hsu, James Neufield, Jonathan J Hunt
In this paper we outline the recent privacy-related changes in the online advertising ecosystem from a machine learning perspective.
no code implementations • 19 Jan 2022 • Yuguang Yue, Yuanpu Xie, Huasen Wu, Haofeng Jia, Shaodan Zhai, Wenzhe Shi, Jonathan J Hunt
Listwise ranking losses have been widely studied in recommender systems.
no code implementations • 18 Aug 2021 • Conor O'Brien, Kin Sum Liu, James Neufeld, Rafael Barreto, Jonathan J Hunt
Industrial recommender systems are frequently tasked with approximating probabilities for multiple, often closely related, user actions.