1 code implementation • 17 Oct 2023 • Yandi Li, Jianxiong Guo, Yupeng Li, Tian Wang, Weijia Jia
Thus, we formulate an adversarial MAB problem with multi-user delayed feedback and design a modified EXP3 algorithm, MUD-EXP3, which makes a decision at each round using importance-weighted estimators of the feedback received from different users.
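To make the idea concrete, here is a minimal sketch of a standard EXP3 learner adapted to delayed feedback from several users. It is not the authors' MUD-EXP3: the function name `exp3_multi_user_delayed`, the `feedback_fn` callback (returning one `(loss, delay)` pair per user), the learning rate `eta`, and the assumption that losses lie in [0, 1] are all illustrative choices, not details taken from the paper.

```python
import math
import random


def exp3_multi_user_delayed(feedback_fn, n_arms, n_rounds, eta=0.05, seed=0):
    """EXP3-style learner with delayed, multi-user feedback (illustrative only).

    feedback_fn(t, arm) is a hypothetical callback returning a list of
    (loss, delay) pairs, one per user, with loss in [0, 1] and delay >= 0.
    """
    rng = random.Random(seed)
    cum_est = [0.0] * n_arms  # cumulative importance-weighted loss estimates
    pending = {}              # arrival round -> list of (arm, prob, loss)
    choices = []

    for t in range(n_rounds):
        # Fold in every piece of feedback that arrives at the start of round t.
        for arm, prob, loss in pending.pop(t, []):
            cum_est[arm] += loss / prob  # importance-weighted estimate

        # Exponential weights; subtracting the minimum avoids underflow.
        base = min(cum_est)
        weights = [math.exp(-eta * (c - base)) for c in cum_est]
        total = sum(weights)
        probs = [w / total for w in weights]

        arm = rng.choices(range(n_arms), weights=probs)[0]
        choices.append(arm)

        # Each user's loss becomes visible only after that user's own delay.
        for loss, delay in feedback_fn(t, arm):
            arrival = t + 1 + delay
            pending.setdefault(arrival, []).append((arm, probs[arm], loss))

    return choices, cum_est


def toy_feedback(t, arm):
    # Hypothetical environment: arm 0 is better; two users with delays 2 and 5.
    loss = 0.1 if arm == 0 else 0.9
    return [(loss, 2), (loss, 5)]


choices, estimates = exp3_multi_user_delayed(toy_feedback, n_arms=2, n_rounds=2000)
print("fraction of rounds on arm 0:", choices.count(0) / len(choices))
```

The probability used in each estimate is the one in force when the arm was played, not the one at the round the feedback arrives, which keeps the estimator unbiased under delay in this simplified setting.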
no code implementations • 6 Nov 2022 • Yandi Li, Haobo Gao, Yunxuan Gao, Jianxiong Guo, Weili Wu
Therefore, we abandon the traditional algorithms based on iterative search and review the recent development of ML-based methods, especially Deep Reinforcement Learning, to solve the influence maximization (IM) problem and its variants in social networks.