1 code implementation • 24 Aug 2023 • Hanchi Huang, Li Shen, Deheng Ye, Wei Liu
We propose a novel master-slave architecture to solve the top-$K$ combinatorial multi-armed bandits problem with non-linear bandit feedback and diversity constraints, which, to the best of our knowledge, is the first combinatorial bandits setting considering diversity constraints under bandit feedback.
1 code implementation • 7 Nov 2022 • Hanchi Huang, Deheng Ye, Li Shen, Wei Liu
To mitigate the negative influence of customizing the one-off training order in curriculum-based AMTL, CAMRL switches its training mode between parallel single-task RL and asymmetric multi-task RL (MTRL), according to an indicator regarding the training time, the overall performance, and the performance gap among tasks.
1 code implementation • AAAI 2021 • Chao Chen, Dongsheng Li, Junchi Yan, Hanchi Huang, Xiaokang Yang
One-bit matrix completion is an important class of positiveunlabeled (PU) learning problems where the observations consist of only positive examples, eg, in top-N recommender systems.