Search Results for author: Zehong Hu

Found 4 papers, 1 paper with code

Thompson Sampling for Unimodal Bandits

no code implementations | 15 Jun 2021 | Long Yang, Zhao Li, Zehong Hu, Shasha Ruan, Shijian Li, Gang Pan, Hongyang Chen

In this paper, we propose a Thompson Sampling algorithm for unimodal bandits, where the expected reward is unimodal over the partially ordered arms.

Which Channel to Ask My Question? Personalized Customer Service Request Stream Routing using Deep Reinforcement Learning

no code implementations | 24 Nov 2019 | Zining Liu, Chong Long, Xiaolu Lu, Zehong Hu, Jie Zhang, Yafang Wang

These observations suggest that our proposed method can seek the trade-off where both channel resources and customers' satisfaction are optimal.

Chatbot, Q-Learning

Inference Aided Reinforcement Learning for Incentive Mechanism Design in Crowdsourcing

no code implementations | NeurIPS 2018 | Zehong Hu, Yitao Liang, Yang Liu, Jie Zhang

Incentive mechanisms for crowdsourcing are designed to incentivize financially self-interested workers to generate and report high-quality labels.

Bayesian Inference
