Search Results for author: Jibang Wu

Found 6 papers, 2 papers with code

Learning to Incentivize Information Acquisition: Proper Scoring Rules Meet Principal-Agent Model

no code implementations15 Mar 2023 Siyu Chen, Jibang Wu, Yifan Wu, Zhuoran Yang

Such a problem is modeled as a Stackelberg game between the principal and the agent, where the principal announces a scoring rule that specifies the payment, and then the agent then chooses an effort level that maximizes her own profit and reports the information.

Sequential Information Design: Markov Persuasion Process and Its Efficient Reinforcement Learning

no code implementations22 Feb 2022 Jibang Wu, Zixuan Zhang, Zhe Feng, Zhaoran Wang, Zhuoran Yang, Michael I. Jordan, Haifeng Xu

This paper proposes a novel model of sequential information design, namely the Markov persuasion processes (MPPs), where a sender, with informational advantage, seeks to persuade a stream of myopic receivers to take actions that maximizes the sender's cumulative utilities in a finite horizon Markovian environment with varying prior and utility functions.

reinforcement-learning Reinforcement Learning (RL)

Least Square Calibration for Peer Reviews

1 code implementation NeurIPS 2021 Sijun Tan, Jibang Wu, Xiaohui Bei, Haifeng Xu

Peer review systems such as conference paper review often suffer from the issue of miscalibration.

Uncoupled Bandit Learning towards Rationalizability: Benchmarks, Barriers, and Algorithms

no code implementations10 Nov 2021 Jibang Wu, Haifeng Xu, Fan Yao

Under the uncoupled learning setup, the last-iterate convergence guarantee towards Nash equilibrium is shown to be impossible in many games.

Decision Making

Least Square Calibration for Peer Review

1 code implementation25 Oct 2021 Sijun Tan, Jibang Wu, Xiaohui Bei, Haifeng Xu

Peer review systems such as conference paper review often suffer from the issue of miscalibration.

Déjà vu: A Contextualized Temporal Attention Mechanism for Sequential Recommendation

no code implementations29 Jan 2020 Jibang Wu, Renqin Cai, Hongning Wang

Predicting users' preferences based on their sequential behaviors in history is challenging and crucial for modern recommender systems.

Sequential Recommendation

Cannot find the paper you are looking for? You can Submit a new open access paper.