Search Results for author: Jiarui Gan

Found 10 papers, 0 papers with code

Markov Decision Processes with Time-Varying Geometric Discounting

no code implementations19 Jul 2023 Jiarui Gan, Annika Hennes, Rupak Majumdar, Debmalya Mandal, Goran Radanovic

We take a game-theoretic perspective -- whereby each time step is treated as an independent decision maker with their own (fixed) discount factor -- and we study the subgame perfect equilibrium (SPE) of the resulting game as well as the related algorithmic problems.

Sequential Principal-Agent Problems with Communication: Efficient Computation and Learning

no code implementations6 Jun 2023 Jiarui Gan, Rupak Majumdar, Debmalya Mandal, Goran Radanovic

In this model, the principal and the agent interact in a stochastic environment, and each is privy to observations about the state not available to the other.

Decision Making

Learning to Manipulate a Commitment Optimizer

no code implementations23 Feb 2023 Yurong Chen, Xiaotie Deng, Jiarui Gan, Yuhao Li

We consider the scenario where the follower is not given any information about the leader's payoffs to begin with but has to learn to manipulate by interacting with the leader.

Online Reinforcement Learning with Uncertain Episode Lengths

no code implementations7 Feb 2023 Debmalya Mandal, Goran Radanovic, Jiarui Gan, Adish Singla, Rupak Majumdar

We show that minimizing regret with this new general discounting is equivalent to minimizing regret with uncertain episode lengths.

reinforcement-learning Reinforcement Learning (RL)

Socially Fair Reinforcement Learning

no code implementations26 Aug 2022 Debmalya Mandal, Jiarui Gan

We consider the problem of minimizing regret with respect to the fair policies maximizing three different fair objectives -- minimum welfare, generalized Gini welfare, and Nash social welfare.

reinforcement-learning Reinforcement Learning (RL)

Admissible Policy Teaching through Reward Design

no code implementations6 Jan 2022 Kiarash Banihashem, Adish Singla, Jiarui Gan, Goran Radanovic

This problem can be viewed as a dual to the problem of optimal reward poisoning attacks: instead of forcing an agent to adopt a specific policy, the reward designer incentivizes an agent to avoid taking actions that are inadmissible in certain states.

Your College Dorm and Dormmates: Fair Resource Sharing with Externalities

no code implementations8 Dec 2020 Jiarui Gan, Bo Li, Yingkai Li

Clearly, the strong notion of envy-freeness, where no agent envies another for their resource or mates, cannot always be achieved and we show that even deciding the existence of such a strongly envy-free assignment is an intractable problem.

Computer Science and Game Theory

Budget-feasible Maximum Nash Social Welfare Allocation is Almost Envy-free

no code implementations7 Dec 2020 Xiaowei Wu, Bo Li, Jiarui Gan

The Nash social welfare (NSW) is a well-known social welfare measurement that balances individual utilities and the overall efficiency.

Fairness Computer Science and Game Theory Multiagent Systems

Optimally Deceiving a Learning Leader in Stackelberg Games

no code implementations NeurIPS 2020 Georgios Birmpas, Jiarui Gan, Alexandros Hollender, Francisco J. Marmolejo-Cossío, Ninad Rajgopal, Alexandros A. Voudouris

For this strategic behavior to be successful, the main challenge faced by the follower is to pinpoint the payoffs that would make the learning algorithm compute a commitment so that best responding to it maximizes the follower's utility, according to his true payoffs.

Manipulating a Learning Defender and Ways to Counteract

no code implementations NeurIPS 2019 Jiarui Gan, Qingyu Guo, Long Tran-Thanh, Bo An, Michael Wooldridge

We then apply a game-theoretic framework at a higher level to counteract such manipulation, in which the defender commits to a policy that specifies her strategy commitment according to the learned information.

Cannot find the paper you are looking for? You can Submit a new open access paper.