Search Results for author: Rati Devidze

Found 11 papers, 5 papers with code

Informativeness of Reward Functions in Reinforcement Learning

1 code implementation10 Feb 2024 Rati Devidze, Parameswaran Kamalaruban, Adish Singla

Reward functions are central in specifying the task we want a reinforcement learning agent to perform.

Informativeness reinforcement-learning

Learning to Play Text-based Adventure Games with Maximum Entropy Reinforcement Learning

1 code implementation21 Feb 2023 Weichen Li, Rati Devidze, Sophie Fellenz

To deal with sparse extrinsic rewards from the environment, we combine it with a potential-based reward shaping technique to provide more informative (dense) reward signals to the RL agent.

Q-Learning reinforcement-learning +2

Explicable Reward Design for Reinforcement Learning Agents

1 code implementation NeurIPS 2021 Rati Devidze, Goran Radanovic, Parameswaran Kamalaruban, Adish Singla

By being explicable, we seek to capture two properties: (a) informativeness so that the rewards speed up the agent's convergence, and (b) sparseness as a proxy for ease of interpretability of the rewards.

Informativeness reinforcement-learning +1

Policy Teaching in Reinforcement Learning via Environment Poisoning Attacks

no code implementations21 Nov 2020 Amin Rakhsha, Goran Radanovic, Rati Devidze, Xiaojin Zhu, Adish Singla

We provide lower/upper bounds on the attack cost, and instantiate our attacks in two settings: (i) an offline setting where the agent is doing planning in the poisoned environment, and (ii) an online setting where the agent is learning a policy with poisoned feedback.

reinforcement-learning Reinforcement Learning (RL)

Environment Shaping in Reinforcement Learning using State Abstraction

no code implementations23 Jun 2020 Parameswaran Kamalaruban, Rati Devidze, Volkan Cevher, Adish Singla

However, the applicability of potential-based reward shaping is limited in settings where (i) the state space is very large, and it is challenging to compute an appropriate potential function, (ii) the feedback signals are noisy, and even with shaped rewards the agent could be trapped in local optima, and (iii) changing the rewards alone is not sufficient, and effective shaping requires changing the dynamics.

reinforcement-learning Reinforcement Learning (RL)

Policy Teaching via Environment Poisoning: Training-time Adversarial Attacks against Reinforcement Learning

1 code implementation ICML 2020 Amin Rakhsha, Goran Radanovic, Rati Devidze, Xiaojin Zhu, Adish Singla

We study a security threat to reinforcement learning where an attacker poisons the learning environment to force the agent into executing a target policy chosen by the attacker.

reinforcement-learning Reinforcement Learning (RL)

Understanding the Power and Limitations of Teaching with Imperfect Knowledge

no code implementations21 Mar 2020 Rati Devidze, Farnam Mansouri, Luis Haug, Yuxin Chen, Adish Singla

Machine teaching studies the interaction between a teacher and a student/learner where the teacher selects training examples for the learner to learn a specific task.

Learner-aware Teaching: Inverse Reinforcement Learning with Preferences and Constraints

no code implementations NeurIPS 2019 Sebastian Tschiatschek, Ahana Ghosh, Luis Haug, Rati Devidze, Adish Singla

We study two teaching approaches: learner-agnostic teaching, where the teacher provides demonstrations from an optimal policy ignoring the learner's preferences, and learner-aware teaching, where the teacher accounts for the learner's preferences.

reinforcement-learning Reinforcement Learning (RL)

Interactive Teaching Algorithms for Inverse Reinforcement Learning

no code implementations28 May 2019 Parameswaran Kamalaruban, Rati Devidze, Volkan Cevher, Adish Singla

We study the problem of inverse reinforcement learning (IRL) with the added twist that the learner is assisted by a helpful teacher.

reinforcement-learning Reinforcement Learning (RL)

Learning to Collaborate in Markov Decision Processes

no code implementations23 Jan 2019 Goran Radanovic, Rati Devidze, David C. Parkes, Adish Singla

We consider a two-agent MDP framework where agents repeatedly solve a task in a collaborative setting.

Cannot find the paper you are looking for? You can Submit a new open access paper.