Search Results for author: Gokul Swamy

Found 15 papers, 7 papers with code

Hybrid Inverse Reinforcement Learning

1 code implementation • 13 Feb 2024 • Juntao Ren, Gokul Swamy, Zhiwei Steven Wu, J. Andrew Bagnell, Sanjiban Choudhury

In this work, we propose using hybrid RL -- training on a mixture of online and expert data -- to curtail unnecessary exploration.

Continuous Control Imitation Learning +2

Paper
Code

The Virtues of Pessimism in Inverse Reinforcement Learning

no code implementations • 4 Feb 2024 • David Wu, Gokul Swamy, J. Andrew Bagnell, Zhiwei Steven Wu, Sanjiban Choudhury

Inverse Reinforcement Learning (IRL) is a powerful framework for learning complex behaviors from expert demonstrations.

Offline RL reinforcement-learning +1

Paper
Add Code

A Minimaximalist Approach to Reinforcement Learning from Human Feedback

no code implementations • 8 Jan 2024 • Gokul Swamy, Christoph Dann, Rahul Kidambi, Zhiwei Steven Wu, Alekh Agarwal

Our approach is maximalist in that it provably handles non-Markovian, intransitive, and stochastic preferences while being robust to the compounding errors that plague offline approaches to sequential prediction.

Continuous Control reinforcement-learning

Paper
Add Code

Learning Shared Safety Constraints from Multi-task Demonstrations

1 code implementation • NeurIPS 2023 • Konwoo Kim, Gokul Swamy, Zuxin Liu, Ding Zhao, Sanjiban Choudhury, Zhiwei Steven Wu

Regardless of the particular task we want them to perform in an environment, there are often shared safety constraints we want our agents to respect.

Continuous Control

Paper
Code

Inverse Reinforcement Learning without Reinforcement Learning

1 code implementation • 26 Mar 2023 • Gokul Swamy, Sanjiban Choudhury, J. Andrew Bagnell, Zhiwei Steven Wu

In this work, we demonstrate for the first time a more informed imitation learning reduction where we utilize the state distribution of the expert to alleviate the global exploration component of the RL subroutine, providing an exponential speedup in theory.

Continuous Control Imitation Learning +2

Paper
Code

Game-Theoretic Algorithms for Conditional Moment Matching

no code implementations • 19 Aug 2022 • Gokul Swamy, Sanjiban Choudhury, J. Andrew Bagnell, Zhiwei Steven Wu

A variety of problems in econometrics and machine learning, including instrumental variable regression and Bellman residual minimization, can be formulated as satisfying a set of conditional moment restrictions (CMR).

Econometrics regression

Paper
Add Code

Sequence Model Imitation Learning with Unobserved Contexts

1 code implementation • 3 Aug 2022 • Gokul Swamy, Sanjiban Choudhury, J. Andrew Bagnell, Zhiwei Steven Wu

We consider imitation learning problems where the learner's ability to mimic the expert increases throughout the course of an episode as more information is revealed.

Continuous Control Imitation Learning

Paper
Code

Minimax Optimal Online Imitation Learning via Replay Estimation

1 code implementation • 30 May 2022 • Gokul Swamy, Nived Rajaraman, Matthew Peng, Sanjiban Choudhury, J. Andrew Bagnell, Zhiwei Steven Wu, Jiantao Jiao, Kannan Ramchandran

In the tabular setting or with linear function approximation, our meta theorem shows that the performance gap incurred by our approach achieves the optimal $\widetilde{O} \left( \min({H^{3/2}} / {N}, {H} / {\sqrt{N}} \right)$ dependency, under significantly weaker assumptions compared to prior work.

Continuous Control Imitation Learning

Paper
Code

Causal Imitation Learning under Temporally Correlated Noise

1 code implementation • 2 Feb 2022 • Gokul Swamy, Sanjiban Choudhury, J. Andrew Bagnell, Zhiwei Steven Wu

We develop algorithms for imitation learning from policy data that was corrupted by temporally correlated noise in expert actions.

Econometrics Imitation Learning

Paper
Code

A Critique of Strictly Batch Imitation Learning

no code implementations • 5 Oct 2021 • Gokul Swamy, Sanjiban Choudhury, J. Andrew Bagnell, Zhiwei Steven Wu

Recent work by Jarrett et al. attempts to frame the problem of offline imitation learning (IL) as one of learning a joint energy-based model, with the hope of out-performing standard behavioral cloning.

Imitation Learning

Paper
Add Code

What Would the Expert $do(\cdot)$?: Causal Imitation Learning

no code implementations • 29 Sep 2021 • Gokul Swamy, Sanjiban Choudhury, Drew Bagnell, Steven Wu

Both approaches are able to find policies that match the result of a query to an unconfounded expert.

Imitation Learning

Paper
Add Code

Of Moments and Matching: A Game-Theoretic Framework for Closing the Imitation Gap

2 code implementations • 4 Mar 2021 • Gokul Swamy, Sanjiban Choudhury, J. Andrew Bagnell, Zhiwei Steven Wu

We provide a unifying view of a large family of previous imitation learning algorithms through the lens of moment matching.

Imitation Learning

385

Paper
Code

Scaled Autonomy: Enabling Human Operators to Control Robot Fleets

no code implementations • 22 Sep 2019 • Gokul Swamy, Siddharth Reddy, Sergey Levine, Anca D. Dragan

We learn a model of the user's preferences from observations of the user's choices in easy settings with a few robots, and use it in challenging settings with more robots to automatically identify which robot the user would most likely choose to control, if they were able to evaluate the states of all robots at all times.

Robot Navigation

Paper
Add Code

On the Utility of Model Learning in HRI

no code implementations • 4 Jan 2019 • Gokul Swamy, Jens Schulz, Rohan Choudhury, Dylan Hadfield-Menell, Anca Dragan

Fundamental to robotics is the debate between model-based and model-free learning: should the robot build an explicit model of the world, or learn a policy directly?

Autonomous Driving

Paper
Add Code

Generative Models for Pose Transfer

no code implementations • 24 Jun 2018 • Patrick Chao, Alexander Li, Gokul Swamy

We investigate nearest neighbor and generative models for transferring pose between persons.

Face Detection Pose Transfer

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.