Search Results for author: Anirudha Majumdar

Found 32 papers, 12 papers with code

Explore until Confident: Efficient Exploration for Embodied Question Answering

no code implementations23 Mar 2024 Allen Z. Ren, Jaden Clark, Anushri Dixit, Masha Itkina, Anirudha Majumdar, Dorsa Sadigh

We consider the problem of Embodied Question Answering (EQA), which refers to settings where an embodied agent such as a robot needs to actively explore an environment to gather information until it is confident about the answer to a question.

Conformal Prediction Efficient Exploration +3

PAC-Bayes Generalization Certificates for Learned Inductive Conformal Prediction

no code implementations NeurIPS 2023 Apoorva Sharma, Sushant Veer, Asher Hancock, Heng Yang, Marco Pavone, Anirudha Majumdar

To remedy this, recent work has proposed learning model and score function parameters using data to directly optimize the efficiency of the ICP prediction sets.

Conformal Prediction Generalization Bounds

Physically Grounded Vision-Language Models for Robotic Manipulation

no code implementations5 Sep 2023 Jensen Gao, Bidipta Sarkar, Fei Xia, Ted Xiao, Jiajun Wu, Brian Ichter, Anirudha Majumdar, Dorsa Sadigh

We incorporate this physically grounded VLM in an interactive framework with a large language model-based robotic planner, and show improved planning performance on tasks that require reasoning about physical object concepts, compared to baselines that do not leverage physically grounded VLMs.

Image Captioning Language Modelling +4

Robots That Ask For Help: Uncertainty Alignment for Large Language Model Planners

no code implementations4 Jul 2023 Allen Z. Ren, Anushri Dixit, Alexandra Bodrova, Sumeet Singh, Stephen Tu, Noah Brown, Peng Xu, Leila Takayama, Fei Xia, Jake Varley, Zhenjia Xu, Dorsa Sadigh, Andy Zeng, Anirudha Majumdar

Large language models (LLMs) exhibit a wide range of promising capabilities -- from step-by-step planning to commonsense reasoning -- that may provide utility for robots, but remain prone to confidently hallucinated predictions.

Conformal Prediction Language Modelling +1

Fundamental Tradeoffs in Learning with Prior Information

no code implementations26 Apr 2023 Anirudha Majumdar

We seek to understand fundamental tradeoffs between the accuracy of prior information that a learner has on a given problem and its learning performance.

AdaptSim: Task-Driven Simulation Adaptation for Sim-to-Real Transfer

no code implementations9 Feb 2023 Allen Z. Ren, Hongkai Dai, Benjamin Burchfiel, Anirudha Majumdar

Addressing this issue, we propose AdaptSim, a new task-driven adaptation framework for sim-to-real transfer that aims to optimize task performance in target (real) environments -- instead of matching dynamics between simulation and reality.

Leveraging Language for Accelerated Learning of Tool Manipulation

no code implementations27 Jun 2022 Allen Z. Ren, Bharat Govil, Tsung-Yen Yang, Karthik Narasimhan, Anirudha Majumdar

Robust and generalized tool manipulation requires an understanding of the properties and affordances of different tools.

Meta-Learning

Fundamental Limits for Sensor-Based Robot Control

1 code implementation31 Jan 2022 Anirudha Majumdar, Zhiting Mei, Vincent Pacelli

Our goal is to develop theory and algorithms for establishing fundamental limits on performance imposed by a robot's sensors for a given task.

Decision Making

Sim-to-Lab-to-Real: Safe Reinforcement Learning with Shielding and Generalization Guarantees

no code implementations20 Jan 2022 Kai-Chieh Hsu, Allen Z. Ren, Duy Phuong Nguyen, Anirudha Majumdar, Jaime F. Fisac

To improve safety, we apply a dual policy setup where a performance policy is trained using the cumulative task reward and a backup (safety) policy is trained by solving the Safety Bellman Equation based on Hamilton-Jacobi (HJ) reachability analysis.

reinforcement-learning Reinforcement Learning (RL) +1

Stronger Generalization Guarantees for Robot Learning by Combining Generative Models and Real-World Data

no code implementations16 Nov 2021 Abhinav Agarwal, Sushant Veer, Allen Z. Ren, Anirudha Majumdar

The key idea behind our approach is to utilize the generative model in order to implicitly specify a prior over policies.

Learning Provably Robust Motion Planners Using Funnel Libraries

no code implementations16 Nov 2021 Ali Ekin Gurgen, Anirudha Majumdar, Sushant Veer

This paper presents an approach for learning motion planners that are accompanied with probabilistic guarantees of success on new environments that hold uniformly for any disturbance to the robot's dynamics within an admissible set.

Generalization Bounds

Distributionally Robust Policy Learning via Adversarial Environment Generation

1 code implementation13 Jul 2021 Allen Z. Ren, Anirudha Majumdar

Our goal is to train control policies that generalize well to unseen environments.

Task-Driven Detection of Distribution Shifts with Statistical Guarantees for Robot Learning

1 code implementation25 Jun 2021 Alec Farid, Sushant Veer, Divyanshu Pachisia, Anirudha Majumdar

Our goal is to perform out-of-distribution (OOD) detection, i. e., to detect when a robot is operating in environments drawn from a different distribution than the ones used to train the robot.

Out-of-Distribution Detection Out of Distribution (OOD) Detection

A Regret Minimization Approach to Iterative Learning Control

no code implementations26 Feb 2021 Naman Agarwal, Elad Hazan, Anirudha Majumdar, Karan Singh

We consider the setting of iterative learning control, or model-based policy learning in the presence of uncertain, time-varying dynamics.

Deluca -- A Differentiable Control Library: Environments, Methods, and Benchmarking

1 code implementation19 Feb 2021 Paula Gradu, John Hallman, Daniel Suo, Alex Yu, Naman Agarwal, Udaya Ghai, Karan Singh, Cyril Zhang, Anirudha Majumdar, Elad Hazan

We present an open-source library of natively differentiable physics and robotics environments, accompanied by gradient-based control methods and a benchmark-ing suite.

Benchmarking OpenAI Gym

Generating Adversarial Disturbances for Controller Verification

no code implementations12 Dec 2020 Udaya Ghai, David Snyder, Anirudha Majumdar, Elad Hazan

We consider the problem of generating maximally adversarial disturbances for a given controller assuming only blackbox access to it.

LagNetViP: A Lagrangian Neural Network for Video Prediction

no code implementations24 Oct 2020 Christine Allen-Blanchette, Sushant Veer, Anirudha Majumdar, Naomi Ehrich Leonard

In this paper, we introduce a video prediction model where the equations of motion are explicitly constructed from learned representations of the underlying physical quantities.

Acrobot Video Prediction

Learning to Actively Reduce Memory Requirements for Robot Control Tasks

1 code implementation17 Aug 2020 Meghan Booker, Anirudha Majumdar

State-of-the-art approaches for controlling robots often use memory representations that are excessively rich for the task or rely on hand-crafted tricks for memory efficiency.

Generalization Guarantees for Imitation Learning

2 code implementations5 Aug 2020 Allen Z. Ren, Sushant Veer, Anirudha Majumdar

Control policies from imitation learning can often fail to generalize to novel environments due to imperfect demonstrations or the inability of imitation learning algorithms to accurately infer the expert's policies.

Generalization Bounds Imitation Learning

CoNES: Convex Natural Evolutionary Strategies

no code implementations16 Jul 2020 Sushant Veer, Anirudha Majumdar

We present a novel algorithm -- convex natural evolutionary strategies (CoNES) -- for optimizing high-dimensional blackbox functions by leveraging tools from convex optimization and information geometry.

Benchmarking reinforcement-learning +1

Invariant Policy Optimization: Towards Stronger Generalization in Reinforcement Learning

1 code implementation1 Jun 2020 Anoopkumar Sonar, Vincent Pacelli, Anirudha Majumdar

A fundamental challenge in reinforcement learning is to learn policies that generalize beyond the operating domains experienced during training.

Policy Gradient Methods reinforcement-learning +1

Probably Approximately Correct Vision-Based Planning using Motion Primitives

1 code implementation28 Feb 2020 Sushant Veer, Anirudha Majumdar

This paper presents an approach for learning vision-based planners that provably generalize to novel environments (i. e., environments unseen during training).

Learning Task-Driven Control Policies via Information Bottlenecks

no code implementations4 Feb 2020 Vincent Pacelli, Anirudha Majumdar

Standard reinforcement learning algorithms typically produce policies that tightly couple control actions to the entirety of the system's state and rich sensor observations.

reinforcement-learning Reinforcement Learning (RL)

A Survey of Recent Scalability Improvements for Semidefinite Programming with Applications in Machine Learning, Control, and Robotics

no code implementations14 Aug 2019 Anirudha Majumdar, Georgina Hall, Amir Ali Ahmadi

Historically, scalability has been a major challenge to the successful application of semidefinite programming in fields such as machine learning, control, and robotics.

BIG-bench Machine Learning

Task-Driven Estimation and Control via Information Bottlenecks

1 code implementation20 Sep 2018 Vincent Pacelli, Anirudha Majumdar

We propose novel iterative algorithms for automatically synthesizing (offline) a task-driven representation (given in terms of a set of task-relevant variables (TRVs)) and a performant control policy that is a function of the TRVs.

Optimization and Control Robotics Systems and Control

PAC-Bayes Control: Learning Policies that Provably Generalize to Novel Environments

1 code implementation11 Jun 2018 Anirudha Majumdar, Alec Farid, Anoopkumar Sonar

The key technical idea behind our approach is to leverage tools from generalization theory in machine learning by exploiting a precise analogy (which we present in the form of a reduction) between generalization of control policies to novel environments and generalization of hypotheses in the supervised learning setting.

Risk-sensitive Inverse Reinforcement Learning via Semi- and Non-Parametric Methods

1 code implementation28 Nov 2017 Sumeet Singh, Jonathan Lacotte, Anirudha Majumdar, Marco Pavone

The literature on Inverse Reinforcement Learning (IRL) typically assumes that humans take actions in order to minimize the expected value of a cost function, i. e., that humans are risk neutral.

Decision Making reinforcement-learning +1

How Should a Robot Assess Risk? Towards an Axiomatic Theory of Risk in Robotics

no code implementations30 Oct 2017 Anirudha Majumdar, Marco Pavone

We discuss general representation theorems that precisely characterize the class of metrics that satisfy these axioms (referred to as distortion risk metrics), and provide instantiations that can be used in applications.

Decision Making

Response to "Counterexample to global convergence of DSOS and SDSOS hierarchies"

no code implementations9 Oct 2017 Amir Ali Ahmadi, Anirudha Majumdar

In a recent note [8], the author provides a counterexample to the global convergence of what his work refers to as "the DSOS and SDSOS hierarchies" for polynomial optimization problems (POPs) and purports that this refutes claims in our extended abstract [4] and slides in [3].

DSOS and SDSOS Optimization: More Tractable Alternatives to Sum of Squares and Semidefinite Optimization

no code implementations8 Jun 2017 Amir Ali Ahmadi, Anirudha Majumdar

The reliance of this technique on large-scale semidefinite programs however, has limited the scale of problems to which it can be applied.

valid

Funnel Libraries for Real-Time Robust Feedback Motion Planning

no code implementations15 Jan 2016 Anirudha Majumdar, Russ Tedrake

We consider the problem of generating motion plans for a robot that are guaranteed to succeed despite uncertainty in the environment, parametric model uncertainty, and disturbances.

Motion Planning

Cannot find the paper you are looking for? You can Submit a new open access paper.