Search Results for author: Bhavya Sukhija

Found 9 papers, 3 papers with code

Safe Exploration Using Bayesian World Models and Log-Barrier Optimization

no code implementations9 May 2024 Yarden As, Bhavya Sukhija, Andreas Krause

A major challenge in deploying reinforcement learning in online tasks is ensuring that safety is maintained throughout the learning process.

Information-based Transductive Active Learning

no code implementations13 Feb 2024 Jonas Hübotter, Bhavya Sukhija, Lenart Treven, Yarden As, Andreas Krause

We generalize active learning to address real-world settings where sampling is restricted to an accessible region of the domain, while prediction targets may lie outside this region.

Active Learning Bayesian Optimization +1

Data-Efficient Task Generalization via Probabilistic Model-based Meta Reinforcement Learning

no code implementations13 Nov 2023 Arjun Bhardwaj, Jonas Rothfuss, Bhavya Sukhija, Yarden As, Marco Hutter, Stelian Coros, Andreas Krause

We introduce PACOH-RL, a novel model-based Meta-Reinforcement Learning (Meta-RL) algorithm designed to efficiently adapt control policies to changing dynamics.

Meta-Learning Meta Reinforcement Learning +2

Tuning Legged Locomotion Controllers via Safe Bayesian Optimization

1 code implementation12 Jun 2023 Daniel Widmer, Dongho Kang, Bhavya Sukhija, Jonas Hübotter, Andreas Krause, Stelian Coros

This paper presents a data-driven strategy to streamline the deployment of model-based controllers in legged robotic hardware platforms.

Bayesian Optimization Efficient Exploration

Hallucinated Adversarial Control for Conservative Offline Policy Evaluation

1 code implementation2 Mar 2023 Jonas Rothfuss, Bhavya Sukhija, Tobias Birchler, Parnian Kassraie, Andreas Krause

We study the problem of conservative off-policy evaluation (COPE) where given an offline dataset of environment interactions, collected by other agents, we seek to obtain a (tight) lower bound on a policy's performance.

Continuous Control Off-policy evaluation +1

Gradient-Based Trajectory Optimization With Learned Dynamics

no code implementations9 Apr 2022 Bhavya Sukhija, Nathanael Köhler, Miguel Zamora, Simon Zimmermann, Sebastian Curi, Andreas Krause, Stelian Coros

In our hardware experiments, we demonstrate that our learned model can represent complex dynamics for both the Spot and Radio-controlled (RC) car, and gives good performance in combination with trajectory optimization methods.

GoSafeOpt: Scalable Safe Exploration for Global Optimization of Dynamical Systems

1 code implementation24 Jan 2022 Bhavya Sukhija, Matteo Turchetta, David Lindner, Andreas Krause, Sebastian Trimpe, Dominik Baumann

Learning optimal control policies directly on physical systems is challenging since even a single failure can lead to costly hardware damage.

Safe Exploration

Cannot find the paper you are looking for? You can Submit a new open access paper.