Search Results for author: Siddhant Agarwal

Found 15 papers, 5 papers with code

Null Counterfactual Factor Interactions for Goal-Conditioned Reinforcement Learning

no code implementations6 May 2025 Caleb Chuck, Fan Feng, Carl Qi, Chang Shi, Siddhant Agarwal, Amy Zhang, Scott Niekum

To leverage this intuition, we introduce Hindsight Relabeling using Interactions (HInt), which combines interactions with hindsight relabeling to improve the sample efficiency of downstream RL.

counterfactual Object +2

RLZero: Direct Policy Inference from Language Without In-Domain Supervision

no code implementations7 Dec 2024 Harshit Sikchi, Siddhant Agarwal, Pranaya Jajoo, Samyak Parajuli, Caleb Chuck, Max Rudolph, Peter Stone, Amy Zhang, Scott Niekum

We further show that components of RLZero can be used to generate policies zero-shot from cross-embodied videos, such as those available on YouTube, even for complex embodiments like humanoids.

Imitation Learning Reinforcement Learning (RL)

Proto Successor Measure: Representing the Space of All Possible Solutions of Reinforcement Learning

no code implementations29 Nov 2024 Siddhant Agarwal, Harshit Sikchi, Peter Stone, Amy Zhang

We present \emph{Proto Successor Measure}: the basis set for all possible solutions of Reinforcement Learning in a dynamical system.

All reinforcement-learning +2

Accelerating the discovery of steady-states of planetary interior dynamics with machine learning

no code implementations30 Aug 2024 Siddhant Agarwal, Nicola Tosi, Christian Hüttig, David S. Greenberg, Ali Can Bekar

Consequently, achieving steady-state requires a large number of time steps due to the disparate time scales governing the stagnant and convecting regions.

Benchmarking

Aero-Nef: Neural Fields for Rapid Aircraft Aerodynamics Simulations

1 code implementation29 Jul 2024 Giovanni Catalani, Siddhant Agarwal, Xavier Bertrand, Frederic Tost, Michael Bauerheim, Joseph Morlier

This paper presents a methodology to learn surrogate models of steady state fluid dynamics simulations on meshed domains, based on Implicit Neural Representations (INRs).

Operator learning PDE Surrogate Modeling

Robot Air Hockey: A Manipulation Testbed for Robot Learning with Reinforcement Learning

no code implementations6 May 2024 Caleb Chuck, Carl Qi, Michael J. Munje, Shuozhe Li, Max Rudolph, Chang Shi, Siddhant Agarwal, Harshit Sikchi, Abhinav Peri, Sarthak Dayal, Evan Kuo, Kavan Mehta, Anthony Wang, Peter Stone, Amy Zhang, Scott Niekum

Reinforcement Learning is a promising tool for learning complex policies even in fast-moving and object-interactive domains where human teleoperation or hard-coded policies might fail.

Offline RL

$f$-Policy Gradients: A General Framework for Goal Conditioned RL using $f$-Divergences

no code implementations10 Oct 2023 Siddhant Agarwal, Ishan Durugkar, Peter Stone, Amy Zhang

We further introduce an entropy-regularized policy optimization objective, that we call $state$-MaxEnt RL (or $s$-MaxEnt RL) as a special case of our objective.

Efficient Exploration Policy Gradient Methods +1

Reinforcement Explanation Learning

no code implementations26 Nov 2021 Siddhant Agarwal, Owais Iqbal, Sree Aditya Buridi, Madda Manjusha, Abir Das

Black-box methods to generate saliency maps are particularly interesting due to the fact that they do not utilize the internals of the model to explain the decision.

image-classification Image Classification +3

Deep learning for surrogate modelling of 2D mantle convection

no code implementations23 Aug 2021 Siddhant Agarwal, Nicola Tosi, Pan Kessel, Doris Breuer, Grégoire Montavon

Using a dataset of 10, 525 two-dimensional simulations of the thermal evolution of the mantle of a Mars-like planet, we show that deep learning techniques can produce reliable parameterized surrogates (i. e. surrogates that predict state variables such as temperature based only on parameters) of the underlying partial differential equations.

Deep Learning

Poisoned classifiers are not only backdoored, they are fundamentally broken

1 code implementation18 Oct 2020 MingJie Sun, Siddhant Agarwal, J. Zico Kolter

Under this threat model, we propose a test-time, human-in-the-loop attack method to generate multiple effective alternative triggers without access to the initial backdoor and the training data.

Cannot find the paper you are looking for? You can Submit a new open access paper.