Search Results for author: Siddharth Mysore

Found 6 papers, 1 papers with code

The SwaNNFlight System: On-the-Fly Sim-to-Real Adaptation via Anchored Learning

1 code implementation17 Jan 2023 Bassel El Mabsout, Shahin Roozkhosh, Siddharth Mysore, Kate Saenko, Renato Mancuso

And (iii) anchor critics to help stabilize the fine-tuning of agents during sim-to-real transfer, online learning from real data while retaining behavior optimized in simulation.

Reinforcement Learning (RL)

Multi-Critic Actor Learning: Teaching RL Policies to Act with Style

no code implementations ICLR 2022 Siddharth Mysore, George Cheng, Yunqi Zhao, Kate Saenko, Meng Wu

MultiCriticAL is tested in the context of multi-style learning, a special case of MTRL where agents are trained to behave with different distinct behavior styles, and yields up to 45% performance gains over the single-critic baselines and even successfully learns behavior styles in cases where single-critic approaches may simply fail to learn.

Honey, I Shrunk The Actor: A Case Study on Preserving Performance with Smaller Actors in Actor-Critic RL

no code implementations23 Feb 2021 Siddharth Mysore, Bassel Mabsout, Renato Mancuso, Kate Saenko

Actors and critics in actor-critic reinforcement learning algorithms are functionally separate, yet they often use the same network architectures.

Reinforcement Learning (RL)

Regularizing Action Policies for Smooth Control with Reinforcement Learning

no code implementations11 Dec 2020 Siddharth Mysore, Bassel Mabsout, Renato Mancuso, Kate Saenko

A critical problem with the practical utility of controllers trained with deep Reinforcement Learning (RL) is the notable lack of smoothness in the actions learned by the RL policies.

reinforcement-learning Reinforcement Learning (RL)

Exploiting Environmental Variation to Improve Policy Robustness in Reinforcement Learning

no code implementations27 Sep 2018 Siddharth Mysore, Robert Platt, Kate Saenko

We propose a novel method to exploit this observation to develop robust actor policies, by automatically developing a sampling curriculum over environment settings to use in training.

reinforcement-learning Reinforcement Learning (RL)

Cannot find the paper you are looking for? You can Submit a new open access paper.