Search Results for author: JB Lanier

Found 6 papers, 0 papers with code

Adapting World Models with Latent-State Dynamics Residuals

no code implementations3 Apr 2025 JB Lanier, KyungMin Kim, Armin Karamzade, Yifei Liu, Ankita Sinha, Kat He, Davide Corsi, Roy Fox

Simulation-to-reality reinforcement learning (RL) faces the critical challenge of reconciling discrepancies between simulated and real-world dynamics, which can severely degrade agent performance.

MuJoCo Reinforcement Learning (RL)

Make the Pertinent Salient: Task-Relevant Reconstruction for Visual Control with Distractions

no code implementations13 Oct 2024 KyungMin Kim, JB Lanier, Pierre Baldi, Charless Fowlkes, Roy Fox

Training in the presence of visual distractions is particularly difficult due to the high variation they introduce to representation learning.

Model-based Reinforcement Learning Representation Learning

Realizable Continuous-Space Shields for Safe Reinforcement Learning

no code implementations2 Oct 2024 KyungMin Kim, Davide Corsi, Andoni Rodriguez, JB Lanier, Benjami Parellada, Pierre Baldi, Cesar Sanchez, Roy Fox

For real-world robotic domains, it is essential to define safety specifications over continuous state and action spaces to accurately account for system dynamics and compute new actions that minimally deviate from the agent's original decision.

Deep Reinforcement Learning reinforcement-learning +1

Selective Perception: Optimizing State Descriptions with Reinforcement Learning for Language Model Actors

no code implementations21 Jul 2023 Kolby Nottingham, Yasaman Razeghi, KyungMin Kim, JB Lanier, Pierre Baldi, Roy Fox, Sameer Singh

Large language models (LLMs) are being applied as actors for sequential decision making tasks in domains such as robotics and games, utilizing their general world knowledge and planning abilities.

Decision Making Language Modeling +4

Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments

no code implementations19 Jul 2022 JB Lanier, Stephen Mcaleer, Pierre Baldi, Roy Fox

In this paper, we propose Feasible Adversarial Robust RL (FARR), a novel problem formulation and objective for automatically determining the set of environment parameter values over which to be robust.

MuJoCo reinforcement-learning +2

Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games

no code implementations13 Jul 2022 Stephen Mcaleer, JB Lanier, Kevin Wang, Pierre Baldi, Roy Fox, Tuomas Sandholm

Instead of adding only deterministic best responses to the opponent's least exploitable population mixture, SP-PSRO also learns an approximately optimal stochastic policy and adds it to the population as well.

Deep Reinforcement Learning Reinforcement Learning (RL)

Cannot find the paper you are looking for? You can Submit a new open access paper.