no code implementations • 21 Feb 2024 • Chenhao Li, Elijah Stanger-Jones, Steve Heim, Sangbae Kim
Motion trajectories offer reliable references for physics-based motion learning but suffer from sparsity, particularly in regions that lack sufficient data coverage.
1 code implementation • 13 Feb 2024 • Jenny Zhang, Steve Heim, Se Hwan Jeon, Sangbae Kim
We present a minimal phase oscillator model for learning quadrupedal locomotion.
1 code implementation • 19 Jul 2023 • Se Hwan Jeon, Steve Heim, Charles Khazoom, Sangbae Kim
Although several studies have explored the use of potential based reward shaping to accelerate learning convergence, most have been limited to grid-worlds and low-dimensional systems, and RL in robotics has predominantly relied on standard forms of reward shaping.
1 code implementation • 25 May 2021 • Pierre-François Massiani, Steve Heim, Friedrich Solowjow, Sebastian Trimpe
Although it is often not possible to compute the minimum required penalty, we reveal clear structure of how the penalty, rewards, discount factor, and dynamics interact.
1 code implementation • 17 May 2021 • Pierre-François Massiani, Steve Heim, Sebastian Trimpe
In particular, we discuss the family of constraints that enforce safety in the context of a nominal control policy, and expose that these constraints do not need to be accurate everywhere.
1 code implementation • 7 Oct 2019 • Steve Heim, Alexander von Rohr, Sebastian Trimpe, Alexander Badri-Spröwitz
While safety can only be guaranteed after learning the safety measure, we show that failures can already be greatly reduced by using the estimated measure during learning.
1 code implementation • 30 Sep 2019 • Felix Grimminger, Avadesh Meduri, Majid Khadiv, Julian Viereck, Manuel Wüthrich, Maximilien Naveau, Vincent Berenz, Steve Heim, Felix Widmaier, Thomas Flayols, Jonathan Fiene, Alexander Badri-Spröwitz, Ludovic Righetti
Finally, to demonstrate the capabilities of the quadruped, we present a novel controller which combines feedforward contact forces computed from a kino-dynamic optimizer with impedance control of the center of mass and base orientation.
Robotics
1 code implementation • 21 Jun 2018 • Steve Heim, Alexander Spröwitz
Most studies of simple walking and running models have focused on the basins of attraction of passive limit-cycles and the notion of self-stability.
Robotics
no code implementations • 18 Jun 2018 • Steve Heim, Alexander Spröwitz
Despite impressive results using reinforcement learning to solve complex problems from scratch, in robotics this has still been largely limited to model-based learning with very informative reward functions.