no code implementations • 10 Jan 2025 • Akhil S Anand, Shambhuraj Sawant, Dirk Reinhardt, Sebastien Gros
This is primarily because AI models are typically constructed to best fit the data, and hence to predict the most likely future rather than to enable high-performance decision-making.
no code implementations • 23 Jul 2024 • Dirk Reinhardt, Akhil S. Anand, Shambhuraj Sawant, Sebastien Gros
However, computing the optimal policy of an MDP can be very difficult due to the curse of dimensionality present in solving the underlying Bellman equations.
no code implementations • 4 Jan 2023 • Shambhuraj Sawant, Akhil S Anand, Dirk Reinhardt, Sebastien Gros
The state-of-the-art learning methods use RL to improve the performance of parameterized MPC schemes.
no code implementations • 18 May 2022 • Shambhuraj Sawant, Sebastien Gros
We propose simple tools to promote structures in the QP, pushing it to resemble a linear MPC scheme.
no code implementations • ICLR 2021 • Cristina Pinneri, Shambhuraj Sawant, Sebastian Blaes, Georg Martius
Solving high-dimensional, continuous robotic tasks is a challenging optimization problem.
1 code implementation • 14 Aug 2020 • Cristina Pinneri, Shambhuraj Sawant, Sebastian Blaes, Jan Achterhold, Joerg Stueckler, Michal Rolinek, Georg Martius
However, their sampling inefficiency prevents them from being used for real-time planning and control.
Model-based Reinforcement Learning reinforcement-learning +1