Search Results for author: Alejandro Escontrela

Found 5 papers, 2 papers with code

Learning a Diffusion Model Policy from Rewards via Q-Score Matching

1 code implementation18 Dec 2023 Michael Psenka, Alejandro Escontrela, Pieter Abbeel, Yi Ma

However, previous works fail to exploit the score-based structure of diffusion models, and instead utilize a simple behavior cloning term to train the actor, limiting their ability in the actor-critic setting.

Denoising reinforcement-learning +1

DayDreamer: World Models for Physical Robot Learning

1 code implementation28 Jun 2022 Philipp Wu, Alejandro Escontrela, Danijar Hafner, Ken Goldberg, Pieter Abbeel

Learning a world model to predict the outcomes of potential actions enables planning in imagination, reducing the amount of trial and error needed in the real environment.

Navigate reinforcement-learning +1

Adversarial Motion Priors Make Good Substitutes for Complex Reward Functions

no code implementations28 Mar 2022 Alejandro Escontrela, Xue Bin Peng, Wenhao Yu, Tingnan Zhang, Atil Iscen, Ken Goldberg, Pieter Abbeel

We also demonstrate that an effective style reward can be learned from a few seconds of motion capture data gathered from a German Shepherd and leads to energy-efficient locomotion strategies with natural gait transitions.

Cannot find the paper you are looking for? You can Submit a new open access paper.