1 code implementation • 30 May 2023 • Melissa Mozifian, Tristan Sylvain, Dave Evans, Lili Meng
Attention-based sequential recommendation methods have shown promise in accurately capturing users' evolving interests from their past interactions.
no code implementations • 29 Sep 2021 • Melissa Mozifian, Dieter Fox, David Meger, Fabio Ramos, Animesh Garg
In this paper, we consider the problem of continuous control for various robot manipulation tasks with an explicit representation that promotes skill reuse while learning multiple tasks, related through the reward function.
no code implementations • 7 Dec 2020 • Sebastian Höfer, Kostas Bekris, Ankur Handa, Juan Camilo Gamboa, Florian Golemo, Melissa Mozifian, Chris Atkeson, Dieter Fox, Ken Goldberg, John Leonard, C. Karen Liu, Jan Peters, Shuran Song, Peter Welinder, Martha White
This report presents the debates, posters, and discussions of the Sim2Real workshop held in conjunction with the 2020 edition of the "Robotics: Science and System" conference.
1 code implementation • 3 Dec 2020 • Melissa Mozifian, Amy Zhang, Joelle Pineau, David Meger
The goal of this work is to address the recent success of domain randomization and data augmentation for the sim2real setting.
no code implementations • 2 Nov 2020 • Yuchen Wu, Melissa Mozifian, Florian Shkurti
Unlike the majority of existing methods that assume optimal demonstrations and incorporate the demonstration data as hard constraints on policy optimization, we instead incorporate demonstration data as advice in the form of a reward shaping potential trained as a generative model of states and actions.
no code implementations • 2 Jun 2019 • Melissa Mozifian, Juan Camilo Gamboa Higuera, David Meger, Gregory Dudek
We explore the use of gradient-based search methods to learn a domain randomization with the following properties: 1) The trained policy should be successful in environments sampled from the domain randomization distribution 2) The domain randomization distribution should be wide enough so that the experience similar to the target robot system is observed during training, while addressing the practicality of training finite capacity models.
4 code implementations • 6 Dec 2017 • Jason Ku, Melissa Mozifian, Jungwook Lee, Ali Harakeh, Steven Waslander
We present AVOD, an Aggregate View Object Detection network for autonomous driving scenarios.