no code implementations • 25 Oct 2024 • Ondrej Biza, Thomas Weng, Lingfeng Sun, Karl Schmeckpeper, Tarik Kelestemur, Yecheng Jason Ma, Robert Platt, Jan-Willem van de Meent, Lawson L. S. Wong
We find that GCR leads to a more-sample efficient RL, enabling model-free RL to solve about twice as many tasks as our baseline reward learning methods.
no code implementations • 20 Jun 2024 • Arsh Tangri, Ondrej Biza, Dian Wang, David Klee, Owen Howell, Robert Platt
Sample efficiency is critical when applying learning-based methods to robotic manipulation due to the high cost of collecting expert demonstrations and the challenges of on-robot policy learning through online Reinforcement Learning (RL).
no code implementations • 17 Jun 2024 • Haojie Huang, Karl Schmeckpeper, Dian Wang, Ondrej Biza, Yaoyao Qian, Haotian Liu, Mingxi Jia, Robert Platt, Robin Walters
Humans can imagine goal states during planning and perform actions to match those goals.
no code implementations • 7 Jul 2023 • Owen Howell, David Klee, Ondrej Biza, Linfeng Zhao, Robin Walters
We show that an algorithm that learns a three-dimensional representation of the world from two dimensional images must satisfy certain geometric consistency properties which we formulate as SO(2)-equivariance constraints.
no code implementations • 21 Jun 2023 • Ondrej Biza, Skye Thompson, Kishore Reddy Pagidi, Abhinav Kumar, Elise van der Pol, Robin Walters, Thomas Kipf, Jan-Willem van de Meent, Lawson L. S. Wong, Robert Platt
We propose a new method, Interaction Warping, for learning SE(3) robotic manipulation policies from a single demonstration.
1 code implementation • 10 Jun 2023 • Xupeng Zhu, Dian Wang, Guanang Su, Ondrej Biza, Robin Walters, Robert Platt
Real-world grasp detection is challenging due to the stochasticity in grasp dynamics and the noise in hardware.
1 code implementation • 27 Feb 2023 • David M. Klee, Ondrej Biza, Robert Platt, Robin Walters
Predicting the pose of objects from a single image is an important but difficult computer vision problem.
2 code implementations • 9 Feb 2023 • Ondrej Biza, Sjoerd van Steenkiste, Mehdi S. M. Sajjadi, Gamaleldin F. Elsayed, Aravindh Mahendran, Thomas Kipf
Automatically discovering composable abstractions from raw perceptual data is a long-standing challenge in machine learning.
no code implementations • 18 Jul 2022 • David Klee, Ondrej Biza, Robert Platt, Robin Walters
In this paper, we propose a novel architecture based on icosahedral group convolutions that reasons in $\mathrm{SO(3)}$ by learning a projection of the input image onto an icosahedron.
1 code implementation • 27 Apr 2022 • Ondrej Biza, Robert Platt, Jan-Willem van de Meent, Lawson L. S. Wong, Thomas Kipf
We study the problem of binding actions to objects in object-factored world models using action-attention mechanisms.
1 code implementation • 24 Apr 2022 • Jung Yeon Park, Ondrej Biza, Linfeng Zhao, Jan Willem van de Meent, Robin Walters
Incorporating symmetries can lead to highly data-efficient and generalizable models by defining equivalence classes of data samples related by transformations.
1 code implementation • 10 Feb 2022 • Ondrej Biza, Thomas Kipf, David Klee, Robert Platt, Jan-Willem van de Meent, Lawson L. S. Wong
In this paper, we learn to generalize over robotic pick-and-place tasks using object-factored world models, which combat the combinatorial explosion by ensuring that predictions are equivariant to permutations of objects.
no code implementations • 29 Sep 2021 • Jung Yeon Park, Ondrej Biza, Linfeng Zhao, Jan-Willem van de Meent, Robin Walters
In this paper, we use equivariant transition models as an inductive bias to learn symmetric latent representations in a self-supervised manner.
no code implementations • 29 Sep 2021 • Xupeng Zhu, Dian Wang, Ondrej Biza, Robert Platt
Visual grasp detection is a key problem in robotics where the agent must learn to model the grasp function, a mapping from an image of a scene onto a set of feasible grasp poses.
1 code implementation • 24 Jul 2021 • Ondrej Biza, Elise van der Pol, Thomas Kipf
World models trained by contrastive learning are a compelling alternative to autoencoder-based world models, which learn by reconstructing pixel states.
1 code implementation • 11 Jan 2021 • Ondrej Biza, Dian Wang, Robert Platt, Jan-Willem van de Meent, Lawson L. S. Wong
This paper proposes an alternative approach where the solutions of previously solved tasks are used to produce an action prior that can facilitate exploration in future tasks.
1 code implementation • pproximateinference AABI Symposium 2021 • Ondrej Biza, Robert Platt, Jan-Willem van de Meent, Lawson L. S. Wong
In this work, we propose an information bottleneck method for learning approximate bisimulations, a type of state abstraction.
1 code implementation • 30 Nov 2018 • Ondrej Biza, Robert Platt
Abstraction of Markov Decision Processes is a useful tool for solving complex problems, as it can ignore unimportant aspects of an environment, simplifying the process of learning an optimal policy.