no code implementations • 6 Dec 2023 • Jimmy Li, Igor Kozlov, Di wu, Xue Liu, Gregory Dudek
This coincides with a rapid increase in the number of cell sites worldwide, driven largely by dramatic growth in cellular network traffic.
no code implementations • 4 Dec 2023 • Oliver Limoyo, Abhisek Konar, Trevor Ablett, Jonathan Kelly, Francois R. Hogan, Gregory Dudek
With LPP, we approach the learning of robotic object placement policies by reversing the grasping process and exploiting the inherent symmetry of the pick and place problems.
no code implementations • 30 Nov 2023 • Jean-François Tremblay, David Meger, Francois Hogan, Gregory Dudek
These robots will need to sense these properties through interaction prior to performing downstream tasks with the objects.
no code implementations • 15 Nov 2023 • Wei-Di Chang, Francois Hogan, David Meger, Gregory Dudek
In this paper we leverage self-supervised vision transformer models and their emergent semantic abilities to improve the generalization abilities of imitation learning policies.
no code implementations • 2 Nov 2023 • Trevor Ablett, Oliver Limoyo, Adam Sigal, Affan Jilani, Jonathan Kelly, Kaleem Siddiqi, Francois Hogan, Gregory Dudek
Optical tactile sensors have emerged as an effective means to acquire dense contact information during robotic manipulation.
no code implementations • 5 Oct 2023 • Junliang Luo, Yi Tian Xu, Di wu, Michael Jenkin, Xue Liu, Gregory Dudek
In this work, we propose an approximate dynamic programming (ADP)-based method coupled with online optimization to switch on/off the cells of base stations to reduce network power consumption while maintaining adequate Quality of Service (QoS) metrics.
no code implementations • 2 Oct 2023 • Wei-Di Chang, Scott Fujimoto, David Meger, Gregory Dudek
Imitation Learning from Observation (ILfO) is a setting in which a learner tries to imitate the behavior of an expert, using only observational data and without the direct guidance of demonstrated actions.
no code implementations • 21 Jul 2023 • Dmitriy Rivkin, Nikhil Kakodkar, Francois Hogan, Bobak H. Baghi, Gregory Dudek
This work explores the capacity of large language models (LLMs) to address problems at the intersection of spatial planning and natural language interfaces for navigation. Our focus is on following relatively complex instructions that are more akin to natural conversation than traditional explicit procedural directives seen in robotics.
no code implementations • 18 May 2023 • Andrew Holliday, Gregory Dudek
In this work we explore the combination of metaheuristics and learned neural network solvers for combinatorial optimization.
no code implementations • 22 Mar 2023 • Abhisek Konar, Di wu, Yi Tian Xu, Seowoo Jang, Steve Liu, Gregory Dudek
Engineering this reward function is challenging, because it involves the need for expert knowledge and there lacks a general consensus on the form of an optimal reward function.
no code implementations • 22 Mar 2023 • Yi Tian Xu, Jimmy Li, Di wu, Michael Jenkin, Seowoo Jang, Xue Liu, Gregory Dudek
When deploying to an unknown traffic scenario, we select a policy from the policy bank based on the similarity between the previous-day traffic of the current scenario and the traffic observed during training.
no code implementations • 14 Mar 2023 • Jikun Kang, Di wu, Ju Wang, Ekram Hossain, Xue Liu, Gregory Dudek
In cellular networks, User Equipment (UE) handoff from one Base Station (BS) to another, giving rise to the load balancing problem among the BSs.
no code implementations • 3 Feb 2023 • Igor Kozlov, Dmitriy Rivkin, Wei-Di Chang, Di wu, Xue Liu, Gregory Dudek
Such networks undergo frequent and often heterogeneous changes caused by network operators, who are seeking to tune their system parameters for optimal performance.
no code implementations • 28 Nov 2022 • Sahand Rezaei-Shoshtari, Charlotte Morissette, Francois Robert Hogan, Gregory Dudek, David Meger
In this paper, hypernetworks are trained to generate behaviors across a range of unseen task conditions, via a novel TD-based training objective and data from a set of near-optimal RL solutions for training tasks.
no code implementations • 1 Oct 2022 • Fengdi Che, Xiru Zhu, Doina Precup, David Meger, Gregory Dudek
Guided exploration with expert demonstrations improves data efficiency for reinforcement learning, but current algorithms often overuse expert information.
no code implementations • 19 May 2022 • Wei-Di Chang, Juan Camilo Gamboa Higuera, Scott Fujimoto, David Meger, Gregory Dudek
We present an algorithm for Inverse Reinforcement Learning (IRL) from expert state observations only.
no code implementations • 9 Dec 2021 • Stefan Wapnick, Travis Manderson, David Meger, Gregory Dudek
We present a reward-predictive, model-based deep learning method featuring trajectory-constrained visual attention for local planning in visual navigation tasks.
no code implementations • 27 Nov 2021 • Morteza Rezanejad, Babak Samari, Elham Karimi, Ioannis Rekleitis, Gregory Dudek, Kaleem Siddiqi
In topology matching between two given maps and their AOF skeletons, we first find correspondences between points on the AOF skeletons of two different environments.
no code implementations • 29 Sep 2021 • Jikun Kang, Xi Chen, Ju Wang, Chengming Hu, Xue Liu, Gregory Dudek
Results show that, compared with SOTA model-free methods, our method can improve the data efficiency and system performance by up to 75% and 10%, respectively.
no code implementations • 29 Sep 2021 • Di wu, Tianyu Li, David Meger, Michael Jenkin, Xue Liu, Gregory Dudek
Unfortunately, most online reinforcement learning algorithms require a large number of interactions with the environment to learn a reliable control policy.
no code implementations • 18 Jun 2021 • Bobak H. Baghi, Gregory Dudek
In this paper, we present an algorithm to efficiently learn socially-compliant navigation policies from observations of human trajectories.
no code implementations • 20 May 2021 • Sandeep Manjanna, M. Ani Hsieh, Gregory Dudek
This paper presents a distributed scalable multi-robot planning algorithm for informed sampling of quasistatic spatial fields.
1 code implementation • 12 Jan 2021 • Sahand Rezaei-Shoshtari, Francois Robert Hogan, Michael Jenkin, David Meger, Gregory Dudek
Predicting the future interaction of objects when they come into contact with their environment is key for autonomous agents to take intelligent and anticipatory actions.
no code implementations • 9 Apr 2020 • Travis Manderson, Stefan Wapnick, David Meger, Gregory Dudek
We present a method for learning to drive on smooth terrain while simultaneously avoiding collisions in challenging off-road and unstructured outdoor environments using only visual inputs.
1 code implementation • 22 Mar 2020 • Karim Koreitem, Florian Shkurti, Travis Manderson, Wei-Di Chang, Juan Camilo Gamboa Higuera, Gregory Dudek
In this paper we propose a method that enables informed visual navigation via a learned visual similarity operator that guides the robot's visual search towards parts of the scene that look like an exemplar image, which is given by the user as a high-level specification for data collection.
1 code implementation • 11 Mar 2020 • Bharat Joshi, Md Modasshir, Travis Manderson, Hunter Damron, Marios Xanthidis, Alberto Quattrini Li, Ioannis Rekleitis, Gregory Dudek
In this paper, we propose a real-time deep learning approach for determining the 6D relative pose of Autonomous Underwater Vehicles (AUV) from a single image.
no code implementations • 2 Dec 2019 • Xiru Zhu, Fengdi Che, Tianzi Yang, Tzuyang Yu, David Meger, Gregory Dudek
This is because the task of evaluating the quality of a generated image differs from deciding if an image is real or fake.
no code implementations • 15 Jun 2019 • Sandeep Manjanna, Herke van Hoof, Gregory Dudek
In this paper, we present a search algorithm that generates efficient trajectories that optimize the rate at which probability mass is covered by a searcher.
no code implementations • 2 Jun 2019 • Melissa Mozifian, Juan Camilo Gamboa Higuera, David Meger, Gregory Dudek
We explore the use of gradient-based search methods to learn a domain randomization with the following properties: 1) The trained policy should be successful in environments sampled from the domain randomization distribution 2) The domain randomization distribution should be wide enough so that the experience similar to the target robot system is observed during training, while addressing the practicality of training finite capacity models.
1 code implementation • 25 Nov 2018 • Johanna Hansen, Kyle Kastner, Aaron Courville, Gregory Dudek
We demonstrate the use of conditional autoregressive generative models (van den Oord et al., 2016a) over a discrete latent space (van den Oord et al., 2017b) for forward planning with MCTS.
3 code implementations • 6 Mar 2018 • Juan Camilo Gamboa Higuera, David Meger, Gregory Dudek
Finally, we assess the performance of the algorithm for learning motor controllers for a six legged autonomous underwater vehicle.
Model-based Reinforcement Learning
reinforcement-learning
+2
no code implementations • 28 Oct 2017 • Andrew Holliday, Gregory Dudek
Visual localization under large changes in scale is an important capability in many robotic mapping applications, such as localizing at low altitudes in maps built at high altitudes, or performing loop closure over long distances.
1 code implementation • 25 Sep 2017 • Florian Shkurti, Wei-Di Chang, Peter Henderson, Md Jahidul Islam, Juan Camilo Gamboa Higuera, Jimmy Li, Travis Manderson, Anqi Xu, Gregory Dudek, Junaed Sattar
We present a robust multi-robot convoying approach that relies on visual detection of the leading agent, thus enabling target following in unstructured 3-D environments.
1 code implementation • 14 Aug 2017 • Peter Henderson, Wei-Di Chang, Florian Shkurti, Johanna Hansen, David Meger, Gregory Dudek
As demand drives systems to generalize to various domains and problems, the study of multitask, transfer and lifelong learning has become an increasingly important pursuit.
no code implementations • 26 Sep 2015 • Yogesh Girdhar, Gregory Dudek
This paper presents a novel approach to modeling curiosity in a mobile robot, which is useful for monitoring and adaptive data collection tasks, especially in the context of long term autonomous missions where pre-programmed missions are likely to have limited utility.
no code implementations • 10 Sep 2015 • Yogesh Girdhar, Gregory Dudek
Topic modeling of streaming sensor data can be used for high level perception of the environment by a mobile robot.