1 code implementation • 8 Apr 2024 • Andrew Holliday, Ahmed El-Geneidy, Gregory Dudek
The design of these low-level heuristics has a major impact on the quality of the result.
no code implementations • 2 Apr 2024 • Faraz Lotfi, Farnoosh Faraji, Nikhil Kakodkar, Travis Manderson, David Meger, Gregory Dudek
This paper explores leveraging large language models for map-free off-road navigation using generative AI, reducing the need for traditional data collection and annotation.
no code implementations • 27 Feb 2024 • Andrew Holliday, Gregory Dudek
Planning a public transit network is a challenging optimization problem, but essential in order to realize the benefits of autonomous buses.
1 code implementation • 29 Jan 2024 • Faraz Lotfi, Khalil Virji, Nicholas Dudek, Gregory Dudek
We use established methods for vision-based tracking and introduce a centralized DQN controller.
no code implementations • 19 Jan 2024 • Oliver Limoyo, Jimmy Li, Dmitriy Rivkin, Jonathan Kelly, Gregory Dudek
We leverage a visual language model (VLM) and an object detector to characterize the reference images via textual descriptions and then use a large language model (LLM) to retrieve relevant reference images based on a user's language query through text-based reasoning.
no code implementations • 16 Jan 2024 • Junliang Luo, Tianyu Li, Di wu, Michael Jenkin, Steve Liu, Gregory Dudek
Large language models (LLMs), including ChatGPT, Bard, and Llama, have achieved remarkable successes over the last two years in a range of different applications.
no code implementations • 26 Dec 2023 • Saria Al Laham, Bobak H. Baghi, Pierre-Yves Lajoie, Amal Feriani, Sachini Herath, Steve Liu, Gregory Dudek
We make use of the channel impulse response (CIR) measurements from the UWB sensors to estimate the human state - comprised of location and activity - in a given area.
no code implementations • 6 Dec 2023 • Jimmy Li, Igor Kozlov, Di wu, Xue Liu, Gregory Dudek
This coincides with a rapid increase in the number of cell sites worldwide, driven largely by dramatic growth in cellular network traffic.
no code implementations • 4 Dec 2023 • Oliver Limoyo, Abhisek Konar, Trevor Ablett, Jonathan Kelly, Francois R. Hogan, Gregory Dudek
By doing so, the policy can generalize to object placement scenarios outside of the training environment without privileged information (e. g., placing a plate picked up from a table).
no code implementations • 30 Nov 2023 • Jean-François Tremblay, David Meger, Francois Hogan, Gregory Dudek
These robots will need to sense these properties through interaction prior to performing downstream tasks with the objects.
no code implementations • 15 Nov 2023 • Wei-Di Chang, Francois Hogan, David Meger, Gregory Dudek
In this paper we leverage self-supervised vision transformer models and their emergent semantic abilities to improve the generalization abilities of imitation learning policies.
1 code implementation • 2 Nov 2023 • Trevor Ablett, Oliver Limoyo, Adam Sigal, Affan Jilani, Jonathan Kelly, Kaleem Siddiqi, Francois Hogan, Gregory Dudek
Our results show that the inclusion of force matching raises average policy success rates by 62. 5%, visuotactile mode switching by 30. 3%, and visuotactile data as a policy input by 42. 5%, emphasizing the value of see-through tactile sensing for IL, both for data collection to allow force matching, and for policy execution to enable accurate task feedback.
no code implementations • 5 Oct 2023 • Junliang Luo, Yi Tian Xu, Di wu, Michael Jenkin, Xue Liu, Gregory Dudek
In this work, we propose an approximate dynamic programming (ADP)-based method coupled with online optimization to switch on/off the cells of base stations to reduce network power consumption while maintaining adequate Quality of Service (QoS) metrics.
no code implementations • 2 Oct 2023 • Wei-Di Chang, Scott Fujimoto, David Meger, Gregory Dudek
We demonstrate the effectiveness of this simple approach on a variety of continuous control tasks and find that it surpasses the state of the art in the IlfO setting, achieving expert-level performance across a range of evaluation domains even when observing only a single expert trajectory without actions.
no code implementations • 21 Jul 2023 • Dmitriy Rivkin, Nikhil Kakodkar, Francois Hogan, Bobak H. Baghi, Gregory Dudek
This work explores the capacity of large language models (LLMs) to address problems at the intersection of spatial planning and natural language interfaces for navigation.
no code implementations • 18 May 2023 • Andrew Holliday, Gregory Dudek
In this work we explore the combination of metaheuristics and learned neural network solvers for combinatorial optimization.
no code implementations • 22 Mar 2023 • Abhisek Konar, Di wu, Yi Tian Xu, Seowoo Jang, Steve Liu, Gregory Dudek
Engineering this reward function is challenging, because it involves the need for expert knowledge and there lacks a general consensus on the form of an optimal reward function.
no code implementations • 22 Mar 2023 • Yi Tian Xu, Jimmy Li, Di wu, Michael Jenkin, Seowoo Jang, Xue Liu, Gregory Dudek
When deploying to an unknown traffic scenario, we select a policy from the policy bank based on the similarity between the previous-day traffic of the current scenario and the traffic observed during training.
no code implementations • 14 Mar 2023 • Jikun Kang, Di wu, Ju Wang, Ekram Hossain, Xue Liu, Gregory Dudek
In cellular networks, User Equipment (UE) handoff from one Base Station (BS) to another, giving rise to the load balancing problem among the BSs.
no code implementations • 3 Feb 2023 • Igor Kozlov, Dmitriy Rivkin, Wei-Di Chang, Di wu, Xue Liu, Gregory Dudek
Such networks undergo frequent and often heterogeneous changes caused by network operators, who are seeking to tune their system parameters for optimal performance.
no code implementations • 28 Nov 2022 • Sahand Rezaei-Shoshtari, Charlotte Morissette, Francois Robert Hogan, Gregory Dudek, David Meger
In this paper, hypernetworks are trained to generate behaviors across a range of unseen task conditions, via a novel TD-based training objective and data from a set of near-optimal RL solutions for training tasks.
no code implementations • 1 Oct 2022 • Fengdi Che, Xiru Zhu, Doina Precup, David Meger, Gregory Dudek
Guided exploration with expert demonstrations improves data efficiency for reinforcement learning, but current algorithms often overuse expert information.
no code implementations • 19 May 2022 • Wei-Di Chang, Juan Camilo Gamboa Higuera, Scott Fujimoto, David Meger, Gregory Dudek
We present an algorithm for Inverse Reinforcement Learning (IRL) from expert state observations only.
no code implementations • 9 Dec 2021 • Stefan Wapnick, Travis Manderson, David Meger, Gregory Dudek
We present a reward-predictive, model-based deep learning method featuring trajectory-constrained visual attention for local planning in visual navigation tasks.
no code implementations • 27 Nov 2021 • Morteza Rezanejad, Babak Samari, Elham Karimi, Ioannis Rekleitis, Gregory Dudek, Kaleem Siddiqi
In topology matching between two given maps and their AOF skeletons, we first find correspondences between points on the AOF skeletons of two different environments.
no code implementations • 29 Sep 2021 • Di wu, Tianyu Li, David Meger, Michael Jenkin, Xue Liu, Gregory Dudek
Unfortunately, most online reinforcement learning algorithms require a large number of interactions with the environment to learn a reliable control policy.
no code implementations • 29 Sep 2021 • Jikun Kang, Xi Chen, Ju Wang, Chengming Hu, Xue Liu, Gregory Dudek
Results show that, compared with SOTA model-free methods, our method can improve the data efficiency and system performance by up to 75% and 10%, respectively.
no code implementations • 18 Jun 2021 • Bobak H. Baghi, Gregory Dudek
In this paper, we present an algorithm to efficiently learn socially-compliant navigation policies from observations of human trajectories.
no code implementations • 20 May 2021 • Sandeep Manjanna, M. Ani Hsieh, Gregory Dudek
This paper presents a distributed scalable multi-robot planning algorithm for informed sampling of quasistatic spatial fields.
1 code implementation • 12 Jan 2021 • Sahand Rezaei-Shoshtari, Francois Robert Hogan, Michael Jenkin, David Meger, Gregory Dudek
Predicting the future interaction of objects when they come into contact with their environment is key for autonomous agents to take intelligent and anticipatory actions.
no code implementations • 9 Apr 2020 • Travis Manderson, Stefan Wapnick, David Meger, Gregory Dudek
We present a method for learning to drive on smooth terrain while simultaneously avoiding collisions in challenging off-road and unstructured outdoor environments using only visual inputs.
1 code implementation • 22 Mar 2020 • Karim Koreitem, Florian Shkurti, Travis Manderson, Wei-Di Chang, Juan Camilo Gamboa Higuera, Gregory Dudek
In this paper we propose a method that enables informed visual navigation via a learned visual similarity operator that guides the robot's visual search towards parts of the scene that look like an exemplar image, which is given by the user as a high-level specification for data collection.
1 code implementation • 11 Mar 2020 • Bharat Joshi, Md Modasshir, Travis Manderson, Hunter Damron, Marios Xanthidis, Alberto Quattrini Li, Ioannis Rekleitis, Gregory Dudek
In this paper, we propose a real-time deep learning approach for determining the 6D relative pose of Autonomous Underwater Vehicles (AUV) from a single image.
no code implementations • 2 Dec 2019 • Xiru Zhu, Fengdi Che, Tianzi Yang, Tzuyang Yu, David Meger, Gregory Dudek
This is because the task of evaluating the quality of a generated image differs from deciding if an image is real or fake.
no code implementations • 15 Jun 2019 • Sandeep Manjanna, Herke van Hoof, Gregory Dudek
In this paper, we present a search algorithm that generates efficient trajectories that optimize the rate at which probability mass is covered by a searcher.
no code implementations • 2 Jun 2019 • Melissa Mozifian, Juan Camilo Gamboa Higuera, David Meger, Gregory Dudek
We explore the use of gradient-based search methods to learn a domain randomization with the following properties: 1) The trained policy should be successful in environments sampled from the domain randomization distribution 2) The domain randomization distribution should be wide enough so that the experience similar to the target robot system is observed during training, while addressing the practicality of training finite capacity models.
1 code implementation • 25 Nov 2018 • Johanna Hansen, Kyle Kastner, Aaron Courville, Gregory Dudek
We demonstrate the use of conditional autoregressive generative models (van den Oord et al., 2016a) over a discrete latent space (van den Oord et al., 2017b) for forward planning with MCTS.
3 code implementations • 6 Mar 2018 • Juan Camilo Gamboa Higuera, David Meger, Gregory Dudek
Finally, we assess the performance of the algorithm for learning motor controllers for a six legged autonomous underwater vehicle.
Model-based Reinforcement Learning reinforcement-learning +2
no code implementations • 28 Oct 2017 • Andrew Holliday, Gregory Dudek
Visual localization under large changes in scale is an important capability in many robotic mapping applications, such as localizing at low altitudes in maps built at high altitudes, or performing loop closure over long distances.
1 code implementation • 25 Sep 2017 • Florian Shkurti, Wei-Di Chang, Peter Henderson, Md Jahidul Islam, Juan Camilo Gamboa Higuera, Jimmy Li, Travis Manderson, Anqi Xu, Gregory Dudek, Junaed Sattar
We present a robust multi-robot convoying approach that relies on visual detection of the leading agent, thus enabling target following in unstructured 3-D environments.
1 code implementation • 14 Aug 2017 • Peter Henderson, Wei-Di Chang, Florian Shkurti, Johanna Hansen, David Meger, Gregory Dudek
As demand drives systems to generalize to various domains and problems, the study of multitask, transfer and lifelong learning has become an increasingly important pursuit.
no code implementations • 26 Sep 2015 • Yogesh Girdhar, Gregory Dudek
This paper presents a novel approach to modeling curiosity in a mobile robot, which is useful for monitoring and adaptive data collection tasks, especially in the context of long term autonomous missions where pre-programmed missions are likely to have limited utility.
no code implementations • 10 Sep 2015 • Yogesh Girdhar, Gregory Dudek
Topic modeling of streaming sensor data can be used for high level perception of the environment by a mobile robot.