Search Results for author: Jonathan Francis

Found 37 papers, 10 papers with code

Diverse and Admissible Trajectory Prediction through Multimodal Context Understanding

1 code implementation ECCV 2020 Seong Hyeon Park, Gyubok Lee, Jimin Seo, Manoj Bhat, Minseok Kang, Jonathan Francis, Ashwin Jadhav, Paul Pu Liang, Louis-Philippe Morency

Multi-agent trajectory forecasting in autonomous driving requires an agent to accurately anticipate the behaviors of the surrounding vehicles and pedestrians, for safe and reliable decision-making.

Autonomous Driving Decision Making +3

Your Learned Constraint is Secretly a Backward Reachable Tube

no code implementations26 Jan 2025 Mohamad Qadri, Gokul Swamy, Jonathan Francis, Michael Kaess, Andrea Bajcsy

In the language of safe control, this means we recover a backwards reachable tube (BRT) rather than a failure set.

STRAP: Robot Sub-Trajectory Retrieval for Augmented Policy Learning

no code implementations19 Dec 2024 Marius Memmel, Jacob Berg, Bingqing Chen, Abhishek Gupta, Jonathan Francis

Many robot learning methods treat such datasets as multi-task expert data and learn a multi-task, generalist policy by training broadly across them.

Dynamic Time Warping Multi-Task Learning +1

GraphEQA: Using 3D Semantic Scene Graphs for Real-time Embodied Question Answering

no code implementations19 Dec 2024 Saumya Saxena, Blake Buchanan, Chris Paxton, Bingqing Chen, Narunas Vaskevicius, Luigi Palmieri, Jonathan Francis, Oliver Kroemer

In Embodied Question Answering (EQA), agents must explore and develop a semantic understanding of an unseen environment in order to answer a situated question with confidence.

Efficient Exploration Embodied Question Answering +2

Object and Contact Point Tracking in Demonstrations Using 3D Gaussian Splatting

no code implementations5 Nov 2024 Michael Büttner, Jonathan Francis, Helge Rhodin, Andrew Melnik

This paper introduces a method to enhance Interactive Imitation Learning (IIL) by extracting touch interaction points and tracking object movement from video demonstrations.

Imitation Learning Point Tracking

Grounding Robot Policies with Visuomotor Language Guidance

no code implementations9 Oct 2024 Arthur Bucker, Pablo Ortega-Kral, Jonathan Francis, Jean Oh

Recent advances in the fields of natural language processing and computer vision have shown great potential in understanding the underlying dynamics of the world from large-scale internet data.

Visual Grounding

SEAL: Towards Safe Autonomous Driving via Skill-Enabled Adversary Learning for Closed-Loop Scenario Generation

1 code implementation16 Sep 2024 Benjamin Stoler, Ingrid Navarro, Jonathan Francis, Jean Oh

Verification and validation of autonomous driving (AD) systems and components is of increasing importance, as such technology increases in real-world prevalence.

Autonomous Driving

Cognitive LLMs: Towards Integrating Cognitive Architectures and Large Language Models for Manufacturing Decision-making

no code implementations17 Aug 2024 Siyu Wu, Alessandro Oltramari, Jonathan Francis, C. Lee Giles, Frank E. Ritter

Resolving the dichotomy between the human-like yet constrained reasoning processes of Cognitive Architectures and the broad but often noisy inference behavior of Large Language Models (LLMs) remains a challenging but exciting pursuit, for enabling reliable machine reasoning capabilities in production systems.

Decision Making Hallucination

Enhancing Vision-Language Models with Scene Graphs for Traffic Accident Understanding

no code implementations8 Jul 2024 Aaron Lohner, Francesco Compagno, Jonathan Francis, Alessandro Oltramari

This representation of a traffic scene is referred to as a scene graph, and can be used as input for an accident classifier.

Autonomous Driving

Attribution Regularization for Multimodal Paradigms

no code implementations2 Apr 2024 Sahiti Yerramilli, Jayant Sravan Tamarapalli, Jonathan Francis, Eric Nyberg

This research project aims to address these challenges by proposing a novel regularization term that encourages multimodal models to effectively utilize information from all modalities when making decisions.

Decision Making

Toward General-Purpose Robots via Foundation Models: A Survey and Meta-Analysis

no code implementations14 Dec 2023 Yafei Hu, Quanting Xie, Vidhi Jain, Jonathan Francis, Jay Patrikar, Nikhil Keetha, Seungchan Kim, Yaqi Xie, Tianyi Zhang, Hao-Shu Fang, Shibo Zhao, Shayegan Omidshafiei, Dong-Ki Kim, Ali-akbar Agha-mohammadi, Katia Sycara, Matthew Johnson-Roberson, Dhruv Batra, Xiaolong Wang, Sebastian Scherer, Chen Wang, Zsolt Kira, Fei Xia, Yonatan Bisk

Motivated by the impressive open-set performance and content generation capabilities of web-scale, large-capacity pre-trained models (i. e., foundation models) in research fields such as Natural Language Processing (NLP) and Computer Vision (CV), we devote this survey to exploring (i) how these existing foundation models from NLP and CV can be applied to the field of general-purpose robotics, and also exploring (ii) what a robotics-specific foundation model would look like.

A Study of Situational Reasoning for Traffic Understanding

1 code implementation5 Jun 2023 Jiarui Zhang, Filip Ilievski, Kaixin Ma, Aravinda Kollaa, Jonathan Francis, Alessandro Oltramari

Intelligent Traffic Monitoring (ITMo) technologies hold the potential for improving road safety/security and for enabling smart city infrastructure.

Decision Making Knowledge Graphs +2

Knowledge-enhanced Agents for Interactive Text Games

no code implementations8 May 2023 Prateek Chhikara, Jiarui Zhang, Filip Ilievski, Jonathan Francis, Kaixin Ma

We experiment with four models on the 10 tasks in the ScienceWorld text-based game environment, to illustrate the impact of knowledge injection on various model configurations and challenging task settings.

Instruction Following Knowledge Graphs +5

Regularizing Self-training for Unsupervised Domain Adaptation via Structural Constraints

no code implementations29 Apr 2023 Rajshekhar Das, Jonathan Francis, Sanket Vaibhav Mehta, Jean Oh, Emma Strubell, Jose Moura

Self-training based on pseudo-labels has emerged as a dominant approach for addressing conditional distribution shifts in unsupervised domain adaptation (UDA) for semantic segmentation problems.

Object Semantic Segmentation +1

Core Challenges in Embodied Vision-Language Planning

no code implementations5 Apr 2023 Jonathan Francis, Nariaki Kitamura, Felix Labelle, Xiaopeng Lu, Ingrid Navarro, Jean Oh

Recent advances in the areas of Multimodal Machine Learning and Artificial Intelligence (AI) have led to the development of challenging tasks at the intersection of Computer Vision, Natural Language Processing, and Robotics.

Survey

Knowledge-driven Scene Priors for Semantic Audio-Visual Embodied Navigation

no code implementations21 Dec 2022 Gyan Tatiya, Jonathan Francis, Luca Bondi, Ingrid Navarro, Eric Nyberg, Jivko Sinapov, Jean Oh

We also define a new audio-visual navigation sub-task, where agents are evaluated on novel sounding objects, as opposed to unheard clips of known objects.

Visual Navigation

Distribution-aware Goal Prediction and Conformant Model-based Planning for Safe Autonomous Driving

no code implementations16 Dec 2022 Jonathan Francis, Bingqing Chen, Weiran Yao, Eric Nyberg, Jean Oh

The feasibility of collecting a large amount of expert demonstrations has inspired growing research interests in learning-to-drive settings, where models learn by imitating the driving behaviour from experts.

Autonomous Driving Density Estimation +1

Utilizing Background Knowledge for Robust Reasoning over Traffic Situations

1 code implementation4 Dec 2022 Jiarui Zhang, Filip Ilievski, Aravinda Kollaa, Jonathan Francis, Kaixin Ma, Alessandro Oltramari

Understanding novel situations in the traffic domain requires an intricate combination of domain-specific and causal commonsense knowledge.

Knowledge Graphs Multiple-choice +2

Coalescing Global and Local Information for Procedural Text Understanding

1 code implementation COLING 2022 Kaixin Ma, Filip Ilievski, Jonathan Francis, Eric Nyberg, Alessandro Oltramari

In this paper, we propose Coalescing Global and Local Information (CGLI), a new model that builds entity- and timestep-aware input representations (local input) considering the whole context (global input), and we jointly model the entity states with a structured prediction objective (global output).

Procedural Text Understanding Structured Prediction

An Empirical Investigation of Commonsense Self-Supervision with Knowledge Graphs

no code implementations21 May 2022 Jiarui Zhang, Filip Ilievski, Kaixin Ma, Jonathan Francis, Alessandro Oltramari

In this paper, we study the effect of knowledge sampling strategies and sizes that can be used to generate synthetic data for adapting language models.

Decoder Knowledge Graphs

Generalizable Neuro-symbolic Systems for Commonsense Question Answering

no code implementations17 Jan 2022 Alessandro Oltramari, Jonathan Francis, Filip Ilievski, Kaixin Ma, Roshanak Mirzaee

This chapter illustrates how suitable neuro-symbolic models for language understanding can enable domain generalizability and robustness in downstream tasks.

Knowledge Graphs Question Answering

Safe Autonomous Racing via Approximate Reachability on Ego-vision

no code implementations14 Oct 2021 Bingqing Chen, Jonathan Francis, Jean Oh, Eric Nyberg, Sylvia L. Herbert

Given the nature of the task, autonomous agents need to be able to 1) identify and avoid unsafe scenarios under the complex vehicle dynamics, and 2) make sub-second decision in a fast-changing environment.

Autonomous Driving Autonomous Racing +2

Unsupervised Domain Adaptation Via Pseudo-labels And Objectness Constraints

no code implementations29 Sep 2021 Rajshekhar Das, Jonathan Francis, Sanket Vaibhav Mehta, Jean Oh, Emma Strubell, Jose Moura

Crucially, the objectness constraint is agnostic to the ground-truth semantic segmentation labels and, therefore, remains appropriate for unsupervised adaptation settings.

Object Pseudo Label +4

Core Challenges in Embodied Vision-Language Planning

no code implementations26 Jun 2021 Jonathan Francis, Nariaki Kitamura, Felix Labelle, Xiaopeng Lu, Ingrid Navarro, Jean Oh

Recent advances in the areas of multimodal machine learning and artificial intelligence (AI) have led to the development of challenging tasks at the intersection of Computer Vision, Natural Language Processing, and Embodied AI.

Survey

Learn-to-Race: A Multimodal Control Environment for Autonomous Racing

1 code implementation ICCV 2021 James Herman, Jonathan Francis, Siddha Ganju, Bingqing Chen, Anirudh Koul, Abhinav Gupta, Alexey Skabelkin, Ivan Zhukov, Max Kumskoy, Eric Nyberg

Existing research on autonomous driving primarily focuses on urban driving, which is insufficient for characterising the complex driving behaviour underlying high-speed racing.

Autonomous Driving Autonomous Racing +1

Trajformer: Trajectory Prediction with Local Self-Attentive Contexts for Autonomous Driving

2 code implementations30 Nov 2020 Manoj Bhat, Jonathan Francis, Jean Oh

Effective feature-extraction is critical to models' contextual understanding, particularly for applications to robotics and autonomous driving, such as multimodal trajectory prediction.

Autonomous Driving Trajectory Prediction

Knowledge-driven Data Construction for Zero-shot Evaluation in Commonsense Question Answering

1 code implementation7 Nov 2020 Kaixin Ma, Filip Ilievski, Jonathan Francis, Yonatan Bisk, Eric Nyberg, Alessandro Oltramari

Guided by a set of hypotheses, the framework studies how to transform various pre-existing knowledge resources into a form that is most effective for pre-training models.

Language Modeling Language Modelling +1

Neuro-symbolic Architectures for Context Understanding

no code implementations9 Mar 2020 Alessandro Oltramari, Jonathan Francis, Cory Henson, Kaixin Ma, Ruwan Wickramarachchi

Computational context understanding refers to an agent's ability to fuse disparate sources of information for decision-making and is, therefore, generally regarded as a prerequisite for sophisticated machine reasoning capabilities, such as in artificial intelligence (AI).

Decision Making

Diverse and Admissible Trajectory Forecasting through Multimodal Context Understanding

1 code implementation6 Mar 2020 Seong Hyeon Park, Gyubok Lee, Manoj Bhat, Jimin Seo, Minseok Kang, Jonathan Francis, Ashwin R. Jadhav, Paul Pu Liang, Louis-Philippe Morency

Multi-agent trajectory forecasting in autonomous driving requires an agent to accurately anticipate the behaviors of the surrounding vehicles and pedestrians, for safe and reliable decision-making.

Autonomous Driving Decision Making +2

Towards Generalizable Neuro-Symbolic Systems for Commonsense Question Answering

no code implementations WS 2019 Kaixin Ma, Jonathan Francis, Quanyang Lu, Eric Nyberg, Alessandro Oltramari

Non-extractive commonsense QA remains a challenging AI task, as it requires systems to reason about, synthesize, and gather disparate pieces of information, in order to generate responses to queries.

Common Sense Reasoning Question Answering +1

How Would You Say It? Eliciting Lexically Diverse Dialogue for Supervised Semantic Parsing

no code implementations WS 2017 Ravich, Abhilasha er, Thomas Manzini, Matthias Grabmair, Graham Neubig, Jonathan Francis, Eric Nyberg

Wang et al. (2015) proposed a method to build semantic parsing datasets by generating canonical utterances using a grammar and having crowdworkers paraphrase them into natural wording.

Semantic Parsing

Cannot find the paper you are looking for? You can Submit a new open access paper.