Search Results for author: Nathaniel Blanchard

Found 17 papers, 4 papers with code

The VoxWorld Platform for Multimodal Embodied Agents

no code implementations LREC 2022 Nikhil Krishnaswamy, William Pickard, Brittany Cates, Nathaniel Blanchard, James Pustejovsky

We present a five-year retrospective on the development of the VoxWorld platform, first introduced as a multimodal platform for modeling motion language, that has evolved into a platform for rapidly building and deploying embodied agents with contextual and situational awareness, capable of interacting with humans in multiple modalities, and exploring their environments.

Multimodal Cross-Document Event Coreference Resolution Using Linear Semantic Transfer and Mixed-Modality Ensembles

1 code implementation13 Apr 2024 Abhijnan Nath, Huma Jamil, Shafiuddin Rehan Ahmed, George Baker, Rahul Ghosh, James H. Martin, Nathaniel Blanchard, Nikhil Krishnaswamy

We establish three methods that incorporate images and text for coreference: 1) a standard fused model with finetuning, 2) a novel linear mapping method without finetuning and 3) an ensembling approach based on splitting mention pairs by semantic and discourse-level difficulty.

coreference-resolution Event Coreference Resolution

Common Ground Tracking in Multimodal Dialogue

1 code implementation26 Mar 2024 Ibrahim Khebour, Kenneth Lai, Mariah Bradford, Yifan Zhu, Richard Brutti, Christopher Tam, Jingxuan Tu, Benjamin Ibarra, Nathaniel Blanchard, Nikhil Krishnaswamy, James Pustejovsky

Within Dialogue Modeling research in AI and NLP, considerable attention has been spent on ``dialogue state tracking'' (DST), which is the ability to update the representations of the speaker's needs at each turn in the dialogue by taking into account the past dialogue moves and history.

Dialogue State Tracking

How Good is Automatic Segmentation as a Multimodal Discourse Annotation Aid?

no code implementations27 May 2023 Corbin Terpstra, Ibrahim Khebour, Mariah Bradford, Brett Wisniewski, Nikhil Krishnaswamy, Nathaniel Blanchard

We (1) manually transcribe utterances in a dataset of triads collaboratively solving a problem involving dialogue and physical object manipulation, (2) annotate collaborative moves according to these gold-standard transcripts, and then (3) apply these annotations to utterances that have been automatically segmented using toolkits from Google and OpenAI's Whisper.

Segmentation

Hamming Similarity and Graph Laplacians for Class Partitioning and Adversarial Image Detection

no code implementations2 May 2023 Huma Jamil, Yajing Liu, Turgay Caglar, Christina M. Cole, Nathaniel Blanchard, Christopher Peterson, Michael Kirby

Here, we investigate the potential for ReLU activation patterns (encoded as bit vectors) to aid in understanding and interpreting the behavior of neural networks.

A Computer Vision Method for Estimating Velocity from Jumps

no code implementations9 Dec 2022 Soumyadip Roy, Chaitanya Roygaga, Nathaniel Blanchard, Aparna Bharati

Athletes routinely undergo fitness evaluations to evaluate their training progress.

Dual Graphs of Polyhedral Decompositions for the Detection of Adversarial Attacks

no code implementations23 Nov 2022 Huma Jamil, Yajing Liu, Christina M. Cole, Nathaniel Blanchard, Emily J. King, Michael Kirby, Christopher Peterson

This paper illustrates how one can utilize the dual graph to detect and analyze adversarial attacks in the context of digital images.

APE-V: Athlete Performance Evaluation using Video

1 code implementation IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) Workshops 2022 Chaitanya Roygaga, Dhruva Patil, Michael Boyle, William Pickard, Raoul Reiser, Aparna Bharati, Nathaniel Blanchard

However, athlete error calls into question 46. 2% of movements — in these cases, an expert assessor would have the athlete redo the movement to eliminate the error.

Canonical Face Embeddings

no code implementations15 Jun 2021 David McNeely-White, Ben Sattelberg, Nathaniel Blanchard, Ross Beveridge

When instead evaluating embeddings generated from two CNNs, where one CNN's embeddings are mapped with a linear transformation, the mean true accept rate drops to 0. 95 using the same verification paradigm.

Face Verification

Motion Prediction Performance Analysis for Autonomous Driving Systems and the Effects of Tracking Noise

no code implementations16 Apr 2021 Ameni Trabelsi, Ross J. Beveridge, Nathaniel Blanchard

In this work, we systematically explore the importance of the tracking module for the motion prediction task and ultimately conclude that the overall motion prediction performance is highly sensitive to the tracking module's imperfections.

Autonomous Driving motion prediction

Exploring the Interchangeability of CNN Embedding Spaces

no code implementations5 Oct 2020 David McNeely-White, Benjamin Sattelberg, Nathaniel Blanchard, Ross Beveridge

When image embeddings generated by one CNN are transformed into embeddings corresponding to the feature space of a second CNN trained on the same task, their respective image classification or face verification performance is largely preserved.

Classification Face Recognition +3

A Pose Proposal and Refinement Network for Better Object Pose Estimation

no code implementations11 Apr 2020 Ameni Trabelsi, Mohamed Chaabane, Nathaniel Blanchard, Ross Beveridge

Our approach is composed of 2 main components: the first component classifies the objects in the input image and proposes an initial 6D pose estimate through a multi-task, CNN-based encoder/multi-decoder module.

6D Pose Estimation 6D Pose Estimation using RGB +1

Utilizing Network Properties to Detect Erroneous Inputs

no code implementations28 Feb 2020 Matt Gorbett, Nathaniel Blanchard

Neural networks are vulnerable to a wide range of erroneous inputs such as adversarial, corrupted, out-of-distribution, and misclassified examples.

Looking Ahead: Anticipating Pedestrians Crossing with Future Frames Prediction

no code implementations20 Oct 2019 Mohamed Chaabane, Ameni Trabelsi, Nathaniel Blanchard, Ross Beveridge

Our end-to-end model consists of two stages: the first stage is an encoder/decoder network that learns to predict future video frames.

Action Recognition Activity Recognition In Videos +3

A Neurobiological Evaluation Metric for Neural Network Model Search

1 code implementation CVPR 2019 Nathaniel Blanchard, Jeffery Kinnison, Brandon RichardWebster, Pouya Bashivan, Walter J. Scheirer

In this paper we introduce a human-model similarity (HMS) metric, which quantifies the similarity of human fMRI and network activation behavior.

Cannot find the paper you are looking for? You can Submit a new open access paper.