no code implementations • LREC 2022 • Nikhil Krishnaswamy, William Pickard, Brittany Cates, Nathaniel Blanchard, James Pustejovsky
We present a five-year retrospective on the development of the VoxWorld platform, first introduced as a multimodal platform for modeling motion language, that has evolved into a platform for rapidly building and deploying embodied agents with contextual and situational awareness, capable of interacting with humans in multiple modalities, and exploring their environments.
1 code implementation • 13 Apr 2024 • Abhijnan Nath, Huma Jamil, Shafiuddin Rehan Ahmed, George Baker, Rahul Ghosh, James H. Martin, Nathaniel Blanchard, Nikhil Krishnaswamy
We establish three methods that incorporate images and text for coreference: 1) a standard fused model with finetuning, 2) a novel linear mapping method without finetuning and 3) an ensembling approach based on splitting mention pairs by semantic and discourse-level difficulty.
1 code implementation • 26 Mar 2024 • Ibrahim Khebour, Kenneth Lai, Mariah Bradford, Yifan Zhu, Richard Brutti, Christopher Tam, Jingxuan Tu, Benjamin Ibarra, Nathaniel Blanchard, Nikhil Krishnaswamy, James Pustejovsky
Within Dialogue Modeling research in AI and NLP, considerable attention has been spent on ``dialogue state tracking'' (DST), which is the ability to update the representations of the speaker's needs at each turn in the dialogue by taking into account the past dialogue moves and history.
no code implementations • 27 May 2023 • Corbin Terpstra, Ibrahim Khebour, Mariah Bradford, Brett Wisniewski, Nikhil Krishnaswamy, Nathaniel Blanchard
We (1) manually transcribe utterances in a dataset of triads collaboratively solving a problem involving dialogue and physical object manipulation, (2) annotate collaborative moves according to these gold-standard transcripts, and then (3) apply these annotations to utterances that have been automatically segmented using toolkits from Google and OpenAI's Whisper.
no code implementations • 2 May 2023 • Huma Jamil, Yajing Liu, Turgay Caglar, Christina M. Cole, Nathaniel Blanchard, Christopher Peterson, Michael Kirby
Here, we investigate the potential for ReLU activation patterns (encoded as bit vectors) to aid in understanding and interpreting the behavior of neural networks.
no code implementations • 9 Dec 2022 • Soumyadip Roy, Chaitanya Roygaga, Nathaniel Blanchard, Aparna Bharati
Athletes routinely undergo fitness evaluations to evaluate their training progress.
no code implementations • 23 Nov 2022 • Huma Jamil, Yajing Liu, Christina M. Cole, Nathaniel Blanchard, Emily J. King, Michael Kirby, Christopher Peterson
This paper illustrates how one can utilize the dual graph to detect and analyze adversarial attacks in the context of digital images.
1 code implementation • IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) Workshops 2022 • Chaitanya Roygaga, Dhruva Patil, Michael Boyle, William Pickard, Raoul Reiser, Aparna Bharati, Nathaniel Blanchard
However, athlete error calls into question 46. 2% of movements — in these cases, an expert assessor would have the athlete redo the movement to eliminate the error.
no code implementations • 15 Jun 2021 • David McNeely-White, Ben Sattelberg, Nathaniel Blanchard, Ross Beveridge
When instead evaluating embeddings generated from two CNNs, where one CNN's embeddings are mapped with a linear transformation, the mean true accept rate drops to 0. 95 using the same verification paradigm.
no code implementations • 16 Apr 2021 • Ameni Trabelsi, Ross J. Beveridge, Nathaniel Blanchard
In this work, we systematically explore the importance of the tracking module for the motion prediction task and ultimately conclude that the overall motion prediction performance is highly sensitive to the tracking module's imperfections.
no code implementations • 5 Oct 2020 • David McNeely-White, Benjamin Sattelberg, Nathaniel Blanchard, Ross Beveridge
When image embeddings generated by one CNN are transformed into embeddings corresponding to the feature space of a second CNN trained on the same task, their respective image classification or face verification performance is largely preserved.
no code implementations • 11 Apr 2020 • Ameni Trabelsi, Mohamed Chaabane, Nathaniel Blanchard, Ross Beveridge
Our approach is composed of 2 main components: the first component classifies the objects in the input image and proposes an initial 6D pose estimate through a multi-task, CNN-based encoder/multi-decoder module.
no code implementations • 28 Feb 2020 • Matt Gorbett, Nathaniel Blanchard
Neural networks are vulnerable to a wide range of erroneous inputs such as adversarial, corrupted, out-of-distribution, and misclassified examples.
no code implementations • 20 Oct 2019 • Mohamed Chaabane, Ameni Trabelsi, Nathaniel Blanchard, Ross Beveridge
Our end-to-end model consists of two stages: the first stage is an encoder/decoder network that learns to predict future video frames.
no code implementations • WS 2018 • Nathaniel Blanchard, Daniel Moreira, Aparna Bharati, Walter J. Scheirer
We discard traditional transcription features in order to minimize human intervention and to maximize the deployability of our model on at-scale real-world data.
1 code implementation • CVPR 2019 • Nathaniel Blanchard, Jeffery Kinnison, Brandon RichardWebster, Pouya Bashivan, Walter J. Scheirer
In this paper we introduce a human-model similarity (HMS) metric, which quantifies the similarity of human fMRI and network activation behavior.
no code implementations • WS 2016 • Nathaniel Blanchard, Patrick Donnelly, Andrew M. Olney, Samei Borhan, Brooke Ward, Xiaoyi Sun, Sean Kelly, Martin Nystrand, Sidney K. D'Mello
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1