Search Results for author: Spencer Whitehead

Found 16 papers, 7 papers with code

Improving Selective Visual Question Answering by Learning from Your Peers

1 code implementation • CVPR 2023 • Corentin Dancette, Spencer Whitehead, Rishabh Maheshwary, Ramakrishna Vedantam, Stefan Scherer, Xinlei Chen, Matthieu Cord, Marcus Rohrbach

In this work, we explore Selective VQA in both in-distribution (ID) and OOD scenarios, where models are presented with mixtures of ID and OOD data.

Question Answering Visual Question Answering

Paper
Code

Simple Token-Level Confidence Improves Caption Correctness

no code implementations • 11 May 2023 • Suzanne Petryk, Spencer Whitehead, Joseph E. Gonzalez, Trevor Darrell, Anna Rohrbach, Marcus Rohrbach

The ability to judge whether a caption correctly describes an image is a critical part of vision-language understanding.

Ranked #62 on Visual Reasoning on Winoground

Hallucination Image Captioning +2

Paper
Add Code

Segment Anything

18 code implementations • ICCV 2023 • Alexander Kirillov, Eric Mintun, Nikhila Ravi, Hanzi Mao, Chloe Rolland, Laura Gustafson, Tete Xiao, Spencer Whitehead, Alexander C. Berg, Wan-Yen Lo, Piotr Dollár, Ross Girshick

We introduce the Segment Anything (SA) project: a new task, model, and dataset for image segmentation.

Ranked #2 on Zero-Shot Instance Segmentation on LVIS v1.0 val

Event-based Object Segmentation Image Segmentation +3

126,503

Paper
Code

Reliable Visual Question Answering: Abstain Rather Than Answer Incorrectly

1 code implementation • 28 Apr 2022 • Spencer Whitehead, Suzanne Petryk, Vedaad Shakib, Joseph Gonzalez, Trevor Darrell, Anna Rohrbach, Marcus Rohrbach

We first enable abstention capabilities for several VQA models, and analyze both their coverage, the portion of questions answered, and risk, the error on that portion.

Question Answering Visual Question Answering

Paper
Code

Separating Skills and Concepts for Novel Visual Question Answering

1 code implementation • CVPR 2021 • Spencer Whitehead, Hui Wu, Heng Ji, Rogerio Feris, Kate Saenko

Generalization to out-of-distribution data has been a problem for Visual Question Answering (VQA) models.

Attribute Contrastive Learning +2

Paper
Code

Learning from Lexical Perturbations for Consistent Visual Question Answering

1 code implementation • 26 Nov 2020 • Spencer Whitehead, Hui Wu, Yi Ren Fung, Heng Ji, Rogerio Feris, Kate Saenko

Existing Visual Question Answering (VQA) models are often fragile and sensitive to input variations.

Question Answering Visual Question Answering +1

Paper
Code

Global Attention for Name Tagging

no code implementations • CONLL 2018 • Boliang Zhang, Spencer Whitehead, Lifu Huang, Heng Ji

Many name tagging approaches use local contextual information with much success, but fail when the local context is ambiguous or limited.

Paper
Add Code

GAIA: A Fine-grained Multimedia Knowledge Extraction System

no code implementations • ACL 2020 • Manling Li, Alireza Zareian, Ying Lin, Xiaoman Pan, Spencer Whitehead, Brian Chen, Bo Wu, Heng Ji, Shih-Fu Chang, Clare Voss, Daniel Napierski, Marjorie Freedman

We present the first comprehensive, open source multimedia knowledge extraction system that takes a massive stream of unstructured, heterogeneous multimedia data from various sources and languages as input, and creates a coherent, structured knowledge base, indexing entities, relations, and events, following a rich, fine-grained ontology.

Paper
Add Code

Cross-media Structured Common Space for Multimedia Event Extraction

no code implementations • ACL 2020 • Manling Li, Alireza Zareian, Qi Zeng, Spencer Whitehead, Di Lu, Heng Ji, Shih-Fu Chang

We introduce a new task, MultiMedia Event Extraction (M2E2), which aims to extract events and their arguments from multimedia documents.

Event Extraction

Paper
Add Code

A Deep Reinforcement Learning Approach to First-Order Logic Theorem Proving

1 code implementation • 5 Nov 2019 • Maxwell Crouse, Ibrahim Abdelaziz, Bassem Makni, Spencer Whitehead, Cristina Cornelio, Pavan Kapanipathi, Kavitha Srinivas, Veronika Thost, Michael Witbrock, Achille Fokoue

Automated theorem provers have traditionally relied on manually tuned heuristics to guide how they perform proof search.

Automated Theorem Proving reinforcement-learning +1

Paper
Code

Infusing Knowledge into the Textual Entailment Task Using Graph Convolutional Networks

no code implementations • 5 Nov 2019 • Pavan Kapanipathi, Veronika Thost, Siva Sankalp Patel, Spencer Whitehead, Ibrahim Abdelaziz, Avinash Balakrishnan, Maria Chang, Kshitij Fadnis, Chulaka Gunasekara, Bassem Makni, Nicholas Mattei, Kartik Talamadupula, Achille Fokoue

A few approaches have shown that information from external knowledge sources like knowledge graphs (KGs) can add value, in addition to the textual content, by providing background knowledge that may be critical for a task.

Knowledge Graphs Natural Language Inference

Paper
Add Code

Studying Wythoff and Zometool Constructions using Maple

no code implementations • 16 Aug 2019 • Benoit Charbonneau, Spencer Whitehead

We describe a Maple package that serves at least four purposes.

Computational Geometry

Paper
Add Code

Multilingual Entity, Relation, Event and Human Value Extraction

no code implementations • NAACL 2019 • Manling Li, Ying Lin, Joseph Hoover, Spencer Whitehead, Clare Voss, Morteza Dehghani, Heng Ji

This paper demonstrates a state-of-the-art end-to-end multilingual (English, Russian, and Ukrainian) knowledge extraction system that can perform entity discovery and linking, relation extraction, event extraction, and coreference.

Event Extraction Relation +1

Paper
Add Code

Incorporating Background Knowledge into Video Description Generation

no code implementations • EMNLP 2018 • Spencer Whitehead, Heng Ji, Mohit Bansal, Shih-Fu Chang, Clare Voss

We develop an approach that uses video meta-data to retrieve topically related news documents for a video and extracts the events and named entities from these documents.

Decoder Text Generation +2

Paper
Add Code

Paper Abstract Writing through Editing Mechanism

2 code implementations • ACL 2018 • Qingyun Wang, Zhi-Hao Zhou, Lifu Huang, Spencer Whitehead, Boliang Zhang, Heng Ji, Kevin Knight

We present a paper abstract writing system based on an attentive neural sequence-to-sequence model that can take a title as input and automatically generate an abstract.

Ranked #1 on Paper generation on ACL Title and Abstract Dataset

Paper generation

Paper
Code

Entity-aware Image Caption Generation

no code implementations • EMNLP 2018 • Di Lu, Spencer Whitehead, Lifu Huang, Heng Ji, Shih-Fu Chang

Current image captioning approaches generate descriptions which lack specific information, such as named entities that are involved in the images.

Caption Generation Image Captioning

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.