Search Results for author: Kate Sanders

Found 9 papers, 2 papers with code

Core: Robust Factual Precision Scoring with Informative Sub-Claim Identification

1 code implementation4 Jul 2024 Zhengping Jiang, Jingyu Zhang, Nathaniel Weir, Seth Ebner, Miriam Wanner, Kate Sanders, Daniel Khashabi, Anqi Liu, Benjamin Van Durme

Hallucinations -- the generation of untrue claims -- pose a challenge to the application of large language models (LLMs) [1] thereby motivating the development of metrics to evaluate factual precision.

Informativeness Text Generation

A Survey of Video Datasets for Grounded Event Understanding

1 code implementation14 Jun 2024 Kate Sanders, Benjamin Van Durme

In this paper, we survey 105 video datasets that require event understanding capability, consider how they contribute to the study of robust event understanding in video, and assess proposed video event extraction tasks in the context of this body of research.

Common Sense Reasoning Event Extraction +2

Tur[k]ingBench: A Challenge Benchmark for Web Agents

no code implementations18 Mar 2024 Kevin Xu, Yeganeh Kordi, Tanay Nayak, Ado Asija, Yizhong Wang, Kate Sanders, Adam Byerly, Jingyu Zhang, Benjamin Van Durme, Daniel Khashabi

To support the evaluation of TurkingBench, we have developed a framework that links chatbot responses to actions on web pages (e. g., modifying a text box, selecting a radio button).

Chatbot

TV-TREES: Multimodal Entailment Trees for Neuro-Symbolic Video Reasoning

no code implementations29 Feb 2024 Kate Sanders, Nathaniel Weir, Benjamin Van Durme

It is challenging to perform question-answering over complex, multimodal content such as television clips.

Question Answering Video Understanding

Enhancing Systematic Decompositional Natural Language Inference Using Informal Logic

no code implementations22 Feb 2024 Nathaniel Weir, Kate Sanders, Orion Weller, Shreya Sharma, Dongwei Jiang, Zhengping Jiang, Bhavana Dalvi Mishra, Oyvind Tafjord, Peter Jansen, Peter Clark, Benjamin Van Durme

Recent language models enable new opportunities for structured reasoning with text, such as the construction of intuitive, proof-like textual entailment trees without relying on brittle formal logic.

Formal Logic Knowledge Distillation +2

MultiVENT: Multilingual Videos of Events with Aligned Natural Text

no code implementations6 Jul 2023 Kate Sanders, David Etter, Reno Kriz, Benjamin Van Durme

Everyday news coverage has shifted from traditional broadcasts towards a wide range of presentation formats such as first-hand, unedited video footage.

Information Retrieval Retrieval +1

Ambiguous Images With Human Judgments for Robust Visual Event Classification

no code implementations6 Oct 2022 Kate Sanders, Reno Kriz, Anqi Liu, Benjamin Van Durme

However, humans are frequently presented with visual data that they cannot classify with 100% certainty, and models trained on standard vision benchmarks achieve low performance when evaluated on this data.

Non-Markov Policies to Reduce Sequential Failures in Robot Bin Picking

no code implementations20 Jul 2020 Kate Sanders, Michael Danielczuk, Jeffrey Mahler, Ajay Tanwani, Ken Goldberg

A new generation of automated bin picking systems using deep learning is evolving to support increasing demand for e-commerce.

Cannot find the paper you are looking for? You can Submit a new open access paper.