Search Results for author: Peter Jansen

Found 30 papers, 12 papers with code

A Systematic Survey of Text Worlds as Embodied Natural Language Environments

no code implementations NAACL (Wordplay) 2022 Peter Jansen

Text Worlds are virtual environments for embodied agents that, unlike 2D or 3D environments, are rendered exclusively using textual descriptions.

Common Sense Reasoning Knowledge Graphs +1

TextGraphs 2020 Shared Task on Multi-Hop Inference for Explanation Regeneration

no code implementations COLING (TextGraphs) 2020 Peter Jansen, Dmitry Ustalov

In this second iteration of the explanation regeneration shared task, participants are supplied with more than double the training and evaluation data of the first shared task, as well as a knowledge base nearly double in size, both of which expand into more challenging scientific topics that increase the difficulty of the task.

World Knowledge

TextGraphs 2021 Shared Task on Multi-Hop Inference for Explanation Regeneration

1 code implementation NAACL (TextGraphs) 2021 Peter Jansen, Mokanarangan Thayaparan, Marco Valentino, Dmitry Ustalov

While previous editions of this shared task aimed to evaluate explanatory completeness – finding a set of facts that form a complete inference chain, without gaps, to arrive from question to correct answer, this 2021 instantiation concentrates on the subtask of determining relevance in large multi-hop explanations.

Enhancing Systematic Decompositional Natural Language Inference Using Informal Logic

no code implementations22 Feb 2024 Nathaniel Weir, Kate Sanders, Orion Weller, Shreya Sharma, Dongwei Jiang, Zhengping Jiang, Bhavana Dalvi Mishra, Oyvind Tafjord, Peter Jansen, Peter Clark, Benjamin Van Durme

Contemporary language models enable new opportunities for structured reasoning with text, such as the construction and evaluation of intuitive, proof-like textual entailment trees without relying on brittle formal logic.

Formal Logic Knowledge Distillation +2

Self-Supervised Behavior Cloned Transformers are Path Crawlers for Text Games

no code implementations7 Dec 2023 Ruoyao Wang, Peter Jansen

In this work, we introduce a self-supervised behavior cloning transformer for text games, which are challenging benchmarks for multi-step reasoning in virtual environments.

CLIN: A Continually Learning Language Agent for Rapid Task Adaptation and Generalization

no code implementations16 Oct 2023 Bodhisattwa Prasad Majumder, Bhavana Dalvi Mishra, Peter Jansen, Oyvind Tafjord, Niket Tandon, Li Zhang, Chris Callison-Burch, Peter Clark

Language agents have shown some ability to interact with an external environment, e. g., a virtual world such as ScienceWorld, to perform complex tasks, e. g., growing a plant, without the startup costs of reinforcement learning.

From Words to Wires: Generating Functioning Electronic Devices from Natural Language Descriptions

1 code implementation24 May 2023 Peter Jansen

In this work, we show that contemporary language models have a previously unknown skill -- the capacity for electronic circuit design from high-level textual descriptions, akin to code generation.

Code Generation

ByteSized32: A Corpus and Challenge Task for Generating Task-Specific World Models Expressed as Text Games

1 code implementation24 May 2023 Ruoyao Wang, Graham Todd, Eric Yuan, Ziang Xiao, Marc-Alexandre Côté, Peter Jansen

In this work, we investigate the capacity of language models to generate explicit, interpretable, and interactive world models of scientific and common-sense reasoning tasks.

Code Generation Common Sense Reasoning +2

Behavior Cloned Transformers are Neurosymbolic Reasoners

1 code implementation13 Oct 2022 Ruoyao Wang, Peter Jansen, Marc-Alexandre Côté, Prithviraj Ammanabrolu

In this work, we explore techniques for augmenting interactive agents with information from symbolic modules, much like humans use tools like calculators and GPS systems to assist with arithmetic and navigation.

Common Sense Reasoning

ScienceWorld: Is your Agent Smarter than a 5th Grader?

1 code implementation14 Mar 2022 Ruoyao Wang, Peter Jansen, Marc-Alexandre Côté, Prithviraj Ammanabrolu

We present ScienceWorld, a benchmark to test agents' scientific reasoning abilities in a new interactive text environment at the level of a standard elementary school science curriculum.

Question Answering

Extracting Space Situational Awareness Events from News Text

1 code implementation LREC 2022 Zhengnan Xie, Alice Saebom Kwak, Enfa George, Laura W. Dozal, Hoang Van, Moriba Jah, Roberto Furfaro, Peter Jansen

Space situational awareness typically makes use of physical measurements from radar, telescopes, and other assets to monitor satellites and other spacecraft for operational, navigational, and defense purposes.

Event Extraction

On the Challenges of Evaluating Compositional Explanations in Multi-Hop Inference: Relevance, Completeness, and Expert Ratings

no code implementations EMNLP 2021 Peter Jansen, Kelly Smith, Dan Moreno, Huitzilin Ortiz

Building compositional explanations requires models to combine two or more facts that, together, describe why the answer to a question is correct.

valid

Picard understanding Darmok: A Dataset and Model for Metaphor-Rich Translation in a Constructed Language

1 code implementation16 Jul 2021 Peter Jansen, Jordan Boyd-Graber

Tamarian, a fictional language introduced in the Star Trek episode Darmok, communicates meaning through utterances of metaphorical references, such as "Darmok and Jalad at Tanagra" instead of "We should work together."

Language Modelling Large Language Model +2

Explaining Answers with Entailment Trees

1 code implementation EMNLP 2021 Bhavana Dalvi, Peter Jansen, Oyvind Tafjord, Zhengnan Xie, Hannah Smith, Leighanna Pipatanangkura, Peter Clark

Our approach is to generate explanations in the form of entailment trees, namely a tree of multipremise entailment steps from facts that are known, through intermediate conclusions, to the hypothesis of interest (namely the question + answer).

Language Modelling Question Answering +1

Visually-Grounded Planning without Vision: Language Models Infer Detailed Plans from High-level Instructions

1 code implementation Findings of the Association for Computational Linguistics 2020 Peter Jansen

The recently proposed ALFRED challenge task aims for a virtual robotic agent to complete complex multi-step everyday tasks in a virtual home environment from high-level natural language directives, such as {``}put a hot piece of bread on a plate{''}.

Translation

CoSaTa: A Constraint Satisfaction Solver and Interpreted Language for Semi-Structured Tables of Sentences

1 code implementation EMNLP 2020 Peter Jansen

This work presents CoSaTa, an intuitive constraint satisfaction solver and interpreted language for knowledge bases of semi-structured tables expressed as text.

Question Answering

WorldTree V2: A Corpus of Science-Domain Structured Explanations and Inference Patterns supporting Multi-Hop Inference

no code implementations LREC 2020 Zhengnan Xie, Sebastian Thiem, Jaycie Martin, Elizabeth Wainwright, Steven Marmorstein, Peter Jansen

Explainable question answering for complex questions often requires combining large numbers of facts to answer a question while providing a human-readable explanation for the answer, a process known as multi-hop inference.

Question Answering World Knowledge

ScienceExamCER: A High-Density Fine-Grained Science-Domain Corpus for Common Entity Recognition

no code implementations LREC 2020 Hannah Smith, Zeyu Zhang, John Culnan, Peter Jansen

Named entity recognition identifies common classes of entities in text, but these entity labels are generally sparse, limiting utility to downstream tasks.

Classification General Classification +5

Extracting Common Inference Patterns from Semi-Structured Explanations

no code implementations WS 2019 Sebastian Thiem, Peter Jansen

Complex questions often require combining multiple facts to correctly answer, particularly when generating detailed explanations for why those answers are correct.

World Knowledge

TextGraphs 2019 Shared Task on Multi-Hop Inference for Explanation Regeneration

no code implementations WS 2019 Peter Jansen, Dmitry Ustalov

While automated question answering systems are increasingly able to retrieve answers to natural language questions, their ability to generate detailed human-readable explanations for their answers is still quite limited.

Information Retrieval Question Answering +2

Multi-hop Inference for Sentence-level TextGraphs: How Challenging is Meaningfully Combining Information for Science Question Answering?

no code implementations WS 2018 Peter Jansen

Question Answering for complex questions is often modeled as a graph construction or traversal task, where a solver must build or traverse a graph of facts that answer and explain a given question.

graph construction Knowledge Graphs +2

Framing QA as Building and Ranking Intersentence Answer Justifications

no code implementations CL 2017 Peter Jansen, Rebecca Sharp, Mihai Surdeanu, Peter Clark

Our best configuration answers 44{\%} of the questions correctly, where the top justifications for 57{\%} of these correct answers contain a compelling human-readable justification that explains the inference required to arrive at the correct answer.

Multiple-choice Question Answering

Creating Causal Embeddings for Question Answering with Minimal Supervision

no code implementations EMNLP 2016 Rebecca Sharp, Mihai Surdeanu, Peter Jansen, Peter Clark, Michael Hammond

We argue that a better approach is to look for answers that are related to the question in a relevant way, according to the information need of the question, which may be determined through task-specific embeddings.

Question Answering Word Embeddings

Higher-order Lexical Semantic Models for Non-factoid Answer Reranking

no code implementations TACL 2015 Daniel Fried, Peter Jansen, Gustave Hahn-Powell, Mihai Surdeanu, Peter Clark

We introduce a higher-order formalism that allows all these lexical semantic models to chain direct evidence to construct indirect associations between question and answer texts, by casting the task as the traversal of graphs that encode direct term associations.

Open-Domain Question Answering Semantic Similarity +1

Cannot find the paper you are looking for? You can Submit a new open access paper.