Search Results for author: Keisuke Sakaguchi

Found 50 papers, 31 papers with code

proScript: Partially Ordered Scripts Generation

no code implementations • Findings (EMNLP) 2021 • Keisuke Sakaguchi, Chandra Bhagavatula, Ronan Le Bras, Niket Tandon, Peter Clark, Yejin Choi

Scripts – prototypical event sequences describing everyday activities – have been shown to help understand narratives by providing expectations, resolving ambiguity, and filling in unstated information.

Text Generation

J-UniMorph: Japanese Morphological Annotation through the Universal Feature Schema

1 code implementation • 22 Feb 2024 • Kosuke Matsuzaki, Masaya Taniguchi, Kentaro Inui, Keisuke Sakaguchi

We introduce a Japanese Morphology dataset, J-UniMorph, developed based on the UniMorph feature schema.

Test-time Augmentation for Factual Probing

1 code implementation • 26 Oct 2023 • Go Kamoda, Benjamin Heinzerling, Keisuke Sakaguchi, Kentaro Inui

Factual probing is a method that uses prompts to test if a language model "knows" certain world knowledge facts.

Language Modelling Relation +1
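The idea named in the title can be sketched as follows; this is a minimal illustration, not the paper's method, and `model` and `paraphrase` are hypothetical stand-ins for a language model query and a prompt-paraphrasing function. The aggregation here is a plain majority vote, which is one simple way to combine predictions over augmented prompts.

```python
from collections import Counter

def tta_probe(model, prompt, paraphrase, n_aug=4):
    """Test-time augmentation sketch for factual probing:
    query the model with the original prompt plus several
    paraphrased variants, then return the majority answer."""
    prompts = [prompt] + [paraphrase(prompt, i) for i in range(n_aug)]
    answers = [model(p) for p in prompts]
    # Majority vote over the answers from all prompt variants.
    return Counter(answers).most_common(1)[0][0]
```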

Evaluating GPT-4 and ChatGPT on Japanese Medical Licensing Examinations

1 code implementation • 31 Mar 2023 • Jungo Kasai, Yuhei Kasai, Keisuke Sakaguchi, Yutaro Yamada, Dragomir Radev

In this work, we evaluate LLM APIs (ChatGPT, GPT-3, and GPT-4) on the Japanese national medical licensing examinations from the past five years, including the current year.

Causal schema induction for knowledge discovery

1 code implementation • 27 Mar 2023 • Michael Regan, Jena D. Hwang, Keisuke Sakaguchi, James Pustejovsky

In this work, we investigate how to apply schema induction models to the task of knowledge discovery for enhanced search of English-language news texts.

I2D2: Inductive Knowledge Distillation with NeuroLogic and Self-Imitation

no code implementations • 19 Dec 2022 • Chandra Bhagavatula, Jena D. Hwang, Doug Downey, Ronan Le Bras, Ximing Lu, Lianhui Qin, Keisuke Sakaguchi, Swabha Swayamdipta, Peter West, Yejin Choi

Here, we investigate an alternative that a priori seems impossible: can smaller language models (e.g., GPT-2) win over models that are orders of magnitude larger and better (e.g., GPT-3), if powered with novel commonsense distillation algorithms?

Imitation Learning Knowledge Distillation

RealTime QA: What's the Answer Right Now?

1 code implementation • NeurIPS 2023 • Jungo Kasai, Keisuke Sakaguchi, Yoichi Takahashi, Ronan Le Bras, Akari Asai, Xinyan Yu, Dragomir Radev, Noah A. Smith, Yejin Choi, Kentaro Inui

We introduce REALTIME QA, a dynamic question answering (QA) platform that announces questions and evaluates systems on a regular basis (weekly in this version).

Information Retrieval Question Answering +1

Towards Automated Document Revision: Grammatical Error Correction, Fluency Edits, and Beyond

1 code implementation • 23 May 2022 • Masato Mita, Keisuke Sakaguchi, Masato Hagiwara, Tomoya Mizumoto, Jun Suzuki, Kentaro Inui

Natural language processing technology has rapidly improved automated grammatical error correction, and the community has begun to explore document-level revision as one of the next challenges.

Grammatical Error Correction Language Modelling +1

Twist Decoding: Diverse Generators Guide Each Other

1 code implementation • 19 May 2022 • Jungo Kasai, Keisuke Sakaguchi, Ronan Le Bras, Hao Peng, Ximing Lu, Dragomir Radev, Yejin Choi, Noah A. Smith

Our extensive evaluations on machine translation and scientific paper summarization demonstrate that Twist decoding substantially outperforms each model decoded in isolation over various scenarios, including cases where domain-specific and general-purpose models are both available.

Machine Translation Text Generation +1

A Call for Clarity in Beam Search: How It Works and When It Stops

1 code implementation • 11 Apr 2022 • Jungo Kasai, Keisuke Sakaguchi, Ronan Le Bras, Dragomir Radev, Yejin Choi, Noah A. Smith

Based on this finding, we introduce a patience factor, a simple modification to this beam decoding implementation, that generalizes the stopping criterion and provides flexibility to the depth of search.

Machine Translation Text Generation +2
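The patience factor described in the abstract can be illustrated with a toy beam search. This is a hedged sketch, not the paper's implementation: `step_scores` is a hypothetical next-token scorer, and the only point being shown is that decoding stops once `patience * beam_size` finished hypotheses have been collected, generalizing the usual stop-at-beam-size rule.

```python
def beam_search(step_scores, beam_size=2, patience=1.0, eos=0, max_len=5):
    """Toy beam search with a patience factor (illustrative).

    `step_scores(prefix)` returns a dict {token: log_prob} of
    next-token scores. Decoding stops when `patience * beam_size`
    EOS-ended hypotheses have been collected.
    """
    budget = max(1, int(patience * beam_size))
    beams = [(0.0, [])]          # (cumulative log-prob, token list)
    finished = []
    for _ in range(max_len):
        # Expand every live beam by every candidate next token.
        candidates = []
        for score, toks in beams:
            for tok, lp in step_scores(tuple(toks)).items():
                candidates.append((score + lp, toks + [tok]))
        candidates.sort(key=lambda c: c[0], reverse=True)
        beams = []
        for score, toks in candidates:
            if toks[-1] == eos:
                finished.append((score, toks))
                # Patience-controlled stopping criterion.
                if len(finished) >= budget:
                    return max(finished, key=lambda c: c[0])
            else:
                beams.append((score, toks))
                if len(beams) == beam_size:
                    break
        if not beams:
            break
    pool = finished or beams
    return max(pool, key=lambda c: c[0])
```

With `patience=1.0` this reduces to the common first-come-first-served stopping rule; larger values let the search keep exploring after the first few hypotheses finish.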

Interscript: A dataset for interactive learning of scripts through error feedback

1 code implementation • 15 Dec 2021 • Niket Tandon, Aman Madaan, Peter Clark, Keisuke Sakaguchi, Yiming Yang

We present a new dataset, Interscript, containing user feedback on a deployed model that generates complex everyday tasks.

Structured Prediction

Bidimensional Leaderboards: Generate and Evaluate Language Hand in Hand

2 code implementations • NAACL 2022 • Jungo Kasai, Keisuke Sakaguchi, Ronan Le Bras, Lavinia Dunagan, Jacob Morrison, Alexander R. Fabbri, Yejin Choi, Noah A. Smith

We therefore propose a generalization of leaderboards, bidimensional leaderboards (Billboards), that simultaneously tracks progress in language generation models and metrics for their evaluation.

Image Captioning Machine Translation +1

Improving Neural Model Performance through Natural Language Feedback on Their Explanations

no code implementations • 18 Apr 2021 • Aman Madaan, Niket Tandon, Dheeraj Rajagopal, Yiming Yang, Peter Clark, Keisuke Sakaguchi, Ed Hovy

A class of explainable NLP models for reasoning tasks support their decisions by generating free-form or structured explanations, but what happens when these supporting structures contain errors?

proScript: Partially Ordered Scripts Generation via Pre-trained Language Models

no code implementations • 16 Apr 2021 • Keisuke Sakaguchi, Chandra Bhagavatula, Ronan Le Bras, Niket Tandon, Peter Clark, Yejin Choi

Scripts – standardized event sequences describing typical everyday activities – have been shown to help understand narratives by providing expectations, resolving ambiguity, and filling in unstated information.

Text Generation

GrammarTagger: A Multilingual, Minimally-Supervised Grammar Profiler for Language Education

1 code implementation • 7 Apr 2021 • Masato Hagiwara, Joshua Tanner, Keisuke Sakaguchi

We present GrammarTagger, an open-source grammar profiler which, given an input text, identifies grammatical features useful for language education.

A Dataset for Tracking Entities in Open Domain Procedural Text

no code implementations • EMNLP 2020 • Niket Tandon, Keisuke Sakaguchi, Bhavana Dalvi Mishra, Dheeraj Rajagopal, Peter Clark, Michal Guerquin, Kyle Richardson, Eduard Hovy

Our solution is a new task formulation: given just a procedural text as input, the task is to generate a set of state change tuples (entity, attribute, before-state, after-state) for each step, where the entity, attribute, and state values must be predicted from an open vocabulary.

Attribute
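The tuple format in the abstract above can be sketched as a simple record; the field names follow the abstract, while the step and values in the example are hypothetical.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class StateChange:
    """One (entity, attribute, before-state, after-state) tuple
    predicted for a single step of a procedural text."""
    entity: str
    attribute: str
    before: str
    after: str

    def __str__(self):
        return (f"{self.attribute} of {self.entity} was "
                f"{self.before} before and {self.after} afterwards")

# Hypothetical example for a step like "bake the dough":
change = StateChange("dough", "texture", "soft", "firm")
```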

COMET-ATOMIC 2020: On Symbolic and Neural Commonsense Knowledge Graphs

3 code implementations • 12 Oct 2020 • Jena D. Hwang, Chandra Bhagavatula, Ronan Le Bras, Jeff Da, Keisuke Sakaguchi, Antoine Bosselut, Yejin Choi

Next, we show that ATOMIC 2020 is better suited for training knowledge models that can generate accurate, representative knowledge for new, unseen entities and events.

Knowledge Graphs Natural Language Understanding

WIQA: A dataset for "What if..." reasoning over procedural text

1 code implementation • 10 Sep 2019 • Niket Tandon, Bhavana Dalvi Mishra, Keisuke Sakaguchi, Antoine Bosselut, Peter Clark

We introduce WIQA, the first large-scale dataset of "What if..." questions over procedural text.

Multiple-choice

Uncertain Natural Language Inference

no code implementations • ACL 2020 • Tongfei Chen, Zhengping Jiang, Adam Poliak, Keisuke Sakaguchi, Benjamin Van Durme

We introduce Uncertain Natural Language Inference (UNLI), a refinement of Natural Language Inference (NLI) that shifts away from categorical labels, targeting instead the direct prediction of subjective probability assessments.

Learning-To-Rank Natural Language Inference +1

WinoGrande: An Adversarial Winograd Schema Challenge at Scale

3 code implementations • 24 Jul 2019 • Keisuke Sakaguchi, Ronan Le Bras, Chandra Bhagavatula, Yejin Choi

The key steps of the dataset construction consist of (1) a carefully designed crowdsourcing procedure, followed by (2) systematic bias reduction using a novel AfLite algorithm that generalizes human-detectable word associations to machine-detectable embedding associations.

Common Sense Reasoning Coreference Resolution +2

Efficient Online Scalar Annotation with Bounded Support

no code implementations • ACL 2018 • Keisuke Sakaguchi, Benjamin Van Durme

We describe a novel method for efficiently eliciting scalar annotations for dataset construction and system quality estimation by human judgments.

GEC into the future: Where are we going and how do we get there?

no code implementations • WS 2017 • Keisuke Sakaguchi, Courtney Napoles, Joel Tetreault

The field of grammatical error correction (GEC) has made tremendous bounds in the last ten years, but new questions and obstacles are revealing themselves.

Grammatical Error Correction Machine Translation +1

Error-repair Dependency Parsing for Ungrammatical Texts

1 code implementation • ACL 2017 • Keisuke Sakaguchi, Matt Post, Benjamin Van Durme

We propose a new dependency parsing scheme which jointly parses a sentence and repairs grammatical errors by extending the non-directional transition-based formalism of Goldberg and Elhadad (2010) with three additional actions: SUBSTITUTE, DELETE, INSERT.

Dependency Parsing Sentence

JFLEG: A Fluency Corpus and Benchmark for Grammatical Error Correction

1 code implementation • EACL 2017 • Courtney Napoles, Keisuke Sakaguchi, Joel Tetreault

We present a new parallel corpus, JHU FLuency-Extended GUG corpus (JFLEG) for developing and evaluating grammatical error correction (GEC).

Grammatical Error Correction

There's No Comparison: Reference-less Evaluation Metrics in Grammatical Error Correction

1 code implementation • EMNLP 2016 • Courtney Napoles, Keisuke Sakaguchi, Joel Tetreault

We show that reference-less grammaticality metrics correlate very strongly with human judgments and are competitive with the leading reference-based evaluation metrics.

Benchmarking Grammatical Error Correction +1

Robsut Wrod Reocginiton via semi-Character Recurrent Neural Network

1 code implementation • 7 Aug 2016 • Keisuke Sakaguchi, Kevin Duh, Matt Post, Benjamin Van Durme

Inspired by the findings from the Cmabrigde Uinervtisy effect, we propose a word recognition model based on a semi-character level recurrent neural network (scRNN).

Spelling Correction
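The semi-character representation behind scRNN encodes each word as the concatenation of a one-hot vector for its first character, a bag-of-characters count vector for its interior characters, and a one-hot vector for its last character, so words whose interior letters are jumbled map to the same vector. The sketch below assumes a lowercase-only alphabet; the paper's actual input layer may differ.

```python
import string

ALPHABET = string.ascii_lowercase
IDX = {c: i for i, c in enumerate(ALPHABET)}

def semi_character_vector(word):
    """scRNN-style semi-character encoding (3 * 26 dims here):
    [one-hot first char | bag of interior chars | one-hot last char]."""
    n = len(ALPHABET)
    vec = [0.0] * (3 * n)
    word = word.lower()
    vec[IDX[word[0]]] = 1.0              # first character
    for ch in word[1:-1]:                # interior characters, order-free
        vec[n + IDX[ch]] += 1.0
    if len(word) > 1:
        vec[2 * n + IDX[word[-1]]] = 1.0  # last character
    return vec
```

Because the interior is an unordered bag, "Cambridge" and the jumbled "Cmabrigde" produce identical vectors, which is exactly the invariance the Cmabrigde Uinervtisy effect suggests.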

GLEU Without Tuning

1 code implementation • 9 May 2016 • Courtney Napoles, Keisuke Sakaguchi, Matt Post, Joel Tetreault

The GLEU metric was proposed for evaluating grammatical error corrections using n-gram overlap with a set of reference sentences, as opposed to precision/recall of specific annotated errors (Napoles et al., 2015).
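The n-gram-overlap idea described above can be sketched as follows. This is a deliberately simplified, illustrative score in the spirit of GLEU, not the published formula: candidate n-grams are credited when they match the reference, and n-grams the candidate shares with the source but that are absent from the reference are penalized.

```python
from collections import Counter

def ngrams(tokens, n):
    """Multiset of n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

def gleu_sketch(source, candidate, reference, max_n=2):
    """Simplified GLEU-style overlap score (illustrative only):
    reward candidate n-grams found in the reference, penalize those
    kept from the source that the reference dropped."""
    hits, total = 0, 0
    for n in range(1, max_n + 1):
        cand = ngrams(candidate, n)
        ref = ngrams(reference, n)
        src = ngrams(source, n)
        hits += sum((cand & ref).values())          # matched the reference
        hits -= sum(((cand & src) - ref).values())  # kept an uncorrected error
        total += sum(cand.values())
    return max(hits, 0) / max(total, 1)
```

A candidate identical to the reference scores 1.0, while leaving the source unchanged keeps only the n-grams the reference also retains.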

Reassessing the Goals of Grammatical Error Correction: Fluency Instead of Grammaticality

1 code implementation • TACL 2016 • Keisuke Sakaguchi, Courtney Napoles, Matt Post, Joel Tetreault

The field of grammatical error correction (GEC) has grown substantially in recent years, with research directed at both evaluation metrics and improved system performance against those metrics.

Grammatical Error Correction Sentence
