Search Results for author: Daisuke Kawahara

Found 65 papers, 9 papers with code

JGLUE: Japanese General Language Understanding Evaluation

2 code implementations • LREC 2022 • Kentaro Kurihara, Daisuke Kawahara, Tomohide Shibata

We build a Japanese NLU benchmark, JGLUE, from scratch without translation to measure the general NLU ability in Japanese.

FLUE Natural Language Understanding +1

284

Paper
Code

Annotating a Driving Experience Corpus with Behavior and Subjectivity

no code implementations • PACLIC 2018 • Ritsuko Iwai, Daisuke Kawahara, Takatsune Kumada, Sadao Kurohashi

Paper
Add Code

A Method for Building a Commonsense Inference Dataset based on Basic Events

no code implementations • EMNLP 2020 • Kazumasa Omura, Daisuke Kawahara, Sadao Kurohashi

We present a scalable, low-bias, and low-cost method for building a commonsense inference dataset that combines automatic extraction from a corpus and crowdsourcing.

Multiple-choice Transfer Learning

Paper
Add Code

Should We Respect LLMs? A Cross-Lingual Study on the Influence of Prompt Politeness on LLM Performance

no code implementations • 22 Feb 2024 • Ziqi Yin, Hao Wang, Kaito Horio, Daisuke Kawahara, Satoshi Sekine

We investigate the impact of politeness levels in prompts on the performance of large language models (LLMs).

Paper
Add Code

SlideAVSR: A Dataset of Paper Explanation Videos for Audio-Visual Speech Recognition

no code implementations • 18 Jan 2024 • Hao Wang, Shuhei Kurita, Shuichiro Shimizu, Daisuke Kawahara

Audio-visual speech recognition (AVSR) is a multimodal extension of automatic speech recognition (ASR), using video as a complement to audio.

Audio-Visual Speech Recognition Automatic Speech Recognition +4

Paper
Add Code

Exploring Automatic Evaluation Methods based on a Decoder-based LLM for Text Generation

no code implementations • 17 Oct 2023 • Tomohito Kasahara, Daisuke Kawahara

Automatic evaluation of text generation is essential for improving the accuracy of generation tasks.

In-Context Learning Machine Translation +2

Paper
Add Code

PHALM: Building a Knowledge Graph from Scratch by Prompting Humans and a Language Model

1 code implementation • 11 Oct 2023 • Tatsuya Ide, Eiki Murata, Daisuke Kawahara, Takato Yamazaki, Shengzhe Li, Kenta Shinzato, Toshinori Sato

In this paper, we propose PHALM, a method of building a knowledge graph from scratch, by prompting both crowdworkers and a large language model (LLM).

Language Modelling Large Language Model +1

Paper
Code

Kanbun-LM: Reading and Translating Classical Chinese in Japanese Methods by Language Models

1 code implementation • 22 May 2023 • Hao Wang, Hirofumi Shimizu, Daisuke Kawahara

To solve this problem, we construct the first Classical-Chinese-to-Kanbun dataset in the world.

Machine Translation

Paper
Code

Grounding in social media: An approach to building a chit-chat dialogue model

no code implementations • NAACL (ACL) 2022 • Ritvik Choudhary, Daisuke Kawahara

Building open-domain dialogue systems capable of rich human-like conversational ability is one of the fundamental challenges in language generation.

Dialogue Generation

Paper
Add Code

Building a Personalized Dialogue System with Prompt-Tuning

no code implementations • NAACL (ACL) 2022 • Tomohito Kasahara, Daisuke Kawahara, Nguyen Tung, Shengzhe Li, Kenta Shinzato, Toshinori Sato

Dialogue systems without consistent responses are not fascinating.

Paper
Add Code

Generate, Evaluate, and Select: A Dialogue System with a Response Evaluator for Diversity-Aware Response Generation

no code implementations • NAACL (ACL) 2022 • Ryoma Sakaeda, Daisuke Kawahara

We aim to overcome the lack of diversity in responses of current dialogue systems and to develop a dialogue system that is engaging as a conversational partner.

Response Generation

Paper
Add Code

Building a Dialogue Corpus Annotated with Expressed and Experienced Emotions

1 code implementation • ACL 2022 • Tatsuya Ide, Daisuke Kawahara

We hope that the constructed corpus will facilitate the study on emotion recognition in a dialogue and emotion-aware dialogue response generation.

Emotion Recognition Multi-Task Learning +1

Paper
Code

Multi-Task Learning of Generation and Classification for Emotion-Aware Dialogue Response Generation

no code implementations • NAACL 2021 • Tatsuya Ide, Daisuke Kawahara

For a computer to naturally interact with a human, it needs to be human-like.

Multi-Task Learning Response Generation

Paper
Add Code

BERT-based Cohesion Analysis of Japanese Texts

1 code implementation • COLING 2020 • Nobuhiro Ueda, Daisuke Kawahara, Sadao Kurohashi

The meaning of natural language text is supported by cohesion among various kinds of entities, including coreference relations, predicate-argument structures, and bridging anaphora relations.

coreference-resolution

Paper
Code

Reverse Operation based Data Augmentation for Solving Math Word Problems

1 code implementation • 4 Oct 2020 • Qianying Liu, Wenyu Guan, Sujian Li, Fei Cheng, Daisuke Kawahara, Sadao Kurohashi

Automatically solving math word problems is a critical task in the field of natural language processing.

Data Augmentation Math +1

Paper
Code

Minimize Exposure Bias of Seq2Seq Models in Joint Entity and Relation Extraction

1 code implementation • Findings of the Association for Computational Linguistics 2020 • Ranran Haoran Zhang, Qianying Liu, Aysa Xuemo Fan, Heng Ji, Daojian Zeng, Fei Cheng, Daisuke Kawahara, Sadao Kurohashi

We propose a novel Sequence-to-Unordered-Multi-Tree (Seq2UMTree) model to minimize the effects of exposure bias by limiting the decoding length to three within a triplet and removing the order among triplets.

Joint Entity and Relation Extraction Relation

Paper
Code

A System for Worldwide COVID-19 Information Aggregation

no code implementations • EMNLP (NLP-COVID19) 2020 • Akiko Aizawa, Frederic Bergeron, Junjie Chen, Fei Cheng, Katsuhiko Hayashi, Kentaro Inui, Hiroyoshi Ito, Daisuke Kawahara, Masaru Kitsuregawa, Hirokazu Kiyomaru, Masaki Kobayashi, Takashi Kodama, Sadao Kurohashi, Qianying Liu, Masaki Matsubara, Yusuke Miyao, Atsuyuki Morishima, Yugo Murawaki, Kazumasa Omura, Haiyue Song, Eiichiro Sumita, Shinji Suzuki, Ribeka Tanaka, Yu Tanaka, Masashi Toyoda, Nobuhiro Ueda, Honai Ueoka, Masao Utiyama, Ying Zhong

The global pandemic of COVID-19 has made the public pay close attention to related news, covering various domains, such as sanitation, treatment, and effects on education.

Machine Translation Translation

Paper
Add Code

Building a Japanese Typo Dataset from Wikipedia's Revision History

no code implementations • ACL 2020 • Yu Tanaka, Yugo Murawaki, Daisuke Kawahara, Sadao Kurohashi

User generated texts contain many typos for which correction is necessary for NLP systems to work.

Paper
Add Code

Development of a Japanese Personality Dictionary based on Psychological Methods

no code implementations • LREC 2020 • Ritsuko Iwai, Daisuke Kawahara, Takatsune Kumada, Sadao Kurohashi

In this study, we collect personality words, using word embeddings, and construct a personality dictionary with weights for Big Five traits.

Word Embeddings

Paper
Add Code

Acquiring Social Knowledge about Personality and Driving-related Behavior

no code implementations • LREC 2020 • Ritsuko Iwai, Daisuke Kawahara, Takatsune Kumada, Sadao Kurohashi

Using them, we automatically extracted collocations between personality descriptors and driving-related behavior from a driving behavior and subjectivity corpus (1, 803, 328 sentences after filtering) and obtained unique 5, 334 collocations.

Paper
Add Code

Tree-structured Decoding for Solving Math Word Problems

no code implementations • IJCNLP 2019 • Qianying Liu, Wenyv Guan, Sujian Li, Daisuke Kawahara

To address this problem, we propose a tree-structured decoding method that generates the abstract syntax tree of the equation in a top-down manner.

Math

Paper
Add Code

Machine Comprehension Improves Domain-Specific Japanese Predicate-Argument Structure Analysis

no code implementations • WS 2019 • Norio Takahashi, Tomohide Shibata, Daisuke Kawahara, Sadao Kurohashi

To improve the accuracy of predicate-argument structure (PAS) analysis, large-scale training data and knowledge for PAS analysis are indispensable.

Reading Comprehension

Paper
Add Code

Diversity-aware Event Prediction based on a Conditional Variational Autoencoder with Reconstruction

no code implementations • WS 2019 • Hirokazu Kiyomaru, Kazumasa Omura, Yugo Murawaki, Daisuke Kawahara, Sadao Kurohashi

Typical event sequences are an important class of commonsense knowledge.

Paper
Add Code

Applying Machine Translation to Psychology: Automatic Translation of Personality Adjectives

no code implementations • WS 2019 • Ritsuko Iwai, Daisuke Kawahara, Takatsune Kumada, Sadao Kurohashi

Machine Translation Translation

Paper
Add Code

Shrinking Japanese Morphological Analyzers With Neural Networks and Semi-supervised Learning

no code implementations • NAACL 2019 • Arseny Tolmachev, Daisuke Kawahara, Sadao Kurohashi

Morphological analyzers are trained on data hand-annotated with segmentation boundaries and part of speech tags.

Chinese Word Segmentation Morphological Analysis +2

Paper
Add Code

Juman++: A Morphological Analysis Toolkit for Scriptio Continua

1 code implementation • EMNLP 2018 • Arseny Tolmachev, Daisuke Kawahara, Sadao Kurohashi

We present a three-part toolkit for developing morphological analyzers for languages without natural word boundaries.

Art Analysis Language Modelling +2

365

Paper
Code

Cross-lingual Knowledge Projection Using Machine Translation and Target-side Knowledge Base Completion

1 code implementation • COLING 2018 • Naoki Otani, Hirokazu Kiyomaru, Daisuke Kawahara, Sadao Kurohashi

Considerable effort has been devoted to building commonsense knowledge bases.

Knowledge Base Completion Machine Translation +1

Paper
Code

Neural Adversarial Training for Semi-supervised Japanese Predicate-argument Structure Analysis

no code implementations • ACL 2018 • Shuhei Kurita, Daisuke Kawahara, Sadao Kurohashi

Japanese predicate-argument structure (PAS) analysis involves zero anaphora resolution, which is notoriously difficult.

Paper
Add Code

Knowledge-enriched Two-layered Attention Network for Sentiment Analysis

no code implementations • NAACL 2018 • Abhishek Kumar, Daisuke Kawahara, Sadao Kurohashi

We propose a novel two-layered attention network based on Bidirectional Long Short-Term Memory for sentiment analysis.

Knowledge Graph Embedding Sentiment Analysis +1

Paper
Add Code

Comprehensive Annotation of Various Types of Temporal Information on the Time Axis

no code implementations • LREC 2018 • Tomohiro Sakaguchi, Daisuke Kawahara, Sadao Kurohashi

Common Sense Reasoning

Paper
Add Code

JFCKB: Japanese Feature Change Knowledge Base

no code implementations • LREC 2018 • Tetsuaki Nakamura, Daisuke Kawahara

Common Sense Reasoning

Paper
Add Code

Improving Crowdsourcing-Based Annotation of Japanese Discourse Relations

no code implementations • LREC 2018 • Yudai Kishimoto, Shinnosuke Sawada, Yugo Murawaki, Daisuke Kawahara, Sadao Kurohashi

Paper
Add Code

JDCFC: A Japanese Dialogue Corpus with Feature Changes

no code implementations • LREC 2018 • Tetsuaki Nakamura, Daisuke Kawahara

Dialogue Understanding

Paper
Add Code

Automatically Acquired Lexical Knowledge Improves Japanese Joint Morphological and Dependency Analysis

no code implementations • WS 2017 • Daisuke Kawahara, Yuta Hayashibe, Hajime Morita, Sadao Kurohashi

This paper presents a joint model for morphological and dependency analysis based on automatically acquired lexical knowledge.

Lemmatization Morphological Analysis +2

Paper
Add Code

Neural Joint Model for Transition-based Chinese Syntactic Analysis

no code implementations • ACL 2017 • Shuhei Kurita, Daisuke Kawahara, Sadao Kurohashi

We present neural network-based joint models for Chinese word segmentation, POS tagging and dependency parsing.

Chinese Word Segmentation Dependency Parsing +4

Paper
Add Code

Improving Chinese Semantic Role Labeling using High-quality Surface and Deep Case Frames

no code implementations • EACL 2017 • Gongye Jin, Daisuke Kawahara, Sadao Kurohashi

To compensate the deficiency of the surface case frames, we compile deep case frames from automatic semantic roles.

Chinese Semantic Role Labeling Dependency Parsing +4

Paper
Add Code

Reading Comprehension using Entity-based Memory Network

no code implementations • 12 Dec 2016 • Xun Wang, Katsuhito Sudoh, Masaaki Nagata, Tomohide Shibata, Daisuke Kawahara, Sadao Kurohashi

This paper introduces a novel neural network model for question answering, the \emph{entity-based memory network}.

Question Answering Reading Comprehension

Paper
Add Code

SCTB: A Chinese Treebank in Scientific Domain

no code implementations • WS 2016 • Chenhui Chu, Toshiaki Nakazawa, Daisuke Kawahara, Sadao Kurohashi

Treebanks are curial for natural language processing (NLP).

Chinese Word Segmentation Machine Translation +1

Paper
Add Code

Consistent Word Segmentation, Part-of-Speech Tagging and Dependency Labelling Annotation for Chinese Language

no code implementations • COLING 2016 • Mo Shen, Wingmui Li, HyunJeong Choe, Chenhui Chu, Daisuke Kawahara, Sadao Kurohashi

In this paper, we propose a new annotation approach to Chinese word segmentation, part-of-speech (POS) tagging and dependency labelling that aims to overcome the two major issues in traditional morphology-based annotation: Inconsistency and data sparsity.

Chinese Word Segmentation Machine Translation +6