Search Results for author: Yusuke Miyao

Found 105 papers, 25 papers with code

Towards Grounding of Formulae

no code implementations EMNLP (sdp) 2020 Takuto Asakura, André Greiner-Petter, Akiko Aizawa, Yusuke Miyao

Our results indicate that it is worthwhile to grow the techniques for the proposed task to contribute to the further progress of mathematical language processing.

Information Retrieval Retrieval

Bayesian Argumentation-Scheme Networks: A Probabilistic Model of Argument Validity Facilitated by Argumentation Schemes

no code implementations EMNLP (ArgMining) 2021 Takahiro Kondo, Koki Washio, Katsuhiko Hayashi, Yusuke Miyao

We propose a methodology for representing the reasoning structure of arguments using Bayesian networks and predicate logic facilitated by argumentation schemes.

Building Dataset for Grounding of Formulae — Annotating Coreference Relations Among Math Identifiers

1 code implementation LREC 2022 Takuto Asakura, Yusuke Miyao, Akiko Aizawa

Therefore, coreference relations between symbols need to be identified for grounding, and the task has aspects of both description alignment and coreference analysis.

Math

Collection and Analysis of Travel Agency Task Dialogues with Age-Diverse Speakers

no code implementations LREC 2022 Michimasa Inaba, Yuya Chiba, Ryuichiro Higashinaka, Kazunori Komatani, Yusuke Miyao, Takayuki Nagai

This paper provides details of the dialogue task, the collection procedure and annotations, and the analysis on the characteristics of the dialogues and facial expressions focusing on the age of the speakers.

Development of a Multilingual CCG Treebank via Universal Dependencies Conversion

no code implementations LREC 2022 Tu-Anh Tran, Yusuke Miyao

This paper introduces an algorithm to convert Universal Dependencies (UD) treebanks to Combinatory Categorial Grammar (CCG) treebanks.

Generating Racing Game Commentary from Vision, Language, and Structured Data

no code implementations INLG (ACL) 2021 Tatsuya Ishigaki, Goran Topic, Yumi Hamazono, Hiroshi Noji, Ichiro Kobayashi, Yusuke Miyao, Hiroya Takamura

In this study, we introduce a new large-scale dataset that contains aligned video data, structured numerical data, and transcribed commentaries that consist of 129, 226 utterances in 1, 389 races in a game.

Modeling Syntactic-Semantic Dependency Correlations in Semantic Role Labeling Using Mixture Models

1 code implementation ACL 2022 Junjie Chen, Xiangheng He, Yusuke Miyao

In this paper, we propose a mixture model-based end-to-end method to model the syntactic-semantic dependency correlation in Semantic Role Labeling (SRL).

Semantic Role Labeling

Exploring the Effect of Segmentation and Vocabulary Size on Speech Tokenization for Speech Language Models

no code implementations23 May 2025 Shunsuke Kando, Yusuke Miyao, Shinnosuke Takamichi

The purpose of speech tokenization is to transform a speech signal into a sequence of discrete representations, serving as the foundation for speech language models (SLMs).

Speech Tokenization

How LLMs Learn: Tracing Internal Representations with Sparse Autoencoders

1 code implementation9 Mar 2025 Tatsuro Inaba, Kentaro Inui, Yusuke Miyao, Yohei Oseki, Benjamin Heinzerling, Yu Takagi

Large Language Models (LLMs) demonstrate remarkable multilingual capabilities and broad knowledge.

A Statistical and Multi-Perspective Revisiting of the Membership Inference Attack in Large Language Models

no code implementations18 Dec 2024 Bowen Chen, Namgi Han, Yusuke Miyao

The lack of data transparency in Large Language Models (LLMs) has highlighted the importance of Membership Inference Attack (MIA), which differentiates trained (member) and untrained (non-member) data.

Inference Attack Membership Inference Attack

Does it Chug? Towards a Data-Driven Understanding of Guitar Tone Description

1 code implementation16 Dec 2024 Pratik Sutar, Jason Naradowsky, Yusuke Miyao

In this work, we pursue a data-driven approach to further our understanding of such adjectives in the context of guitar tone.

Improving Unsupervised Constituency Parsing via Maximizing Semantic Information

1 code implementation3 Oct 2024 Junjie Chen, Xiangheng He, Yusuke Miyao, Danushka Bollegala

In this paper, we introduce a novel objective for training unsupervised parsers: maximizing the information between constituent structures and sentence semantics (SemInfo).

Constituency Parsing Sentence

GADFA: Generator-Assisted Decision-Focused Approach for Opinion Expressing Timing Identification

no code implementations2 Oct 2024 Chung-Chi Chen, Hiroya Takamura, Ichiro Kobayashi, Yusuke Miyao, Hsin-Hsi Chen

To address this deficit, our study introduces an innovative task - the identification of news-triggered opinion expressing timing.

Opinion Mining Text Generation

Hierarchical Organization Simulacra in the Investment Sector

no code implementations1 Oct 2024 Chung-Chi Chen, Hiroya Takamura, Ichiro Kobayashi, Yusuke Miyao

This paper explores designing artificial organizations with professional behavior in investments using a multi-agent simulation.

Articles Decision Making

Enhancing Financial Sentiment Analysis with Expert-Designed Hint

no code implementations26 Sep 2024 Chung-Chi Chen, Hiroya Takamura, Ichiro Kobayashi, Yusuke Miyao

This paper investigates the role of expert-designed hint in enhancing sentiment analysis on financial social media posts.

Sentiment Analysis

Enhancing Investment Opinion Ranking through Argument-Based Sentiment Analysis

no code implementations25 Sep 2024 Chung-Chi Chen, Hen-Hsen Huang, Hsin-Hsi Chen, Hiroya Takamura, Ichiro Kobayashi, Yusuke Miyao

Our research introduces a dual-pronged argument mining technique to improve recommendation system effectiveness, considering both professional and amateur investor perspectives.

Argument Mining Sentiment Analysis

Analyzing Correlations Between Intrinsic and Extrinsic Bias Metrics of Static Word Embeddings With Their Measuring Biases Aligned

no code implementations14 Sep 2024 Taisei Katô, Yusuke Miyao

An intrinsic bias metric measures bias by examining a characteristic of vectors, while an extrinsic bias metric checks whether an NLP system trained with a word embedding is biased.

Word Embeddings

Self-Emotion Blended Dialogue Generation in Social Simulation Agents

no code implementations3 Aug 2024 Qiang Zhang, Jason Naradowsky, Yusuke Miyao

When engaging in conversations, dialogue agents in a virtual simulation environment may exhibit their own emotional states that are unrelated to the immediate conversational context, a phenomenon known as self-emotion.

Decision Making Dialogue Generation +3

LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs

no code implementations4 Jul 2024 LLM-jp, :, Akiko Aizawa, Eiji Aramaki, Bowen Chen, Fei Cheng, Hiroyuki Deguchi, Rintaro Enomoto, Kazuki Fujii, Kensuke Fukumoto, Takuya Fukushima, Namgi Han, Yuto Harada, Chikara Hashimoto, Tatsuya Hiraoka, Shohei Hisada, Sosuke Hosokawa, Lu Jie, Keisuke Kamata, Teruhito Kanazawa, Hiroki Kanezashi, Hiroshi Kataoka, Satoru Katsumata, Daisuke Kawahara, Seiya Kawano, Atsushi Keyaki, Keisuke Kiryu, Hirokazu Kiyomaru, Takashi Kodama, Takahiro Kubo, Yohei Kuga, Ryoma Kumon, Shuhei Kurita, Sadao Kurohashi, Conglong Li, Taiki Maekawa, Hiroshi Matsuda, Yusuke Miyao, Kentaro Mizuki, Sakae Mizuki, Yugo Murawaki, Akim Mousterou, Ryo Nakamura, Taishi Nakamura, Kouta Nakayama, Tomoka Nakazato, Takuro Niitsuma, Jiro Nishitoba, Yusuke Oda, Hayato Ogawa, Takumi Okamoto, Naoaki Okazaki, Yohei Oseki, Shintaro Ozaki, Koki Ryu, Rafal Rzepka, Keisuke Sakaguchi, Shota Sasaki, Satoshi Sekine, Kohei Suda, Saku Sugawara, Issa Sugiura, Hiroaki Sugiyama, Hisami Suzuki, Jun Suzuki, Toyotaro Suzumura, Kensuke Tachibana, Yu Takagi, Kyosuke Takami, Koichi Takeda, Masashi Takeshita, Masahiro Tanaka, Kenjiro Taura, Arseny Tolmachev, Nobuhiro Ueda, Zhen Wan, Shuntaro Yada, Sakiko Yahata, Yuya Yamamoto, Yusuke Yamauchi, Hitomi Yanaka, Rio Yokota, Koichiro Yoshino

This paper introduces LLM-jp, a cross-organizational project for the research and development of Japanese large language models (LLMs).

FinGen: A Dataset for Argument Generation in Finance

no code implementations31 May 2024 Chung-Chi Chen, Hiroya Takamura, Ichiro Kobayashi, Yusuke Miyao

Based on our empirical results, we further point out several unresolved issues and challenges in this research direction.

A Multi-Perspective Analysis of Memorization in Large Language Models

no code implementations19 May 2024 Bowen Chen, Namgi Han, Yusuke Miyao

One of those behaviors is memorization, in which LLMs can generate the same content used to train them.

Memorization

Unsupervised Parsing by Searching for Frequent Word Sequences among Sentences with Equivalent Predicate-Argument Structures

no code implementations18 Apr 2024 Junjie Chen, Xiangheng He, Danushka Bollegala, Yusuke Miyao

Linguists identify the constituent by evaluating a set of Predicate-Argument Structure (PAS) equivalent sentences where we find the constituent appears more frequently than non-constituents (i. e., the constituent corresponds to a frequent word sequence within the sentence set).

Constituency Parsing Sentence

Mind the Gap Between Conversations for Improved Long-Term Dialogue Generation

1 code implementation24 Oct 2023 Qiang Zhang, Jason Naradowsky, Yusuke Miyao

Knowing how to end and resume conversations over time is a natural part of communication, allowing for discussions to span weeks, months, or years.

Dialogue Generation

Ask an Expert: Leveraging Language Models to Improve Strategic Reasoning in Goal-Oriented Dialogue Models

1 code implementation29 May 2023 Qiang Zhang, Jason Naradowsky, Yusuke Miyao

We propose the "Ask an Expert" framework in which the model is trained with access to an "expert" which it can consult at each turn.

StoryER: Automatic Story Evaluation via Ranking, Rating and Reasoning

1 code implementation16 Oct 2022 Hong Chen, Duc Minh Vo, Hiroya Takamura, Yusuke Miyao, Hideki Nakayama

Existing automatic story evaluation methods place a premium on story lexical level coherence, deviating from human preference.

Comment Generation Decoder

Towards Parameter-Efficient Integration of Pre-Trained Language Models In Temporal Video Grounding

1 code implementation26 Sep 2022 Erica K. Shimomoto, Edison Marrese-Taylor, Hiroya Takamura, Ichiro Kobayashi, Hideki Nakayama, Yusuke Miyao

This paper explores the task of Temporal Video Grounding (TVG) where, given an untrimmed video and a natural language sentence query, the goal is to recognize and determine temporal boundaries of action instances in the video described by the query.

Benchmarking Natural Language Queries +2

Rethinking Offensive Text Detection as a Multi-Hop Reasoning Problem

1 code implementation Findings (ACL) 2022 Qiang Zhang, Jason Naradowsky, Yusuke Miyao

We introduce the task of implicit offensive text detection in dialogues, where a statement may have either an offensive or non-offensive interpretation, depending on the listener and context.

Text Detection

Multilingual Syntax-aware Language Modeling through Dependency Tree Conversion

no code implementations spnlp (ACL) 2022 Shunsuke Kando, Hiroshi Noji, Yusuke Miyao

On average, the performance of our best model represents a 19 \% increase in accuracy over the worst choice across all languages.

Language Modeling Language Modelling

Code Generation for Unknown Libraries via Reading API Documentations

no code implementations16 Feb 2022 Koki Washio, Yusuke Miyao

Moreover, to evaluate code generation for unknown libraries and our framework, we extend an existing dataset of open-domain code generation and resplit it so that the evaluation data consist of only examples using the libraries that do not appear in the training data.

Code Generation Decoder

Learning with Contrastive Examples for Data-to-Text Generation

1 code implementation COLING 2020 Yui Uehara, Tatsuya Ishigaki, Kasumi Aoki, Hiroshi Noji, Keiichi Goshima, Ichiro Kobayashi, Hiroya Takamura, Yusuke Miyao

Existing models for data-to-text tasks generate fluent but sometimes incorrect sentences e. g., {``}Nikkei gains{''} is generated when {``}Nikkei drops{''} is expected.

Comment Generation Data-to-Text Generation

An empirical analysis of existing systems and datasets toward general simple question answering

1 code implementation COLING 2020 Namgi Han, Goran Topic, Hiroshi Noji, Hiroya Takamura, Yusuke Miyao

Our analysis, including shifting of training and test datasets and training on a union of the datasets, suggests that our progress in solving SimpleQuestions dataset does not indicate the success of more general simple question answering.

Natural Language Understanding Question Answering

Predicting Event Time by Classifying Sub-Level Temporal Relations Induced from a Unified Representation of Time Anchors

no code implementations14 Aug 2020 Fei Cheng, Yusuke Miyao

Another contribution of this work is to construct a larger event time corpus (256 news documents) with a reasonable Inter-Annotator Agreement (IAA), for the purpose of overcoming the data shortage of the existing event time corpus (36 news documents).

Articles Multi-Label Classification +1

Analyzing Word Embedding Through Structural Equation Modeling

no code implementations LREC 2020 Namgi Han, Katsuhiko Hayashi, Yusuke Miyao

Many researchers have tried to predict the accuracies of extrinsic evaluation by using intrinsic evaluation to evaluate word embedding.

Word Embeddings

Does My Rebuttal Matter? Insights from a Major NLP Conference

1 code implementation NAACL 2019 Yang Gao, Steffen Eger, Ilia Kuznetsov, Iryna Gurevych, Yusuke Miyao

We then focus on the role of the rebuttal phase, and propose a novel task to predict after-rebuttal (i. e., final) scores from initial reviews and author responses.

4k

Generating Market Comments Referring to External Resources

1 code implementation WS 2018 Tatsuya Aoki, Akira Miyazawa, Tatsuya Ishigaki, Keiichi Goshima, Kasumi Aoki, Ichiro Kobayashi, Hiroya Takamura, Yusuke Miyao

Comments on a stock market often include the reason or cause of changes in stock prices, such as {``}Nikkei turns lower as yen{'}s rise hits exporters.

Text Generation

Coordinate Structures in Universal Dependencies for Head-final Languages

no code implementations WS 2018 Hiroshi Kanayama, Na-Rae Han, Masayuki Asahara, Jena D. Hwang, Yusuke Miyao, Jinho D. Choi, Yuji Matsumoto

This paper discusses the representation of coordinate structures in the Universal Dependencies framework for two head-final languages, Japanese and Korean.

Consensus-based Sequence Training for Video Captioning

no code implementations27 Dec 2017 Sang Phan, Gustav Eje Henter, Yusuke Miyao, Shin'ichi Satoh

First we show that, by replacing model samples with ground-truth sentences, RL training can be seen as a form of weighted cross-entropy loss, giving a fast, RL-based pre-training algorithm.

Reinforcement Learning Reinforcement Learning (RL) +1

Classifying Temporal Relations by Bidirectional LSTM over Dependency Paths

no code implementations ACL 2017 Fei Cheng, Yusuke Miyao

In this work, we borrow a state-of-the-art method in relation extraction by adopting bidirectional long short-term memory (Bi-LSTM) along dependency paths (DP).

General Classification Question Answering +4

Video Event Detection by Exploiting Word Dependencies from Image Captions

no code implementations COLING 2016 Sang Phan, Yusuke Miyao, Duy-Dinh Le, Shin{'}ichi Satoh

We conduct extensive experiments to analyze the effectiveness of using the new dependency representation for event detection on two large-scale TRECVID Multimedia Event Detection 2013 and 2014 datasets.

Action Detection Event Detection +2

Universal Dependencies for Japanese

no code implementations LREC 2016 Takaaki Tanaka, Yusuke Miyao, Masayuki Asahara, Sumire Uematsu, Hiroshi Kanayama, Shinsuke Mori, Yuji Matsumoto

We present an attempt to port the international syntactic annotation scheme, Universal Dependencies, to the Japanese language in this paper.

Typed Entity and Relation Annotation on Computer Science Papers

1 code implementation LREC 2016 Yuka Tateisi, Tomoko Ohta, Sampo Pyysalo, Yusuke Miyao, Akiko Aizawa

In our scheme, mentions of entities are annotated with ontology-based types, and the roles of the entities are annotated as relations with other entities described in the text.

Articles Relation

Challenges and Solutions for Consistent Annotation of Vietnamese Treebank

no code implementations LREC 2016 Quy Nguyen, Yusuke Miyao, Ha Le, Ngan Nguyen

However, the quality of this treebank is not satisfactory and is a possible source for the low performance of Vietnamese language processing.

Part-Of-Speech Tagging speech-recognition +1

Annotation of Computer Science Papers for Semantic Relation Extrac-tion

no code implementations LREC 2014 Yuka Tateisi, Yo Shidahara, Yusuke Miyao, Akiko Aizawa

We designed a new annotation scheme for formalising relation structures in research papers, through the investigation of computer science papers.

Information Retrieval Relation +2

Annotating Factive Verbs

no code implementations LREC 2012 Alvin Grissom II, Yusuke Miyao

These embedded presuppositions provide implicit information about facts assumed to be true in the world, and are thus potentially valuable in areas of research such as textual entailment.

Natural Language Inference

Cannot find the paper you are looking for? You can Submit a new open access paper.