Search Results for author: Masaaki Nagata

Found 87 papers, 9 papers with code

SODA: Story Oriented Dense Video Captioning Evaluation Framework

1 code implementation • ECCV 2020 • Soichiro Fujita, Tsutomu Hirao, Hidetaka Kamigaito, Manabu Okumura, Masaaki Nagata

This paper proposes a new evaluation framework, Story Oriented Dense video cAptioning evaluation framework (SODA), for measuring the performance of video story description systems.

Dense Video Captioning

Paper
Code

Findings of the 2021 Conference on Machine Translation (WMT21)

no code implementations • WMT (EMNLP) 2021 • Farhad Akhbardeh, Arkady Arkhangorodsky, Magdalena Biesialska, Ondřej Bojar, Rajen Chatterjee, Vishrav Chaudhary, Marta R. Costa-Jussa, Cristina España-Bonet, Angela Fan, Christian Federmann, Markus Freitag, Yvette Graham, Roman Grundkiewicz, Barry Haddow, Leonie Harter, Kenneth Heafield, Christopher Homan, Matthias Huck, Kwabena Amponsah-Kaakyire, Jungo Kasai, Daniel Khashabi, Kevin Knight, Tom Kocmi, Philipp Koehn, Nicholas Lourie, Christof Monz, Makoto Morishita, Masaaki Nagata, Ajay Nagesh, Toshiaki Nakazawa, Matteo Negri, Santanu Pal, Allahsera Auguste Tapo, Marco Turchi, Valentin Vydrin, Marcos Zampieri

This paper presents the results of the newstranslation task, the multilingual low-resourcetranslation for Indo-European languages, thetriangular translation task, and the automaticpost-editing task organised as part of the Con-ference on Machine Translation (WMT) 2021. In the news task, participants were asked tobuild machine translation systems for any of10 language pairs, to be evaluated on test setsconsisting mainly of news stories.

Machine Translation Translation

Paper
Add Code

Word Rewarding for Adequate Neural Machine Translation

no code implementations • IWSLT (EMNLP) 2018 • Yuto Takebayashi, Chu Chenhui, Yuki Arase†, Masaaki Nagata

To improve the translation adequacy in neural machine translation (NMT), we propose a rewarding model with target word prediction using bilingual dictionaries inspired by the success of decoder constraints in statistical machine translation.

Machine Translation NMT +1

Paper
Add Code

Reducing Odd Generation from Neural Headline Generation

no code implementations • PACLIC 2018 • Shun Kiyono, Sho Takase, Jun Suzuki, Naoaki Okazaki, Kentaro Inui, Masaaki Nagata

Headline Generation

Paper
Add Code

WSPAlign: Word Alignment Pre-training via Large-Scale Weakly Supervised Span Prediction

2 code implementations • 9 Jun 2023 • Qiyu Wu, Masaaki Nagata, Yoshimasa Tsuruoka

Most existing word alignment methods rely on manual alignment datasets or parallel corpora, which limits their usefulness.

Word Alignment

Paper
Code

Domain Adaptation of Machine Translation with Crowdworkers

no code implementations • 28 Oct 2022 • Makoto Morishita, Jun Suzuki, Masaaki Nagata

With the collected parallel data, we can quickly adapt a machine translation model to the target domain.

Domain Adaptation Machine Translation +1

Paper
Add Code

A Simple and Strong Baseline for End-to-End Neural RST-style Discourse Parsing

1 code implementation • 15 Oct 2022 • Naoki Kobayashi, Tsutomu Hirao, Hidetaka Kamigaito, Manabu Okumura, Masaaki Nagata

To promote and further develop RST-style discourse parsing models, we need a strong baseline that can be regarded as a reference for reporting reliable experimental results.

Ranked #1 on Discourse Parsing on Instructional-DT (Instr-DT)

Discourse Parsing

Paper
Code

Extending Word-Level Quality Estimation for Post-Editing Assistance

no code implementations • 23 Sep 2022 • Yizhen Wei, Takehito Utsuro, Masaaki Nagata

Based on extended word alignment, we further propose a novel task called refined word-level QE that outputs refined tags and word-level correspondences.

Word Alignment XLM-R

Paper
Add Code

JParaCrawl v3.0: A Large-scale English-Japanese Parallel Corpus

no code implementations • LREC 2022 • Makoto Morishita, Katsuki Chousa, Jun Suzuki, Masaaki Nagata

Most current machine translation models are mainly trained with parallel corpora, and their translation accuracy largely depends on the quality and quantity of the corpora.

Machine Translation Sentence +1

Paper
Add Code

Zero Pronouns Identification based on Span prediction

no code implementations • ACL 2021 • Sei Iwata, Taro Watanabe, Masaaki Nagata

In the experiments, our model surpassed the sequence labeling baseline.

Paper
Add Code

Improving Neural RST Parsing Model with Silver Agreement Subtrees

no code implementations • NAACL 2021 • Naoki Kobayashi, Tsutomu Hirao, Hidetaka Kamigaito, Manabu Okumura, Masaaki Nagata

We then pre-train a neural RST parser with the obtained silver data and fine-tune it on the RST-DT.

Ranked #2 on Discourse Parsing on RST-DT (using extra training data)

Discourse Parsing Relation

Paper
Add Code

Context-aware Neural Machine Translation with Mini-batch Embedding

1 code implementation • EACL 2021 • Makoto Morishita, Jun Suzuki, Tomoharu Iwata, Masaaki Nagata

It is crucial to provide an inter-sentence context in Neural Machine Translation (NMT) models for higher-quality translation.

Machine Translation NMT +2

Paper
Code

SpanAlign: Sentence Alignment Method based on Cross-Language Span Prediction and ILP

1 code implementation • COLING 2020 • Katsuki Chousa, Masaaki Nagata, Masaaki Nishino

In particular, our method improved by +53. 9 F1 scores for extracting non-parallel sentences.

Machine Translation Sentence +2

Paper
Code

Findings of the 2020 Conference on Machine Translation (WMT20)

no code implementations • EMNLP 2020 • Loïc Barrault, Magdalena Biesialska, Ondřej Bojar, Marta R. Costa-jussà, Christian Federmann, Yvette Graham, Roman Grundkiewicz, Barry Haddow, Matthias Huck, Eric Joanis, Tom Kocmi, Philipp Koehn, Chi-kiu Lo, Nikola Ljubešić, Christof Monz, Makoto Morishita, Masaaki Nagata, Toshiaki Nakazawa, Santanu Pal, Matt Post, Marcos Zampieri

In the news task, participants were asked to build machine translation systems for any of 11 language pairs, to be evaluated on test sets consisting mainly of news stories.

Machine Translation Translation

Paper
Add Code

Sequential Span Classification with Neural Semi-Markov CRFs for Biomedical Abstracts

no code implementations • Findings of the Association for Computational Linguistics 2020 • Kosuke Yamada, Tsutomu Hirao, Ryohei Sasano, Koichi Takeda, Masaaki Nagata

Dividing biomedical abstracts into several segments with rhetorical roles is essential for supporting researchers{'} information access in the biomedical domain.

Sentence Sentence Classification

Paper
Add Code

University of Tsukuba's Machine Translation System for IWSLT20 Open Domain Translation Task

no code implementations • WS 2020 • Hongyi Cui, Yizhen Wei, Shohei Iida, Takehito Utsuro, Masaaki Nagata

In this paper, we introduce University of Tsukuba{'}s submission to the IWSLT20 Open Domain Translation Task.

Machine Translation Translation

Paper
Add Code

A Test Set for Discourse Translation from Japanese to English

no code implementations • LREC 2020 • Masaaki Nagata, Makoto Morishita

We improved the translation accuracy using context-aware neural machine translation, and the improvement mainly reflects the betterment of the translation of zero pronouns.

Machine Translation Sentence +1

Paper
Add Code

A Supervised Word Alignment Method based on Cross-Language Span Prediction using Multilingual BERT

no code implementations • EMNLP 2020 • Masaaki Nagata, Chousa Katsuki, Masaaki Nishino

For example, we achieved an F1 score of 86. 7 for the Chinese-English data, which is 13. 3 points higher than the previous state-of-the-art supervised methods.

Question Answering Sentence +1

Paper
Add Code

Bilingual Text Extraction as Reading Comprehension

no code implementations • 29 Apr 2020 • Katsuki Chousa, Masaaki Nagata, Masaaki Nishino

We also conduct a sentence alignment experiment using En-Ja newspaper articles and find that the proposed method using multilingual BERT achieves significantly better accuracy than a baseline method using a bilingual dictionary and dynamic programming.

Reading Comprehension Sentence +1

Paper
Add Code

Top-Down RST Parsing Utilizing Granularity Levels in Documents

1 code implementation • 3 Apr 2020 • Naoki Kobayashi, Tsutomu Hirao, Hidetaka Kamigaito, Manabu Okumura, Masaaki Nagata

To obtain better discourse dependency trees, we need to improve the accuracy of RST trees at the upper parts of the structures.

Ranked #3 on Discourse Parsing on RST-DT

Discourse Parsing Relation

Paper
Code

JParaCrawl: A Large Scale Web-Based English-Japanese Parallel Corpus

no code implementations • LREC 2020 • Makoto Morishita, Jun Suzuki, Masaaki Nagata

We constructed a parallel corpus for English-Japanese, for which the amount of publicly available parallel corpora is still limited.

Machine Translation Sentence +1

Paper
Add Code

Mixed Multi-Head Self-Attention for Neural Machine Translation

no code implementations • WS 2019 • Hongyi Cui, Shohei Iida, Po-Hsuan Hung, Takehito Utsuro, Masaaki Nagata

Recently, the Transformer becomes a state-of-the-art architecture in the filed of neural machine translation (NMT).

Machine Translation NMT +1

Paper
Add Code

Split or Merge: Which is Better for Unsupervised RST Parsing?

no code implementations • IJCNLP 2019 • Naoki Kobayashi, Tsutomu Hirao, Kengo Nakamura, Hidetaka Kamigaito, Manabu Okumura, Masaaki Nagata

The first one builds the optimal tree in terms of a dissimilarity score function that is defined for splitting a text span into smaller ones.

Paper
Add Code

Generating Natural Anagrams: Towards Language Generation Under Hard Combinatorial Constraints

no code implementations • IJCNLP 2019 • Masaaki Nishino, Sho Takase, Tsutomu Hirao, Masaaki Nagata

An anagram is a sentence or a phrase that is made by permutating the characters of an input sentence or a phrase.

Sentence Text Generation

Paper
Add Code

Context-aware Neural Machine Translation with Coreference Information

no code implementations • WS 2019 • Takumi Ohtani, Hidetaka Kamigaito, Masaaki Nagata, Manabu Okumura

We present neural machine translation models for translating a sentence in a text by using a graph-based encoder which can consider coreference relations provided within the text explicitly.

Machine Translation Sentence +1

Paper
Add Code

NTT Neural Machine Translation Systems at WAT 2019

no code implementations • WS 2019 • Makoto Morishita, Jun Suzuki, Masaaki Nagata

In this paper, we describe our systems that were submitted to the translation shared tasks at WAT 2019.

Machine Translation Translation

Paper
Add Code

Selecting Informative Context Sentence by Forced Back-Translation

no code implementations • WS 2019 • Ryuichiro Kimura, Shohei Iida, Hongyi Cui, Po-Hsuan Hung, Takehito Utsuro, Masaaki Nagata

Sentence Translation

Paper
Add Code

A Multi-Hop Attention for RNN based Neural Machine Translation

no code implementations • WS 2019 • Shohei Iida, Ryuichiro Kimura, Hongyi Cui, Po-Hsuan Hung, Takehito Utsuro, Masaaki Nagata

Machine Translation Translation

Paper
Add Code

NTT's Machine Translation Systems for WMT19 Robustness Task

no code implementations • WS 2019 • Soichiro Murakami, Makoto Morishita, Tsutomu Hirao, Masaaki Nagata

This paper describes NTT's submission to the WMT19 robustness task.

Domain Adaptation Machine Translation +1

Paper
Add Code

Using Semantic Similarity as Reward for Reinforcement Learning in Sentence Generation

no code implementations • ACL 2019 • Go Yasui, Yoshimasa Tsuruoka, Masaaki Nagata

Traditional model training for sentence generation employs cross-entropy loss as the loss function.

NMT reinforcement-learning +5

Paper
Add Code

Attention over Heads: A Multi-Hop Attention for Neural Machine Translation

no code implementations • ACL 2019 • Shohei Iida, Ryuichiro Kimura, Hongyi Cui, Po-Hsuan Hung, Takehito Utsuro, Masaaki Nagata

The first hop attention is the scaled dot-product attention which is the same attention mechanism used in the original Transformer.

Machine Translation Translation

Paper
Add Code

Character n-gram Embeddings to Improve RNN Language Models

no code implementations • 13 Jun 2019 • Sho Takase, Jun Suzuki, Masaaki Nagata

This paper proposes a novel Recurrent Neural Network (RNN) language model that takes advantage of character information.

Headline Generation Language Modelling +3

Paper
Add Code

Answering while Summarizing: Multi-task Learning for Multi-hop QA with Evidence Extraction

no code implementations • ACL 2019 • Kosuke Nishida, Kyosuke Nishida, Masaaki Nagata, Atsushi Otsuka, Itsumi Saito, Hisako Asano, Junji Tomita

It enables QFE to consider the dependency among the evidence sentences and cover important information in the question sentence.

Ranked #61 on Question Answering on HotpotQA

Answer Selection Extractive Summarization +4

Paper
Add Code

Unsupervised Token-wise Alignment to Improve Interpretation of Encoder-Decoder Models

no code implementations • WS 2018 • Shun Kiyono, Sho Takase, Jun Suzuki, Naoaki Okazaki, Kentaro Inui, Masaaki Nagata

Developing a method for understanding the inner workings of black-box neural methods is an important research endeavor.

Machine Translation Sentence +2

Paper
Add Code

NTT's Neural Machine Translation Systems for WMT 2018

no code implementations • WS 2018 • Makoto Morishita, Jun Suzuki, Masaaki Nagata

This paper describes NTT{'}s neural machine translation systems submitted to the WMT 2018 English-German and German-English news translation tasks.

Machine Translation Re-Ranking +1

Paper
Add Code

Automatic Pyramid Evaluation Exploiting EDU-based Extractive Reference Summaries

no code implementations • EMNLP 2018 • Tsutomu Hirao, Hidetaka Kamigaito, Masaaki Nagata

This paper tackles automation of the pyramid method, a reliable manual evaluation framework.

Semantic Textual Similarity

Paper
Add Code

Direct Output Connection for a High-Rank Language Model

1 code implementation • EMNLP 2018 • Sho Takase, Jun Suzuki, Masaaki Nagata

This paper proposes a state-of-the-art recurrent neural network (RNN) language model that combines probability distributions computed not only from a final RNN layer but also from middle layers.

Ranked #8 on Language Modelling on Penn Treebank (Word Level)

Constituency Parsing Headline Generation +4

Paper
Code

Improving Neural Machine Translation by Incorporating Hierarchical Subword Features

no code implementations • COLING 2018 • Makoto Morishita, Jun Suzuki, Masaaki Nagata

We hypothesize that in the NMT model, the appropriate subword units for the following three modules (layers) can differ: (1) the encoder embedding layer, (2) the decoder embedding layer, and (3) the decoder output layer.

Machine Translation NMT +1

Paper
Add Code

An Empirical Study of Building a Strong Baseline for Constituency Parsing

1 code implementation • ACL 2018 • Jun Suzuki, Sho Takase, Hidetaka Kamigaito, Makoto Morishita, Masaaki Nagata

This paper investigates the construction of a strong baseline based on general purpose sequence-to-sequence models for constituency parsing.

Ranked #16 on Constituency Parsing on Penn Treebank

Abstractive Text Summarization Constituency Parsing +3

Paper
Code

Pruning Basic Elements for Better Automatic Evaluation of Summaries

no code implementations • NAACL 2018 • Ukyo Honda, Tsutomu Hirao, Masaaki Nagata

We propose a simple but highly effective automatic evaluation measure of summarization, pruned Basic Elements (pBE).

Word Embeddings Word Similarity

Paper
Add Code

Neural Tensor Networks with Diagonal Slice Matrices

no code implementations • NAACL 2018 • Takahiro Ishihara, Katsuhiko Hayashi, Hitoshi Manabe, Masashi Shimbo, Masaaki Nagata

Although neural tensor networks (NTNs) have been successful in many NLP tasks, they require a large number of parameters to be estimated, which often leads to overfitting and a long training time.

Knowledge Graph Completion Logical Reasoning +2

Paper
Add Code

Higher-Order Syntactic Attention Network for Longer Sentence Compression

no code implementations • NAACL 2018 • Hidetaka Kamigaito, Katsuhiko Hayashi, Tsutomu Hirao, Masaaki Nagata

To solve this problem, we propose a higher-order syntactic attention network (HiSAN) that can handle higher-order dependency features as an attention distribution on LSTM hidden states.

Ranked #3 on Sentence Compression on Google Dataset

Informativeness Machine Translation +2

Paper
Add Code

Provable Fast Greedy Compressive Summarization with Any Monotone Submodular Function

no code implementations • NAACL 2018 • Shinsaku Sakaue, Tsutomu Hirao, Masaaki Nishino, Masaaki Nagata

This approach is known to have three advantages: its applicability to many useful submodular objective functions, the efficiency of the greedy algorithm, and the provable performance guarantee.

Document Summarization Extractive Summarization +1

Paper
Add Code

Source-side Prediction for Neural Headline Generation

no code implementations • 22 Dec 2017 • Shun Kiyono, Sho Takase, Jun Suzuki, Naoaki Okazaki, Kentaro Inui, Masaaki Nagata

The encoder-decoder model is widely used in natural language generation tasks.

Headline Generation

Paper
Add Code

NTT Neural Machine Translation Systems at WAT 2017

no code implementations • WS 2017 • Makoto Morishita, Jun Suzuki, Masaaki Nagata

In this year, we participated in four translation subtasks at WAT 2017.

Machine Translation Translation

Paper
Add Code

Controlling Target Features in Neural Machine Translation via Prefix Constraints

no code implementations • WS 2017 • Shunsuke Takeno, Masaaki Nagata, Kazuhide Yamamoto

We propose \textit{prefix constraints}, a novel method to enforce constraints on target sentences in neural machine translation.

Domain Adaptation Machine Translation +2

Paper
Add Code

Supervised Attention for Sequence-to-Sequence Constituency Parsing

no code implementations • IJCNLP 2017 • Hidetaka Kamigaito, Katsuhiko Hayashi, Tsutomu Hirao, Hiroya Takamura, Manabu Okumura, Masaaki Nagata

The sequence-to-sequence (Seq2Seq) model has been successfully applied to machine translation (MT).

Constituency Parsing Machine Translation +3

Paper
Add Code

Input-to-Output Gate to Improve RNN Language Models

1 code implementation • IJCNLP 2017 • Sho Takase, Jun Suzuki, Masaaki Nagata

This paper proposes a reinforcing method that refines the output layers of existing Recurrent Neural Network (RNN) language models.

Paper
Code

Hierarchical Word Structure-based Parsing: A Feasibility Study on UD-style Dependency Parsing in Japanese

no code implementations • WS 2017 • Takaaki Tanaka, Katsuhiko Hayashi, Masaaki Nagata

We introduce the following hierarchical word structures to dependency parsing in Japanese: morphological units (a short unit word, SUW) and syntactic units (a long unit word, LUW).

Chunking Dependency Parsing +2

Paper
Add Code

Oracle Summaries of Compressive Summarization

no code implementations • ACL 2017 • Tsutomu Hirao, Masaaki Nishino, Masaaki Nagata

This paper derives an Integer Linear Programming (ILP) formulation to obtain an oracle summary of the compressive summarization paradigm in terms of ROUGE.

Sentence Compression

Paper
Add Code

K-best Iterative Viterbi Parsing

no code implementations • EACL 2017 • Katsuhiko Hayashi, Masaaki Nagata

This paper presents an efficient and optimal parsing algorithm for probabilistic context-free grammars (PCFGs).

Paper
Add Code

Enumeration of Extractive Oracle Summaries

no code implementations • EACL 2017 • Tsutomu Hirao, Masaaki Nishino, Jun Suzuki, Masaaki Nagata

To analyze the limitations and the future directions of the extractive summarization paradigm, this paper proposes an Integer Linear Programming (ILP) formulation to obtain extractive oracle summaries in terms of ROUGE-N. We also propose an algorithm that enumerates all of the oracle summaries for a set of reference summaries to exploit F-measures that evaluate which system summaries contain how many sentences that are extracted as an oracle summary.

document understanding Extractive Summarization

Paper
Add Code

Cutting-off Redundant Repeating Generations for Neural Abstractive Summarization

no code implementations • EACL 2017 • Jun Suzuki, Masaaki Nagata

This paper tackles the reduction of redundant repeating generation that is often observed in RNN-based encoder-decoder models.

Ranked #4 on Text Summarization on DUC 2004 Task 1

Abstractive Text Summarization

Paper
Add Code

Reading Comprehension using Entity-based Memory Network

no code implementations • 12 Dec 2016 • Xun Wang, Katsuhito Sudoh, Masaaki Nagata, Tomohide Shibata, Daisuke Kawahara, Sadao Kurohashi

This paper introduces a novel neural network model for question answering, the \emph{entity-based memory network}.

Question Answering Reading Comprehension

Paper
Add Code

Integrating empty category detection into preordering Machine Translation

no code implementations • WS 2016 • Shunsuke Takeno, Masaaki Nagata, Kazuhide Yamamoto

We propose a method for integrating Japanese empty category detection into the preordering process of Japanese-to-English statistical machine translation.

Machine Translation Sentence +2

Paper
Add Code

Exploring Text Links for Coherent Multi-Document Summarization

no code implementations • COLING 2016 • Xun Wang, Masaaki Nishino, Tsutomu Hirao, Katsuhito Sudoh, Masaaki Nagata

Existing methods focus on the extraction of key information, but often neglect coherence.

Document Summarization Informativeness +1

Paper
Add Code

Chinese-to-Japanese Patent Machine Translation based on Syntactic Pre-ordering for WAT 2016

no code implementations • WS 2016 • Katsuhito Sudoh, Masaaki Nagata

This paper presents our Chinese-to-Japanese patent machine translation system for WAT 2016 (Group ID: ntt) that uses syntactic pre-ordering over Chinese dependency structures.

Chinese Word Segmentation Dependency Parsing +5