no code implementations • ACL ARR May 2021 • Mitchell A Gordon, Kevin Duh, Jared Kaplan
We observe that the development cross-entropy loss of supervised neural machine translation models scales like a power law with the amount of training data and the number of non-embedding parameters in the model.
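A hedged sketch of how such a power-law fit could be reproduced, assuming the form L(D) = a * D^(-p) + L_inf in the data dimension (the paper fits an analogous law jointly in data size and non-embedding parameter count); all data points and constants below are invented for illustration:

```python
# Hedged sketch, not the paper's fitting code: fit dev cross-entropy as a
# power law in training-data size D. All numbers below are invented.
import numpy as np
from scipy.optimize import curve_fit

def power_law(D, a, p, L_inf):
    """L(D) = a * D**(-p) + L_inf, with L_inf the irreducible loss."""
    return a * D ** (-p) + L_inf

D = np.array([1e4, 3e4, 1e5, 3e5, 1e6])   # hypothetical corpus sizes
L = np.array([5.2, 4.4, 3.7, 3.2, 2.9])   # hypothetical dev losses

(a, p, L_inf), _ = curve_fit(power_law, D, L, p0=[50.0, 0.3, 2.0])
print(f"fitted exponent p = {p:.3f}, irreducible loss = {L_inf:.2f}")
```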
no code implementations • IWSLT (EMNLP) 2018 • Hirofumi Inaguma, Xuan Zhang, Zhiqi Wang, Adithya Renduchintala, Shinji Watanabe, Kevin Duh
This paper describes the Johns Hopkins University (JHU) and Kyoto University submissions to the Speech Translation evaluation campaign at IWSLT2018.
no code implementations • LREC 2022 • Paul McNamee, Kevin Duh
Translation of the noisy, informal language found in social media has been an understudied problem, with a principal factor being the limited availability of translation corpora in many languages.
no code implementations • IWSLT (ACL) 2022 • Antonios Anastasopoulos, Loïc Barrault, Luisa Bentivogli, Marcely Zanon Boito, Ondřej Bojar, Roldano Cattoni, Anna Currey, Georgiana Dinu, Kevin Duh, Maha Elbayad, Clara Emmanuel, Yannick Estève, Marcello Federico, Christian Federmann, Souhir Gahbiche, Hongyu Gong, Roman Grundkiewicz, Barry Haddow, Benjamin Hsu, Dávid Javorský, Vĕra Kloudová, Surafel Lakew, Xutai Ma, Prashant Mathur, Paul McNamee, Kenton Murray, Maria Nǎdejde, Satoshi Nakamura, Matteo Negri, Jan Niehues, Xing Niu, John Ortega, Juan Pino, Elizabeth Salesky, Jiatong Shi, Matthias Sperber, Sebastian Stüker, Katsuhito Sudoh, Marco Turchi, Yogesh Virkar, Alexander Waibel, Changhan Wang, Shinji Watanabe
The evaluation campaign of the 19th International Conference on Spoken Language Translation featured eight shared tasks: (i) Simultaneous speech translation, (ii) Offline speech translation, (iii) Speech to speech translation, (iv) Low-resource speech translation, (v) Multilingual speech translation, (vi) Dialect speech translation, (vii) Formality control for speech translation, (viii) Isometric speech translation.
no code implementations • IWSLT 2017 • Hao Qin, Takahiro Shinozaki, Kevin Duh
Neural machine translation (NMT) systems have demonstrated promising results in recent years.
no code implementations • EMNLP 2020 • Shuo Sun, Kevin Duh
We present CLIRMatrix, a massively large collection of bilingual and multilingual datasets for Cross-Lingual Information Retrieval extracted automatically from Wikipedia.
no code implementations • MTSummit 2021 • Xuan Zhang, Kevin Duh
A cascaded Sign Language Translation system first maps sign videos to gloss annotations and then translates glosses into a spoken language.
1 code implementation • EACL (HCINLP) 2021 • Marianna Martindale, Kevin Duh, Marine Carpuat
Successful Machine Translation (MT) deployment requires understanding not only the intrinsic qualities of MT output, such as fluency and adequacy, but also user perceptions.
no code implementations • EMNLP (IWSLT) 2019 • Hirofumi Inaguma, Shun Kiyono, Nelson Enrique Yalta Soplin, Jun Suzuki, Kevin Duh, Shinji Watanabe
This year, we mainly build our systems on Transformer architectures for all tasks and focus on end-to-end speech translation (E2E-ST).
no code implementations • AMTA 2022 • Suzanna Sia, Kevin Duh
We analyze the resulting embeddings’ training dynamics and where they lie in the embedding space, and show that our trained embeddings can be used both for in-context translation and for diverse generation of the target sentence.
no code implementations • AMTA 2022 • Neha Verma, Kenton Murray, Kevin Duh
Therefore, in this work, we propose two major fine-tuning strategies: our language-first approach first learns the translation language pair via general bitext, followed by the domain via in-domain bitext, and our domain-first approach first learns the domain via multilingual in-domain bitext, followed by the language pair via language pair-specific in-domain bitext.
no code implementations • 7 Mar 2024 • Suzanna Sia, David Mueller, Kevin Duh
Self-supervised large language models have demonstrated the ability to perform Machine Translation (MT) via in-context learning, but little is known about where the model performs the task with respect to prompt instructions and demonstration examples.
no code implementations • 27 Nov 2023 • Elijah Rippeth, Marine Carpuat, Kevin Duh, Matt Post
Lexical ambiguity is a challenging and pervasive problem in machine translation (MT).
no code implementations • 14 Nov 2023 • Suzanna Sia, Alexandra DeLucia, Kevin Duh
Zero-shot in-context learning is the phenomenon where models can perform a task given only the instructions.
1 code implementation • 20 Jun 2023 • Cihan Xiao, Henry Li Xinyuan, Jinyi Yang, Dongji Gao, Matthew Wiesner, Kevin Duh, Sanjeev Khudanpur
We introduce HK-LegiCoST, a new three-way parallel corpus of Cantonese-English translations, containing 600+ hours of Cantonese audio, its standard traditional Chinese transcript, and English translation, segmented and aligned at the sentence level.
no code implementations • 12 Jun 2023 • Jeremy Gwinnup, Kevin Duh
Large language models such as BERT and the GPT series started a paradigm shift that calls for building general-purpose models via pre-training on large datasets, followed by fine-tuning on task-specific datasets.
no code implementations • 23 May 2023 • Neha Verma, Kenton Murray, Kevin Duh
Multilingual machine translation has proven immensely useful for low-resource and zero-shot language pairs.
no code implementations • 5 May 2023 • Suzanna Sia, Kevin Duh
In this work, which focuses on Machine Translation, we present a perspective on in-context learning as the desired generation task maintaining coherency with its context, i.e., the prompt examples.
no code implementations • 25 Oct 2022 • Kelly Marchisio, Ali Saad-Eldin, Kevin Duh, Carey Priebe, Philipp Koehn
Bilingual lexicons form a critical component of various natural language processing applications, including unsupervised and semi-supervised machine translation and cross-lingual information retrieval.
1 code implementation • 11 Oct 2022 • Kelly Marchisio, Neha Verma, Kevin Duh, Philipp Koehn
The ability to extract high-quality translation dictionaries from monolingual word embedding spaces depends critically on the geometric similarity of the spaces -- their degree of "isomorphism."
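One common way to quantify such geometric similarity (not necessarily the exact metric this paper uses) is eigenvector similarity: compare the Laplacian spectra of nearest-neighbour graphs built from each embedding space. A toy sketch with random matrices standing in for real embedding spaces:

```python
# Hedged sketch of an "isomorphism" measure: distance between the Laplacian
# spectra of the k-NN cosine graphs of two embedding spaces.
import numpy as np

def laplacian_eigs(emb: np.ndarray, k_nn: int = 5) -> np.ndarray:
    """Laplacian eigenvalues of the k-NN cosine-similarity graph."""
    X = emb / np.linalg.norm(emb, axis=1, keepdims=True)
    sim = X @ X.T
    np.fill_diagonal(sim, -np.inf)            # exclude self-similarity
    A = np.zeros_like(sim)
    for i, js in enumerate(np.argsort(-sim, axis=1)[:, :k_nn]):
        A[i, js] = 1.0
    A = np.maximum(A, A.T)                    # symmetrize the graph
    L = np.diag(A.sum(axis=1)) - A            # unnormalized graph Laplacian
    return np.sort(np.linalg.eigvalsh(L))

def eigenvector_similarity(emb1, emb2, k: int = 10) -> float:
    """Smaller = more isomorphic: squared distance between top-k spectra."""
    e1, e2 = laplacian_eigs(emb1), laplacian_eigs(emb2)
    return float(np.sum((e1[:k] - e2[:k]) ** 2))

rng = np.random.default_rng(0)
print(eigenvector_similarity(rng.normal(size=(50, 16)), rng.normal(size=(50, 16))))
```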
1 code implementation • 20 Jan 2022 • Suraj Nair, Eugene Yang, Dawn Lawrie, Kevin Duh, Paul McNamee, Kenton Murray, James Mayfield, Douglas W. Oard
These models have improved the effectiveness of retrieval systems well beyond that of lexical term matching models such as BM25.
1 code implementation • Findings (EMNLP) 2021 • Kelly Marchisio, Youngser Park, Ali Saad-Eldin, Anton Alyakin, Kevin Duh, Carey Priebe, Philipp Koehn
Alternatively, word embeddings may be understood as nodes in a weighted graph.
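A minimal sketch of that graph view, with invented words and random vectors standing in for trained embeddings: nodes are words, edges connect nearest neighbours, and edge weights are cosine similarities.

```python
# Hedged sketch: turn an embedding matrix into a weighted nearest-neighbour
# graph. Vocabulary and vectors below are invented for illustration.
import numpy as np
import networkx as nx

words = ["cat", "dog", "car", "truck", "apple"]
vecs = np.random.default_rng(1).normal(size=(len(words), 8))
vecs /= np.linalg.norm(vecs, axis=1, keepdims=True)

G = nx.Graph()
sims = vecs @ vecs.T
for i, w in enumerate(words):
    for j in np.argsort(-sims[i])[1:3]:   # 2 nearest neighbours, skip self
        G.add_edge(w, words[j], weight=float(sims[i, j]))

print(G.edges(data=True))
```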
no code implementations • 9 Sep 2021 • Hirofumi Inaguma, Yosuke Higuchi, Kevin Duh, Tatsuya Kawahara, Shinji Watanabe
We propose a unified NAR E2E-ST framework called Orthros, which has an NAR decoder and an auxiliary shallow AR decoder on top of the shared encoder.
no code implementations • ACL (IWSLT) 2021 • Hirofumi Inaguma, Brian Yan, Siddharth Dalmia, Pengcheng Guo, Jiatong Shi, Kevin Duh, Shinji Watanabe
This year we made various efforts on training data, architecture, and audio segmentation.
no code implementations • ACL (IWSLT) 2021 • Lei Zhou, Liang Ding, Kevin Duh, Shinji Watanabe, Ryohei Sasano, Koichi Takeda
In the field of machine learning, the well-trained model is assumed to be able to recover the training labels, i.e., the synthetic labels predicted by the model should be as close to the ground-truth labels as possible.
1 code implementation • EACL 2021 • Suzanna Sia, Kevin Duh
Probabilistic topic models in low data resource scenarios are faced with less reliable estimates due to sparsity of discrete word co-occurrence counts, and do not have the luxury of retraining word or topic embeddings using neural methods.
no code implementations • EACL 2021 • Jiatong Shi, Jonathan D. Amith, Rey Castillo García, Esteban Guadalupe Sierra, Kevin Duh, Shinji Watanabe
"Transcription bottlenecks", created by a shortage of effective human transcribers (i.e., transcriber shortage), are one of the main challenges to endangered language (EL) documentation.
no code implementations • 25 Oct 2020 • Hirofumi Inaguma, Yosuke Higuchi, Kevin Duh, Tatsuya Kawahara, Shinji Watanabe
Fast inference speed is an important goal towards real-world deployment of speech translation (ST) systems.
4 code implementations • 18 Aug 2020 • Xiaodong Liu, Kevin Duh, Liyuan Liu, Jianfeng Gao
We explore the application of very deep Transformer models for Neural Machine Translation (NMT).
Ranked #1 on Machine Translation on WMT2014 English-French (using extra training data)
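For scale, a hedged PyTorch sketch of what "very deep" means here; the paper's models additionally rely on careful initialisation to keep such depths trainable, which this sketch omits:

```python
# Hedged sketch: a 60-layer Transformer encoder (the usual baseline is 6).
import torch.nn as nn

layer = nn.TransformerEncoderLayer(d_model=512, nhead=8, dim_feedforward=2048)
deep_encoder = nn.TransformerEncoder(layer, num_layers=60)
n_params = sum(p.numel() for p in deep_encoder.parameters())
print(f"{n_params / 1e6:.0f}M encoder parameters")
```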
1 code implementation • ACL 2020 • Shuo Sun, Suzanna Sia, Kevin Duh
We present CLIReval, an easy-to-use toolkit for evaluating machine translation (MT) with the proxy task of cross-lingual information retrieval (CLIR).
no code implementations • 8 May 2020 • Shuo Sun, Kevin Duh
Learning to rank is an important task that has been successfully deployed in many real-world information retrieval systems.
no code implementations • LREC 2020 • Kevin Duh, Paul McNamee, Matt Post, Brian Thompson
In this study, we benchmark state-of-the-art statistical and neural machine translation systems on two African languages which do not have large amounts of resources: Somali and Swahili.
1 code implementation • ACL 2020 • Hirofumi Inaguma, Shun Kiyono, Kevin Duh, Shigeki Karita, Nelson Enrique Yalta Soplin, Tomoki Hayashi, Shinji Watanabe
We present ESPnet-ST, which is designed for the quick development of speech-to-speech translation systems in a single framework.
no code implementations • WMT (EMNLP) 2020 • Kelly Marchisio, Kevin Duh, Philipp Koehn
We additionally find that unsupervised MT performance declines when source and target languages use different scripts, and observe very poor performance on authentic low-resource language pairs.
no code implementations • WS 2020 • Mitchell A. Gordon, Kevin Duh
We explore best practices for training small, memory efficient machine translation models with sequence-level knowledge distillation in the domain adaptation setting.
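A minimal sketch of sequence-level knowledge distillation itself (Kim and Rush, 2016), the technique the paper builds on: re-label the training set with the teacher's translations and train the student on those. `teacher` below is a stand-in function, not a real MT model:

```python
# Hedged sketch: sequence-level KD re-labels source sentences with the
# teacher's outputs; the student then trains on (src, teacher(src)) pairs.
def distill(teacher, sources):
    return [(src, teacher(src)) for src in sources]

teacher = lambda src: src.upper()        # stand-in for beam-search decoding
corpus = ["ein kleines haus", "das buch"]
print(distill(teacher, corpus))
```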
no code implementations • AMTA 2020 • Jason Naradowsky, Xuan Zhang, Kevin Duh
Adapting machine translation systems in the real world is a difficult problem.
1 code implementation • ACL 2020 • Mitchell A. Gordon, Kevin Duh, Nicholas Andrews
Low levels of pruning (30-40%) do not affect pre-training loss or transfer to downstream tasks at all.
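A hedged sketch of the magnitude pruning the paper studies, using PyTorch's pruning utility to zero the smallest 30% of weights in a toy layer:

```python
# Hedged sketch: L1 (magnitude) unstructured pruning at 30% sparsity.
import torch
import torch.nn.utils.prune as prune

layer = torch.nn.Linear(768, 768)
prune.l1_unstructured(layer, name="weight", amount=0.30)
sparsity = (layer.weight == 0).float().mean().item()
print(f"weight sparsity: {sparsity:.0%}")   # ~30%
```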
no code implementations • TACL 2020 • Xuan Zhang, Kevin Duh
Hyperparameter selection is a crucial part of building neural machine translation (NMT) systems across both academia and industry.
no code implementations • 6 Dec 2019 • Mitchell A. Gordon, Kevin Duh
We then propose an alternative hypothesis under the lens of data augmentation and regularization.
no code implementations • IJCNLP 2019 • Brian Thompson, Rebecca Knowles, Xuan Zhang, Huda Khayrallah, Kevin Duh, Philipp Koehn
Bilingual lexicons are valuable resources used by professional human translators.
1 code implementation • 1 Oct 2019 • Hirofumi Inaguma, Kevin Duh, Tatsuya Kawahara, Shinji Watanabe
In this paper, we propose a simple yet effective framework for multilingual end-to-end speech translation (ST), in which speech utterances in source languages are directly translated to the desired target languages with a universal sequence-to-sequence architecture.
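A common recipe for steering such a universal sequence-to-sequence model (borrowed from multilingual NMT and assumed here for illustration, not quoted from the paper) is to prepend a target-language tag so one model can serve many output languages:

```python
# Hedged sketch: a target-language tag tells a single universal model which
# language to generate. Tags and sentences below are illustrative only.
def add_target_tag(text: str, lang: str) -> str:
    """Prepend a language tag such as <2de> to the text."""
    return f"<2{lang}> {text}"

print(add_target_tag("ein kleines haus", "de"))   # <2de> ein kleines haus
print(add_target_tag("une petite maison", "fr"))  # <2fr> une petite maison
```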
no code implementations • IJCNLP 2019 • Sheng Zhang, Xutai Ma, Kevin Duh, Benjamin Van Durme
We unify different broad-coverage semantic parsing tasks under a transduction paradigm, and propose an attention-based neural framework that incrementally builds a meaning representation via a sequence of semantic relations.
Ranked #2 on UCCA Parsing on SemEval 2019 Task 1
no code implementations • WS 2019 • Matt Post, Kevin Duh
We describe the JHU submissions to the French–English, Japanese–English, and English–Japanese Robustness Task at WMT 2019.
no code implementations • WS 2019 • Tom Lippincott, Pamela Shapiro, Kevin Duh, Paul McNamee
Our submission to the MADAR shared task on Arabic dialect identification employed a language modeling technique called Prediction by Partial Matching, an ensemble of neural architectures, and sources of additional data for training word embeddings and auxiliary language models.
no code implementations • WS 2019 • Pamela Shapiro, Kevin Duh
When translating diglossic languages such as Arabic, situations may arise where we would like to translate a text but do not know which dialect it is.
no code implementations • NAACL 2019 • Brian Thompson, Jeremy Gwinnup, Huda Khayrallah, Kevin Duh, Philipp Koehn
Continued training is an effective method for domain adaptation in neural machine translation.
no code implementations • WS 2019 • Shuoyang Ding, Adithya Renduchintala, Kevin Duh
Most neural machine translation systems are built upon subword units extracted by methods such as Byte-Pair Encoding (BPE) or wordpiece.
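Since the abstract leans on BPE, a minimal sketch of the BPE merge loop (after Sennrich et al., 2016) on a toy vocabulary; real toolkits such as subword-nmt implement the same idea at scale:

```python
# Hedged sketch of BPE training: repeatedly merge the most frequent
# adjacent symbol pair. The toy vocabulary maps spaced symbols to counts.
import re
from collections import Counter

def pair_counts(vocab):
    pairs = Counter()
    for word, freq in vocab.items():
        symbols = word.split()
        for a, b in zip(symbols, symbols[1:]):
            pairs[(a, b)] += freq
    return pairs

def merge_pair(pair, vocab):
    pattern = re.compile(r"(?<!\S)" + re.escape(" ".join(pair)) + r"(?!\S)")
    return {pattern.sub("".join(pair), w): f for w, f in vocab.items()}

vocab = {"l o w </w>": 5, "l o w e r </w>": 2, "n e w e s t </w>": 6}
for _ in range(3):
    best = pair_counts(vocab).most_common(1)[0][0]
    vocab = merge_pair(best, vocab)
    print("merged", best)
```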
1 code implementation • ACL 2019 • Sheng Zhang, Xutai Ma, Kevin Duh, Benjamin Van Durme
Our experimental results outperform all previously reported SMATCH scores on both AMR 2.0 (76.3% F1 on LDC2017T10) and AMR 1.0 (70.2% F1 on LDC2014T12).
Ranked #1 on AMR Parsing on LDC2014T12
no code implementations • NAACL 2019 • Xuan Zhang, Pamela Shapiro, Gaurav Kumar, Paul McNamee, Marine Carpuat, Kevin Duh
We introduce a curriculum learning approach to adapt generic neural machine translation models to a specific domain.
no code implementations • 16 Apr 2019 • Muhammad Mahbubur Rahman, Sorami Hisamoto, Kevin Duh
Community question-answering (CQA) platforms have become very popular forums for asking and answering questions daily.
1 code implementation • TACL 2020 • Sorami Hisamoto, Matt Post, Kevin Duh
Data privacy is an important issue for "machine learning as a service" providers.
1 code implementation • 2 Nov 2018 • Xuan Zhang, Gaurav Kumar, Huda Khayrallah, Kenton Murray, Jeremy Gwinnup, Marianna J. Martindale, Paul McNamee, Kevin Duh, Marine Carpuat
Machine translation systems based on deep neural networks are expensive to train.
no code implementations • 30 Oct 2018 • Sheng Zhang, Xiaodong Liu, Jingjing Liu, Jianfeng Gao, Kevin Duh, Benjamin Van Durme
We present a large-scale dataset, ReCoRD, for machine reading comprehension requiring commonsense reasoning.
Ranked #34 on Common Sense Reasoning on ReCoRD
no code implementations • WS 2018 • Philipp Koehn, Kevin Duh, Brian Thompson
We report on the efforts of the Johns Hopkins University to develop neural machine translation systems for the shared task for news translation organized around the Conference for Machine Translation (WMT) 2018.
no code implementations • EMNLP 2018 • Sheng Zhang, Xutai Ma, Rachel Rudinger, Kevin Duh, Benjamin Van Durme
We introduce the task of cross-lingual decompositional semantic parsing: mapping content provided in a source language into a decompositional semantic analysis based on a target language.
5 code implementations • 24 Sep 2018 • Xiaodong Liu, Wei Li, Yuwei Fang, Aerin Kim, Kevin Duh, Jianfeng Gao
This paper presents an extension of the Stochastic Answer Network (SAN), one of the state-of-the-art machine reading comprehension models, to be able to judge whether a question is unanswerable or not.
1 code implementation • WS 2018 • Brian Thompson, Huda Khayrallah, Antonios Anastasopoulos, Arya D. McCarthy, Kevin Duh, Rebecca Marvin, Paul McNamee, Jeremy Gwinnup, Tim Anderson, Philipp Koehn
To better understand the effectiveness of continued training, we analyze the major components of a neural machine translation system (the encoder, decoder, and each embedding space) and consider each component's contribution to, and capacity for, domain adaptation.
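A hedged sketch of one such component-level probe: continue training on in-domain data with the decoder frozen, so that any adaptation gain must come from the encoder and embeddings. The toy encoder-decoder below is a stand-in, not the paper's architecture:

```python
# Hedged sketch: freeze one component during continued training to measure
# its contribution to domain adaptation. The model here is a toy stand-in.
import torch.nn as nn

model = nn.ModuleDict({
    "encoder": nn.LSTM(256, 256),
    "decoder": nn.LSTM(256, 256),
})
for p in model["decoder"].parameters():
    p.requires_grad = False   # decoder frozen; only the encoder adapts

trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
print(f"trainable parameters: {trainable}")
```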
no code implementations • WS 2019 • Adithya Renduchintala, Pamela Shapiro, Kevin Duh, Philipp Koehn
Neural machine translation (NMT) systems operate primarily on words (or sub-words), ignoring lower-level patterns of morphology.
no code implementations • 5 Sep 2018 • Pamela Shapiro, Kevin Duh
Neural Machine Translation (NMT) in low-resource settings and of morphologically rich languages is made difficult in part by data sparsity of vocabulary words.
1 code implementation • WS 2018 • Huda Khayrallah, Brian Thompson, Kevin Duh, Philipp Koehn
Supervised domain adaptation, where a large generic corpus and a smaller in-domain corpus are both available for training, is a challenge for neural machine translation (NMT).
no code implementations • 5 Jun 2018 • Shuoyang Ding, Kevin Duh
Using pre-trained word embeddings as input layer is a common practice in many natural language processing (NLP) tasks, but it is largely neglected for neural machine translation (NMT).
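A minimal sketch of the practice the paper examines: initialising the NMT input (embedding) layer from pre-trained vectors. The random matrix below stands in for vectors loaded from, say, a word2vec file:

```python
# Hedged sketch: seed an NMT embedding layer with pre-trained word vectors.
import torch
import torch.nn as nn

vocab_size, dim = 10_000, 300
pretrained = torch.randn(vocab_size, dim)         # stand-in for real vectors
src_embed = nn.Embedding.from_pretrained(pretrained, freeze=False)
print(src_embed(torch.tensor([1, 42, 7])).shape)  # torch.Size([3, 300])
```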
no code implementations • NAACL 2018 • Shota Sasaki, Shuo Sun, Shigehiko Schamoni, Kevin Duh, Kentaro Inui
Cross-lingual information retrieval (CLIR) is a document retrieval task where the documents are written in a language different from that of the user's query.
no code implementations • WS 2018 • Pamela Shapiro, Kevin Duh
Neural machine translation has achieved impressive results in the last few years, but its success has been limited to settings with large amounts of parallel data.
no code implementations • SEMEVAL 2018 • Hongyuan Mei, Sheng Zhang, Kevin Duh, Benjamin Van Durme
Cross-lingual information extraction (CLIE) is an important and challenging task, especially in low resource scenarios.
3 code implementations • 21 Apr 2018 • Xiaodong Liu, Kevin Duh, Jianfeng Gao
We propose a stochastic answer network (SAN) to explore multi-step inference strategies in Natural Language Inference.
Ranked #32 on Natural Language Inference on SNLI
no code implementations • 21 Apr 2018 • Sheng Zhang, Kevin Duh, Benjamin Van Durme
We introduce the task of cross-lingual semantic parsing: mapping content provided in a source language into a meaning representation based on a target language.
1 code implementation • SEMEVAL 2018 • Sheng Zhang, Kevin Duh, Benjamin Van Durme
Fine-grained entity typing is the task of assigning fine-grained semantic types to entity mentions.
5 code implementations • ACL 2018 • Xiaodong Liu, Yelong Shen, Kevin Duh, Jianfeng Gao
We propose a simple yet robust stochastic answer network (SAN) that simulates multi-step reasoning in machine reading comprehension.
Ranked #24 on Question Answering on SQuAD1.1 dev
no code implementations • IJCNLP 2017 • Yelong Shen, Xiaodong Liu, Kevin Duh, Jianfeng Gao
Using a state-of-the-art RC model, we empirically investigate the performance of single-turn and multiple-turn reasoning on the SQuAD and MS MARCO datasets.
no code implementations • IJCNLP 2017 • Benjamin Van Durme, Tom Lippincott, Kevin Duh, Deana Burchfield, Adam Poliak, Cash Costello, Tim Finin, Scott Miller, James Mayfield, Philipp Koehn, Craig Harman, Dawn Lawrie, Chandler May, Max Thomas, Annabelle Carrell, Julianne Chaloux, Tongfei Chen, Alex Comerford, Mark Dredze, Benjamin Glass, Shudong Hao, Patrick Martin, Pushpendre Rastogi, Rashmi Sankepally, Travis Wolfe, Ying-Ying Tran, Ted Zhang
It combines a multitude of analytics together with a flexible environment for customizing the workflow for different users.
no code implementations • IJCNLP 2017 • Sheng Zhang, Kevin Duh, Benjamin Van Durme
Cross-lingual open information extraction is the task of distilling facts from the source language into representations in the target language.
no code implementations • IJCNLP 2017 • Ryan Cotterell, Kevin Duh
Low-resource named entity recognition is still an open problem in NLP.
no code implementations • IJCNLP 2017 • Aaron Steven White, Pushpendre Rastogi, Kevin Duh, Benjamin Van Durme
We propose to unify a variety of existing semantic classification tasks, such as semantic role labeling, anaphora resolution, and paraphrase detection, under the heading of Recognizing Textual Entailment (RTE).
no code implementations • IJCNLP 2017 • Huda Khayrallah, Gaurav Kumar, Kevin Duh, Matt Post, Philipp Koehn
Domain adaptation is a major challenge for neural machine translation (NMT).
no code implementations • IJCNLP 2017 • Dingquan Wang, Nanyun Peng, Kevin Duh
We show how to adapt bilingual word embeddings (BWEs) to bootstrap a cross-lingual named-entity recognition (NER) system in a language with no labeled data.
2 code implementations • 24 Apr 2017 • Chandler May, Kevin Duh, Benjamin Van Durme, Ashwin Lall
We develop a streaming (one-pass, bounded-memory) word embedding algorithm based on the canonical skip-gram with negative sampling algorithm implemented in word2vec.
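One bounded-memory ingredient such a one-pass embedding algorithm needs is an approximate top-k counter over the word stream; a minimal sketch of the classic space-saving algorithm, which (as I understand it) this line of work couples with online skip-gram updates:

```python
# Hedged sketch: the space-saving algorithm keeps at most k counters and
# approximates the most frequent items of a stream in one pass.
def space_saving(stream, k=3):
    counts = {}
    for w in stream:
        if w in counts:
            counts[w] += 1
        elif len(counts) < k:
            counts[w] = 1
        else:
            # evict the current minimum and inherit its count (overestimate)
            w_min = min(counts, key=counts.get)
            counts[w] = counts.pop(w_min) + 1
    return counts

print(space_saving("the cat sat on the mat the cat".split(), k=3))
```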
no code implementations • EACL 2017 • Sheng Zhang, Kevin Duh, Benjamin Van Durme
Conventional pipeline solutions decompose the task as machine translation followed by information extraction (or vice versa).
4 code implementations • 15 Jan 2017 • Graham Neubig, Chris Dyer, Yoav Goldberg, Austin Matthews, Waleed Ammar, Antonios Anastasopoulos, Miguel Ballesteros, David Chiang, Daniel Clothiaux, Trevor Cohn, Kevin Duh, Manaal Faruqui, Cynthia Gan, Dan Garrette, Yangfeng Ji, Lingpeng Kong, Adhiguna Kuncoro, Gaurav Kumar, Chaitanya Malaviya, Paul Michel, Yusuke Oda, Matthew Richardson, Naomi Saphra, Swabha Swayamdipta, Pengcheng Yin
In the static declaration strategy that is used in toolkits like Theano, CNTK, and TensorFlow, the user first defines a computation graph (a symbolic representation of the computation), and then examples are fed into an engine that executes this computation and computes its derivatives.
no code implementations • TACL 2017 • Sheng Zhang, Rachel Rudinger, Kevin Duh, Benjamin Van Durme
Humans have the capacity to draw common-sense inferences from natural language: various things that are likely but not certain to hold based on established discourse, and are rarely stated explicitly.
1 code implementation • 7 Aug 2016 • Keisuke Sakaguchi, Kevin Duh, Matt Post, Benjamin Van Durme
Inspired by the findings from the Cmabrigde Uinervtisy effect, we propose a word recognition model based on a semi-character level recurrent neural network (scRNN).
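A minimal sketch of the semi-character word encoding behind the scRNN: a one-hot first character, a bag-of-characters vector for the interior, and a one-hot last character, which makes the representation invariant to jumbling the interior letters (lower-case ASCII assumed):

```python
# Hedged sketch of the scRNN input encoding: first + bag(middle) + last.
import string
import numpy as np

IDX = {c: i for i, c in enumerate(string.ascii_lowercase)}

def semi_char_vector(word: str) -> np.ndarray:
    first, middle, last = np.zeros(26), np.zeros(26), np.zeros(26)
    first[IDX[word[0]]] = 1
    last[IDX[word[-1]]] = 1
    for c in word[1:-1]:            # interior characters as a bag
        middle[IDX[c]] += 1
    return np.concatenate([first, middle, last])

# Scrambled and clean spellings map to the same vector:
print(np.array_equal(semi_char_vector("cambridge"), semi_char_vector("cmabrigde")))
```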
no code implementations • 22 Apr 2016 • Adhiguna Kuncoro, Yuichiro Sawai, Kevin Duh, Yuji Matsumoto
We propose a transition-based dependency parser using Recurrent Neural Networks with Long Short-Term Memory (LSTM) units.
no code implementations • 16 Aug 2015 • Kaisheng Yao, Trevor Cohn, Katerina Vylomova, Kevin Duh, Chris Dyer
This gate is a function of the lower layer's memory cell, the input to this layer, and this layer's past memory cell.
no code implementations • 18 Dec 2014 • Daniel Fried, Kevin Duh
We investigate the hypothesis that word representations ought to incorporate both distributional and relational semantics.
no code implementations • LREC 2014 • Fei Cheng, Kevin Duh, Yuji Matsumoto
Synthetic word analysis is a potentially important but relatively unexplored problem in Chinese natural language processing.