Search Results for author: Graham Neubig

Found 356 papers, 216 papers with code

Language Resource Addition: Dictionary or Corpus?

no code implementations LREC 2014 Shinsuke Mori, Graham Neubig

The experimental results showed that adding annotated sentences to the training corpus is more effective than adding entries to the dictionary.

Active Learning Domain Adaptation +4

Neural Reranking Improves Subjective Quality of Machine Translation: NAIST at WAT2015

no code implementations WS 2015 Graham Neubig, Makoto Morishita, Satoshi Nakamura

We further perform a detailed analysis of reasons for this increase, finding that the main contributions of the neural models lie in improvement of the grammatical correctness of the output, as opposed to improvements in lexical choice of content words.

Machine Translation Translation

Morphological Inflection Generation Using Character Sequence to Sequence Learning

1 code implementation NAACL 2016 Manaal Faruqui, Yulia Tsvetkov, Graham Neubig, Chris Dyer

Morphological inflection generation is the task of generating the inflected form of a given lemma corresponding to a particular linguistic transformation.
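
As a quick illustration of the task setup (a toy sketch; the tag inventory and encoding below are placeholders, not the paper's exact format):

    # Illustrative input/output for character-level inflection generation:
    # lemma characters plus feature tags map to inflected-form characters.
    src = list("run") + ["<V>", "<PST>"]   # lemma characters plus feature tags
    tgt = list("ran")                      # characters of the inflected form
    print(" ".join(src), "->", " ".join(tgt))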

LEMMA Morphological Inflection

Generalizing and Hybridizing Count-based and Neural Language Models

1 code implementation EMNLP 2016 Graham Neubig, Chris Dyer

Language models (LMs) are statistical models that calculate probabilities over sequences of words or other discrete symbols.
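
Concretely, an LM of either type assigns a probability to a sequence via the chain rule (a standard formulation, stated here for reference):

    p(x_1, \ldots, x_T) = \prod_{t=1}^{T} p(x_t \mid x_1, \ldots, x_{t-1})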

Language Modelling

Incorporating Discrete Translation Lexicons into Neural Machine Translation

2 code implementations EMNLP 2016 Philip Arthur, Graham Neubig, Satoshi Nakamura

Neural machine translation (NMT) often makes mistakes in translating low-frequency content words that are essential to understanding the meaning of the sentence.

Machine Translation NMT +2

Lexicons and Minimum Risk Training for Neural Machine Translation: NAIST-CMU at WAT2016

no code implementations WS 2016 Graham Neubig

This year, the Nara Institute of Science and Technology (NAIST)/Carnegie Mellon University (CMU) submission to the Japanese-English translation track of the 2016 Workshop on Asian Translation was based on attentional neural machine translation (NMT) models.

Machine Translation NMT +1

What Do Recurrent Neural Network Grammars Learn About Syntax?

1 code implementation EACL 2017 Adhiguna Kuncoro, Miguel Ballesteros, Lingpeng Kong, Chris Dyer, Graham Neubig, Noah A. Smith

We investigate what information they learn, from a linguistic perspective, through various ablations to the model and the data, and by augmenting the model with an attention mechanism (GA-RNNG) to enable closer inspection.

Constituency Parsing Dependency Parsing +1

Lightly Supervised Quality Estimation

no code implementations COLING 2016 Matthias Sperber, Graham Neubig, Jan Niehues, Sebastian Stüker, Alex Waibel

Evaluating the quality of output from language processing systems such as machine translation or speech recognition is an essential step in ensuring that they are sufficient for practical use.

Automatic Speech Recognition (ASR) Machine Translation +2

DyNet: The Dynamic Neural Network Toolkit

4 code implementations 15 Jan 2017 Graham Neubig, Chris Dyer, Yoav Goldberg, Austin Matthews, Waleed Ammar, Antonios Anastasopoulos, Miguel Ballesteros, David Chiang, Daniel Clothiaux, Trevor Cohn, Kevin Duh, Manaal Faruqui, Cynthia Gan, Dan Garrette, Yangfeng Ji, Lingpeng Kong, Adhiguna Kuncoro, Gaurav Kumar, Chaitanya Malaviya, Paul Michel, Yusuke Oda, Matthew Richardson, Naomi Saphra, Swabha Swayamdipta, Pengcheng Yin

In the static declaration strategy that is used in toolkits like Theano, CNTK, and TensorFlow, the user first defines a computation graph (a symbolic representation of the computation), and then examples are fed into an engine that executes this computation and computes its derivatives.
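
By contrast, in the dynamic ("define-by-run") strategy that DyNet implements, the graph is rebuilt per example, so its structure can follow the input. A minimal sketch of that idea (names and details are illustrative, not DyNet's actual API):

    import numpy as np

    # Dynamic declaration sketch: the computation mirrors ordinary Python
    # control flow, so variable-length inputs need no symbolic loops.
    def encode(tokens, emb):
        state = np.zeros(emb.shape[1])
        for t in tokens:                 # graph structure follows each example
            state = np.tanh(state + emb[t])
        return state

    emb = np.random.rand(100, 8)         # toy embedding table
    print(encode([3, 14, 15], emb))      # 3-token example
    print(encode([9, 2], emb))           # 2-token example builds a different graph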

graph construction

Neural Machine Translation and Sequence-to-sequence Models: A Tutorial

2 code implementations 5 Mar 2017 Graham Neubig

This tutorial introduces a new and powerful set of techniques variously called "neural machine translation" or "neural sequence-to-sequence models".

Machine Translation Math +1

Cross-Lingual Word Embeddings for Low-Resource Language Modeling

no code implementations EACL 2017 Oliver Adams, Adam Makarucha, Graham Neubig, Steven Bird, Trevor Cohn

We investigate the use of such lexicons to improve language models when textual training data is limited to as few as a thousand sentences.

Cross-Lingual Word Embeddings Language Modelling +3

Neural Lattice-to-Sequence Models for Uncertain Inputs

no code implementations EMNLP 2017 Matthias Sperber, Graham Neubig, Jan Niehues, Alex Waibel

In this work, we extend the TreeLSTM (Tai et al., 2015) into a LatticeLSTM that is able to consume word lattices, and can be used as encoder in an attentional encoder-decoder model.

Translation

A Syntactic Neural Model for General-Purpose Code Generation

6 code implementations ACL 2017 Pengcheng Yin, Graham Neubig

We consider the problem of parsing natural language descriptions into source code written in a general-purpose programming language like Python.
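
As a toy illustration of this task (a hypothetical example, not taken from the paper's data):

    # A natural-language description and a snippet a parser might produce
    # for it; the assert just shows the generated output is executable.
    description = "sort the list xs in descending order"
    generated_code = "sorted(xs, reverse=True)"

    xs = [3, 1, 2]
    assert eval(generated_code) == [3, 2, 1]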

Code Generation Semantic Parsing +1

Multi-space Variational Encoder-Decoders for Semi-supervised Labeled Sequence Transduction

no code implementations ACL 2017 Chunting Zhou, Graham Neubig

Labeled sequence transduction is a task of transforming one sequence into another sequence that satisfies desiderata specified by a set of labels.

Morphological Inflection

Learning Character-level Compositionality with Visual Features

2 code implementations ACL 2017 Frederick Liu, Han Lu, Chieh Lo, Graham Neubig

Previous work has modeled the compositionality of words by creating character-level models of meaning, reducing problems of sparsity for rare words.

text-classification Text Classification

Softmax Q-Distribution Estimation for Structured Prediction: A Theoretical Interpretation for RAML

no code implementations ICLR 2018 Xuezhe Ma, Pengcheng Yin, Jingzhou Liu, Graham Neubig, Eduard Hovy

Reward augmented maximum likelihood (RAML), a simple and effective learning framework to directly optimize towards the reward function in structured prediction tasks, has led to a number of impressive empirical successes.

Dependency Parsing Image Captioning +6

On-the-fly Operation Batching in Dynamic Computation Graphs

2 code implementations NeurIPS 2017 Graham Neubig, Yoav Goldberg, Chris Dyer

Dynamic neural network toolkits such as PyTorch, DyNet, and Chainer offer more flexibility for implementing models that cope with data of varying dimensions and structure, relative to toolkits that operate on statically declared computations (e.g., TensorFlow, CNTK, and Theano).
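
A toy sketch of the idea behind on-the-fly batching (illustrative only, not the paper's algorithm or DyNet's implementation): pending graph nodes are grouped by an operation signature so that each group runs as a single batched tensor operation.

    from collections import defaultdict
    import numpy as np

    # Group pending nodes by (op type, shape), then execute each bucket
    # as one batched call instead of many scalar-sized ones.
    pending = [("tanh", np.random.rand(8)) for _ in range(5)] + \
              [("relu", np.random.rand(8)) for _ in range(3)]

    buckets = defaultdict(list)
    for op, x in pending:
        buckets[(op, x.shape)].append(x)

    for (op, shape), xs in buckets.items():
        batch = np.stack(xs)             # one kernel launch per bucket
        out = np.tanh(batch) if op == "tanh" else np.maximum(batch, 0.0)
        print(op, batch.shape, "->", out.shape)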

Controllable Invariance through Adversarial Feature Learning

no code implementations NeurIPS 2017 Qizhe Xie, Zihang Dai, Yulun Du, Eduard Hovy, Graham Neubig

Learning meaningful representations that maintain the content necessary for a particular task while filtering away detrimental variations is a problem of great interest in machine learning.

General Classification Image Classification +1

Stronger Baselines for Trustable Results in Neural Machine Translation

1 code implementation WS 2017 Michael Denkowski, Graham Neubig

As a result, it is often difficult to determine whether improvements from research will carry over to systems deployed for real-world use.

Machine Translation NMT +1

How Would You Say It? Eliciting Lexically Diverse Dialogue for Supervised Semantic Parsing

no code implementations WS 2017 Abhilasha Ravichander, Thomas Manzini, Matthias Grabmair, Graham Neubig, Jonathan Francis, Eric Nyberg

Wang et al. (2015) proposed a method to build semantic parsing datasets by generating canonical utterances using a grammar and having crowdworkers paraphrase them into natural wording.

Semantic Parsing

A Continuous Relaxation of Beam Search for End-to-end Training of Neural Sequence Models

no code implementations 1 Aug 2017 Kartik Goyal, Graham Neubig, Chris Dyer, Taylor Berg-Kirkpatrick

In experiments, we show that optimizing this new training objective yields substantially better results on two sequence tasks (Named Entity Recognition and CCG Supertagging) when compared with both cross entropy trained greedy decoding and cross entropy trained beam decoding baselines.

CCG Supertagging Motion Segmentation +3

Handling Homographs in Neural Machine Translation

no code implementations NAACL 2018 Frederick Liu, Han Lu, Graham Neubig

Homographs, words with different meanings but the same surface form, have long caused difficulty for machine translation systems, as it is difficult to select the correct translation based on the context.

Machine Translation NMT +3

Transcribing Against Time

no code implementations 15 Sep 2017 Matthias Sperber, Graham Neubig, Jan Niehues, Satoshi Nakamura, Alex Waibel

We investigate the problem of manually correcting errors from an automatic speech transcript in a cost-sensitive fashion.

Improving Neural Machine Translation through Phrase-based Forced Decoding

no code implementations IJCNLP 2017 Jingyi Zhang, Masao Utiyama, Eiichiro Sumita, Graham Neubig, Satoshi Nakamura

Compared to traditional statistical machine translation (SMT), neural machine translation (NMT) often sacrifices adequacy for the sake of fluency.

Machine Translation NMT +1

Cavs: A Vertex-centric Programming Interface for Dynamic Neural Networks

no code implementations 11 Dec 2017 Hao Zhang, Shizhen Xu, Graham Neubig, Wei Dai, Qirong Ho, Guangwen Yang, Eric P. Xing

Recent deep learning (DL) models have moved beyond static network architectures to dynamic ones, handling data where the network structure changes every example, such as sequences of variable lengths, trees, and graphs.

graph construction Management +1

XNMT: The eXtensible Neural Machine Translation Toolkit

1 code implementation WS 2018 Graham Neubig, Matthias Sperber, Xinyi Wang, Matthieu Felix, Austin Matthews, Sarguna Padmanabhan, Ye Qi, Devendra Singh Sachan, Philip Arthur, Pierre Godard, John Hewitt, Rachid Riad, Liming Wang

In this paper we describe the design of XNMT and its experiment configuration system, and demonstrate its utility on the tasks of machine translation, speech recognition, and multi-tasked machine translation/parsing.

Machine Translation NMT +3

Neural Lattice Language Models

1 code implementation TACL 2018 Jacob Buckman, Graham Neubig

In this work, we propose a new language modeling paradigm that has the ability to perform both prediction and moderation of information flow at multiple granularities: neural lattice language models.

Language Modelling Sentence

Self-Attentional Acoustic Models

1 code implementation 26 Mar 2018 Matthias Sperber, Jan Niehues, Graham Neubig, Sebastian Stüker, Alex Waibel

Self-attention is a method of encoding sequences of vectors by relating these vectors to each other based on pairwise similarities.

Attentive Interaction Model: Modeling Changes in View in Argumentation

1 code implementation NAACL 2018 Yohan Jo, Shivani Poddar, Byungsoo Jeon, Qinlan Shen, Carolyn P. Rose, Graham Neubig

We present a neural architecture for modeling argumentative dialogue that explicitly models the interplay between an Opinion Holder's (OH's) reasoning and a challenger's argument, with the goal of predicting if the argument successfully changes the OH's view.

Guiding Neural Machine Translation with Retrieved Translation Pieces

no code implementations NAACL 2018 Jingyi Zhang, Masao Utiyama, Eiichiro Sumita, Graham Neubig, Satoshi Nakamura

Specifically, for an input sentence, we use a search engine to retrieve sentence pairs whose source sides are similar to the input sentence, and then collect n-grams that are both in the retrieved target sentences and aligned with words that match in the source sentences, which we call "translation pieces".
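
A rough sketch of collecting such pieces under simplifying assumptions (exact word matching and a given alignment; the paper uses fuzzy retrieval and learned word alignments):

    def translation_pieces(src, retrieved, n_max=4):
        # Collect target n-grams whose aligned source words also occur in
        # the input sentence. "align" maps target index -> source index.
        pieces = set()
        src_words = set(src)
        for r_src, r_tgt, align in retrieved:
            matched = {j for j, i in align.items() if r_src[i] in src_words}
            for n in range(1, n_max + 1):
                for j in range(len(r_tgt) - n + 1):
                    if all(k in matched for k in range(j, j + n)):
                        pieces.add(tuple(r_tgt[j:j + n]))
        return pieces

    retrieved = [(["ich", "mag", "katzen"], ["i", "like", "cats"],
                  {0: 0, 1: 1, 2: 2})]
    print(translation_pieces(["ich", "mag", "hunde"], retrieved))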

Machine Translation NMT +3

When and Why are Pre-trained Word Embeddings Useful for Neural Machine Translation?

1 code implementation NAACL 2018 Ye Qi, Devendra Singh Sachan, Matthieu Felix, Sarguna Janani Padmanabhan, Graham Neubig

The performance of Neural Machine Translation (NMT) systems often suffers in low-resource scenarios where sufficiently large-scale parallel corpora cannot be obtained.

Machine Translation NMT +2

Stack-Pointer Networks for Dependency Parsing

3 code implementations ACL 2018 Xuezhe Ma, Zecong Hu, Jingzhou Liu, Nanyun Peng, Graham Neubig, Eduard Hovy

Combining pointer networks (Vinyals et al., 2015) with an internal stack, the proposed model first reads and encodes the whole sentence, then builds the dependency tree top-down (from root to leaf) in a depth-first fashion.
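
A rough sketch of that decoding order (illustrative only; choose_child stands in for the learned pointer network, here replaced by a toy oracle):

    # Top-down, depth-first tree construction driven by an internal stack.
    def decode(choose_child):
        arcs, stack = [], [0]                 # index 0 is the ROOT symbol
        while stack:
            head = stack[-1]
            child = choose_child(head, arcs)  # point at a remaining word
            if child is None:
                stack.pop()                   # no children left: backtrack
            else:
                arcs.append((head, child))
                stack.append(child)           # descend depth-first
        return arcs

    gold = {1: 0, 2: 1, 3: 1}                 # toy gold heads (word -> head)
    def oracle(head, arcs):
        attached = {c for _, c in arcs}
        return next((w for w, h in gold.items()
                     if h == head and w not in attached), None)

    print(decode(oracle))                     # [(0, 1), (1, 2), (1, 3)]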

Dependency Parsing Sentence

Extreme Adaptation for Personalized Neural Machine Translation

1 code implementation ACL 2018 Paul Michel, Graham Neubig

Every person speaks or writes their own flavor of their native language, influenced by a number of factors: the content they tend to talk about, their gender, their social status, or their geographical origin.

Machine Translation Translation

Automatic Estimation of Simultaneous Interpreter Performance

1 code implementation ACL 2018 Craig Stewart, Nikolai Vogler, Junjie Hu, Jordan Boyd-Graber, Graham Neubig

Simultaneous interpretation, translation of the spoken word in real-time, is both highly challenging and physically demanding.

Machine Translation Translation

Learning to Mine Aligned Code and Natural Language Pairs from Stack Overflow

no code implementations 23 May 2018 Pengcheng Yin, Bowen Deng, Edgar Chen, Bogdan Vasilescu, Graham Neubig

For tasks like code synthesis from natural language, code retrieval, and code summarization, data-driven models have shown great promise.

Code Summarization Retrieval +1

Modelling Natural Language, Programs, and their Intersection

no code implementations NAACL 2018 Graham Neubig, Miltiadis Allamanis

As a result, in the past several years there has been an increasing research interest in methods that focus on the intersection of programming and natural language, allowing users to use natural language to interact with computers in the complex ways that programs allow us to do.

Semantic Parsing Text Generation

Stress Test Evaluation for Natural Language Inference

1 code implementation COLING 2018 Aakanksha Naik, Abhilasha Ravichander, Norman Sadeh, Carolyn Rose, Graham Neubig

Natural language inference (NLI) is the task of determining if a natural language hypothesis can be inferred from a given premise in a justifiable manner.

Natural Language Inference Natural Language Understanding +1

Multi-Source Neural Machine Translation with Missing Data

no code implementations WS 2018 Yuta Nishimura, Katsuhito Sudoh, Graham Neubig, Satoshi Nakamura

This study focuses on the use of incomplete multilingual corpora in multi-encoder NMT and mixture of NMT experts and examines a very simple implementation where missing source translations are replaced by a special symbol <NULL>.
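
A minimal sketch of that replacement (data and language codes are illustrative): a multi-source example keeps one slot per source language, and a missing translation becomes the special symbol instead of the example being dropped.

    # Replace missing source-language inputs with the <NULL> symbol.
    example = {"en": "good morning", "fr": None, "de": "guten morgen"}
    inputs = {lang: text if text is not None else "<NULL>"
              for lang, text in example.items()}
    print(inputs)   # {'en': 'good morning', 'fr': '<NULL>', 'de': 'guten morgen'}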

Machine Translation NMT +1

Findings of the Second Workshop on Neural Machine Translation and Generation

no code implementations WS 2018 Alexandra Birch, Andrew Finch, Minh-Thang Luong, Graham Neubig, Yusuke Oda

This document describes the findings of the Second Workshop on Neural Machine Translation and Generation, held in concert with the annual conference of the Association for Computational Linguistics (ACL 2018).

Data Augmentation Domain Adaptation +2

StructVAE: Tree-structured Latent Variable Models for Semi-supervised Semantic Parsing

7 code implementations ACL 2018 Pengcheng Yin, Chunting Zhou, Junxian He, Graham Neubig

Semantic parsing is the task of transducing natural language (NL) utterances into formal meaning representations (MRs), commonly represented as tree structures.

Code Generation Semantic Parsing

Rapid Adaptation of Neural Machine Translation to New Languages

1 code implementation EMNLP 2018 Graham Neubig, Junjie Hu

This paper examines the problem of adapting neural machine translation systems to new, low-resourced languages (LRLs) as effectively and rapidly as possible.

Machine Translation Translation

Contextual Parameter Generation for Universal Neural Machine Translation

1 code implementation EMNLP 2018 Emmanouil Antonios Platanios, Mrinmaya Sachan, Graham Neubig, Tom Mitchell

We propose a simple modification to existing neural machine translation (NMT) models that enables using a single universal model to translate between multiple languages while allowing for language specific parameterization, and that can also be used for domain adaptation.

Domain Adaptation Machine Translation +2

Unsupervised Learning of Syntactic Structure with Invertible Neural Projections

1 code implementation EMNLP 2018 Junxian He, Graham Neubig, Taylor Berg-Kirkpatrick

In this work, we propose a novel generative model that jointly learns discrete syntactic structure and continuous word representations in an unsupervised fashion by cascading an invertible neural network with a structured generative prior.

Constituency Grammar Induction POS +1

A Tree-based Decoder for Neural Machine Translation

1 code implementation EMNLP 2018 Xinyi Wang, Hieu Pham, Pengcheng Yin, Graham Neubig

Recent advances in Neural Machine Translation (NMT) show that adding syntactic information to NMT systems can improve the quality of their translations.

Machine Translation NMT +2

Retrieval-Based Neural Code Generation

1 code implementation EMNLP 2018 Shirley Anugrah Hayati, Raphael Olivier, Pravalika Avvaru, Pengcheng Yin, Anthony Tomasic, Graham Neubig

In models to generate program source code from natural language, representing this code in a tree structure has been a common approach.

Code Generation Retrieval +2

Neural Cross-Lingual Named Entity Recognition with Minimal Resources

1 code implementation EMNLP 2018 Jiateng Xie, Zhilin Yang, Graham Neubig, Noah A. Smith, Jaime Carbonell

To improve robustness to word order differences, we propose to use self-attention, which allows for a degree of flexibility with respect to word order.

named-entity-recognition Named Entity Recognition +2

Parameter Sharing Methods for Multilingual Self-Attentional Translation Models

1 code implementation WS 2018 Devendra Singh Sachan, Graham Neubig

In multilingual neural machine translation, it has been shown that sharing a single translation model between multiple languages can achieve competitive performance, sometimes even leading to performance gains over bilingually trained models.

Machine Translation Translation

Contextual Encoding for Translation Quality Estimation

1 code implementation WS 2018 Junjie Hu, Wei-Cheng Chang, Yuexin Wu, Graham Neubig

In this paper, we propose a method to effectively encode the local and global contextual information for each target word using a three-part neural network approach.

Sentence Translation

MTNT: A Testbed for Machine Translation of Noisy Text

2 code implementations EMNLP 2018 Paul Michel, Graham Neubig

In this paper, we propose a benchmark dataset for Machine Translation of Noisy Text (MTNT), consisting of noisy comments on Reddit (www.reddit.com) and professionally sourced translations.

Machine Translation Translation

BLISS in Non-Isometric Embedding Spaces

no code implementations 27 Sep 2018 Barun Patra, Joel Ruben Antony Moniz, Sarthak Garg, Matthew R Gormley, Graham Neubig

We then propose Bilingual Lexicon Induction with Semi-Supervision (BLISS) --- a novel semi-supervised approach that relaxes the isometric assumption while leveraging both limited aligned bilingual lexicons and a larger set of unaligned word embeddings, as well as a novel hubness filtering technique.

Bilingual Lexicon Induction Word Embeddings

Measuring Density and Similarity of Task Relevant Information in Neural Representations

no code implementations 27 Sep 2018 Danish Pruthi, Mansi Gupta, Nitish Kumar Kulkarni, Graham Neubig, Eduard Hovy

Neural models achieve state-of-the-art performance due to their ability to extract salient features useful to downstream tasks.

Sentence Transfer Learning

TRANX: A Transition-based Neural Abstract Syntax Parser for Semantic Parsing and Code Generation

4 code implementations EMNLP 2018 Pengcheng Yin, Graham Neubig

We present TRANX, a transition-based neural semantic parser that maps natural language (NL) utterances into formal meaning representations (MRs).

Code Generation Semantic Parsing

Optimizing Segmentation Granularity for Neural Machine Translation

no code implementations 19 Oct 2018 Elizabeth Salesky, Andrew Runge, Alex Coda, Jan Niehues, Graham Neubig

However, the granularity of these subword units is a hyperparameter to be tuned for each language and task, using methods such as grid search.
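
Such a grid search might look like the sketch below, assuming the sentencepiece library; the training file and the evaluation step are placeholders, not artifacts from the paper.

    import sentencepiece as spm  # assumes the sentencepiece package is installed

    # Train one subword model per candidate granularity, then score each by
    # downstream translation quality. train_and_eval_bleu is hypothetical.
    for vocab_size in [4000, 8000, 16000, 32000]:
        spm.SentencePieceTrainer.train(
            input="train.src", model_prefix=f"bpe_{vocab_size}",
            vocab_size=vocab_size, model_type="bpe")
        # bleu = train_and_eval_bleu(f"bpe_{vocab_size}.model")  # hypothetical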

Machine Translation NMT +1

Learning to Describe Phrases with Local and Global Contexts

1 code implementation 1 Nov 2018 Shonosuke Ishiwatari, Hiroaki Hayashi, Naoki Yoshinaga, Graham Neubig, Shoetsu Sato, Masashi Toyoda, Masaru Kitsuregawa

When reading a text, it is common to become stuck on unfamiliar words and phrases, such as polysemous words with novel senses, rarely used idioms, internet slang, or emerging entities.

Reading Comprehension

Zero-shot Neural Transfer for Cross-lingual Entity Linking

1 code implementation 9 Nov 2018 Shruti Rijhwani, Jiateng Xie, Graham Neubig, Jaime Carbonell

To address this problem, we investigate zero-shot cross-lingual entity linking, in which we assume no bilingual lexical resources are available in the source low-resource language.

Cross-Lingual Entity Linking Entity Linking

Towards a General-Purpose Linguistic Annotation Backend

no code implementations13 Dec 2018 Graham Neubig, Patrick Littell, Chian-Yu Chen, Jean Lee, Zirui Li, Yu-Hsiang Lin, Yuyan Zhang

In this extended abstract, we describe the beginnings of a new project that will attempt to ease this language documentation process through the use of natural language processing (NLP) technology.

Management

Lagging Inference Networks and Posterior Collapse in Variational Autoencoders

2 code implementations ICLR 2019 Junxian He, Daniel Spokoyny, Graham Neubig, Taylor Berg-Kirkpatrick

The variational autoencoder (VAE) is a popular combination of a deep latent variable model and an accompanying variational learning technique.

Text Generation

An Adversarial Approach to High-Quality, Sentiment-Controlled Neural Dialogue Generation

no code implementations 22 Jan 2019 Xiang Kong, Bohan Li, Graham Neubig, Eduard Hovy, Yiming Yang

In this work, we propose a method for neural dialogue response generation that allows not only generating semantically reasonable responses according to the dialogue history, but also explicitly controlling the sentiment of the response via sentiment labels.

Dialogue Generation Response Generation +1

Multilingual Neural Machine Translation With Soft Decoupled Encoding

1 code implementation ICLR 2019 Xinyi Wang, Hieu Pham, Philip Arthur, Graham Neubig

Multilingual training of neural machine translation (NMT) systems has led to impressive accuracy improvements on low-resource languages.

Machine Translation NMT +1

The ARIEL-CMU Systems for LoReHLT18

no code implementations 24 Feb 2019 Aditi Chaudhary, Siddharth Dalmia, Junjie Hu, Xinjian Li, Austin Matthews, Aldrian Obaja Muis, Naoki Otani, Shruti Rijhwani, Zaid Sheikh, Nidhi Vyas, Xinyi Wang, Jiateng Xie, Ruochen Xu, Chunting Zhou, Peter J. Jansen, Yiming Yang, Lori Levin, Florian Metze, Teruko Mitamura, David R. Mortensen, Graham Neubig, Eduard Hovy, Alan W. Black, Jaime Carbonell, Graham V. Horwood, Shabnam Tafreshi, Mona Diab, Efsun S. Kayi, Noura Farra, Kathleen McKeown

This paper describes the ARIEL-CMU submissions to the Low Resource Human Language Technologies (LoReHLT) 2018 evaluations for the tasks Machine Translation (MT), Entity Discovery and Linking (EDL), and detection of Situation Frames in Text and Speech (SF Text and Speech).

Machine Translation Translation

On Evaluation of Adversarial Perturbations for Sequence-to-Sequence Models

1 code implementation NAACL 2019 Paul Michel, Xi-An Li, Graham Neubig, Juan Miguel Pino

Adversarial examples --- perturbations to the input of a model that elicit large changes in the output --- have been shown to be an effective way of assessing the robustness of sequence-to-sequence (seq2seq) models.

Adversarial Robustness Machine Translation

compare-mt: A Tool for Holistic Comparison of Language Generation Systems

2 code implementations NAACL 2019 Graham Neubig, Zi-Yi Dou, Junjie Hu, Paul Michel, Danish Pruthi, Xinyi Wang, John Wieting

In this paper, we describe compare-mt, a tool for holistic analysis and comparison of the results of systems for language generation tasks such as machine translation.

Machine Translation Sentence +2

Competence-based Curriculum Learning for Neural Machine Translation

1 code implementation NAACL 2019 Emmanouil Antonios Platanios, Otilia Stretcu, Graham Neubig, Barnabas Poczos, Tom M. Mitchell

In this paper, we propose a curriculum learning framework for NMT that reduces training time, reduces the need for specialized heuristics or large batch sizes, and results in overall better performance.

Machine Translation NMT +1

Lost in Interpretation: Predicting Untranslated Terminology in Simultaneous Interpretation

1 code implementation NAACL 2019 Nikolai Vogler, Craig Stewart, Graham Neubig

Simultaneous interpretation, the translation of speech from one language to another in real-time, is an inherently difficult and strenuous task.

Translation

Density Matching for Bilingual Word Embedding

1 code implementation NAACL 2019 Chunting Zhou, Xuezhe Ma, Di Wang, Graham Neubig

Recent approaches to cross-lingual word embedding have generally been based on linear transformations between the sets of embedding vectors in the two languages.

Bilingual Lexicon Induction Word Embeddings +1

Attention-Passing Models for Robust and Data-Efficient End-to-End Speech Translation

no code implementations TACL 2019 Matthias Sperber, Graham Neubig, Jan Niehues, Alex Waibel

Speech translation has traditionally been approached through cascaded models consisting of a speech recognizer trained on a corpus of transcribed speech, and a machine translation system trained on parallel texts.

Machine Translation speech-recognition +2

On Meaning-Preserving Adversarial Perturbations for Sequence-to-Sequence Models

no code implementations ICLR 2019 Paul Michel, Graham Neubig, Xi-An Li, Juan Miguel Pino

Adversarial examples have been shown to be an effective way of assessing the robustness of neural sequence-to-sequence (seq2seq) models, by applying perturbations to the input of a model leading to large degradation in performance.

Adversarial Robustness Machine Translation +1

Target Conditioned Sampling: Optimizing Data Selection for Multilingual Neural Machine Translation

no code implementations ACL 2019 Xinyi Wang, Graham Neubig

To improve low-resource Neural Machine Translation (NMT) with multilingual corpora, training on the most related high-resource language only is often more effective than using all data available (Neubig and Hu, 2018).

Low-Resource Neural Machine Translation NMT +2

Are Sixteen Heads Really Better than One?

3 code implementations NeurIPS 2019 Paul Michel, Omer Levy, Graham Neubig

Attention is a powerful and ubiquitous mechanism for allowing neural models to focus on particular salient pieces of information by taking their weighted average when making predictions.
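
A sketch of the mechanism the abstract describes (single-head, illustrative only): scores between a query and each position are softmax-normalized and used to take a weighted average of the value vectors.

    import numpy as np

    def attend(query, keys, values):
        scores = keys @ query                  # one similarity per position
        weights = np.exp(scores - scores.max())
        weights /= weights.sum()               # softmax over positions
        return weights @ values                # weighted average

    keys, values = np.random.rand(5, 4), np.random.rand(5, 4)
    print(attend(np.random.rand(4), keys, values))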

Choosing Transfer Languages for Cross-Lingual Learning

1 code implementation ACL 2019 Yu-Hsiang Lin, Chian-Yu Chen, Jean Lee, Zirui Li, Yuyan Zhang, Mengzhou Xia, Shruti Rijhwani, Junxian He, Zhisong Zhang, Xuezhe Ma, Antonios Anastasopoulos, Patrick Littell, Graham Neubig

Cross-lingual transfer, where a high-resource transfer language is used to improve the accuracy of a low-resource task language, is now an invaluable tool for improving performance of natural language processing (NLP) on low-resource languages.

Cross-Lingual Transfer

Improving Open Information Extraction via Iterative Rank-Aware Learning

1 code implementation ACL 2019 Zhengbao Jiang, Pengcheng Yin, Graham Neubig

We found that the extraction likelihood, a confidence measure used by current supervised open IE systems, is not well calibrated when comparing the quality of assertions extracted from different sentences.

Binary Classification General Classification +1

Learning to Describe Unknown Phrases with Local and Global Contexts

no code implementations NAACL 2019 Shonosuke Ishiwatari, Hiroaki Hayashi, Naoki Yoshinaga, Graham Neubig, Shoetsu Sato, Masashi Toyoda, Masaru Kitsuregawa

When reading a text, it is common to become stuck on unfamiliar words and phrases, such as polysemous words with novel senses, rarely used idioms, internet slang, or emerging entities.

Self-Attentional Models for Lattice Inputs

no code implementations ACL 2019 Matthias Sperber, Graham Neubig, Ngoc-Quan Pham, Alex Waibel

Lattices are an efficient and effective method to encode ambiguity of upstream systems in natural language processing tasks, for example to compactly capture multiple speech recognition hypotheses, or to represent multiple linguistic analyses.

Computational Efficiency speech-recognition +2

Generalized Data Augmentation for Low-Resource Translation

no code implementations ACL 2019 Mengzhou Xia, Xiang Kong, Antonios Anastasopoulos, Graham Neubig

Translation to or from low-resource languages (LRLs) poses challenges for machine translation in terms of both adequacy and fluency.

Data Augmentation Translation +1

Reranking for Neural Semantic Parsing

no code implementations ACL 2019 Pengcheng Yin, Graham Neubig

Semantic parsing considers the task of transducing natural language (NL) utterances into machine executable meaning representations (MRs).

Code Generation Semantic Parsing

Beyond BLEU: Training Neural Machine Translation with Semantic Similarity

no code implementations ACL 2019 John Wieting, Taylor Berg-Kirkpatrick, Kevin Gimpel, Graham Neubig

While most neural machine translation (NMT) systems are still trained using maximum likelihood estimation, recent work has demonstrated that optimizing systems to directly improve evaluation metrics such as BLEU can significantly improve final translation accuracy.

Machine Translation NMT +3

Improving Robustness of Neural Machine Translation with Multi-task Learning

1 code implementation WS 2019 Shuyan Zhou, Xiangkai Zeng, Yingqi Zhou, Antonios Anastasopoulos, Graham Neubig

While neural machine translation (NMT) achieves remarkable performance on clean, in-domain text, performance is known to degrade drastically when facing text which is full of typos, grammatical errors and other varieties of noise.

Machine Translation Multi-Task Learning +2

Mitigating Noisy Inputs for Question Answering

no code implementations 8 Aug 2019 Denis Peskov, Joe Barrow, Pedro Rodriguez, Graham Neubig, Jordan Boyd-Graber

We investigate and mitigate the effects of noise from Automatic Speech Recognition systems on two factoid Question Answering (QA) tasks.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +6

Bilingual Lexicon Induction with Semi-supervision in Non-Isometric Embedding Spaces

1 code implementation ACL 2019 Barun Patra, Joel Ruben Antony Moniz, Sarthak Garg, Matthew R. Gormley, Graham Neubig

We then propose Bilingual Lexicon Induction with Semi-Supervision (BLISS) --- a semi-supervised approach that relaxes the isometric assumption while leveraging both limited aligned bilingual lexicons and a larger set of unaligned word embeddings, as well as a novel hubness filtering technique.

Bilingual Lexicon Induction Word Embeddings

Latent Relation Language Models

no code implementations 21 Aug 2019 Hiroaki Hayashi, Zecong Hu, Chenyan Xiong, Graham Neubig

In this paper, we propose Latent Relation Language Models (LRLMs), a class of language models that parameterizes the joint distribution over the words in a document and the entities that occur therein via knowledge graph relations.

Language Modelling Relation

A Little Annotation does a Lot of Good: A Study in Bootstrapping Low-resource Named Entity Recognizers

1 code implementation IJCNLP 2019 Aditi Chaudhary, Jiateng Xie, Zaid Sheikh, Graham Neubig, Jaime G. Carbonell

Most state-of-the-art models for named entity recognition (NER) rely on the availability of large amounts of labeled data, making them challenging to extend to new, lower-resourced languages.

Active Learning Cross-Lingual Transfer +4

Handling Syntactic Divergence in Low-resource Machine Translation

1 code implementation IJCNLP 2019 Chunting Zhou, Xuezhe Ma, Junjie Hu, Graham Neubig

Despite impressive empirical successes of neural machine translation (NMT) on standard benchmarks, limited parallel data impedes the application of NMT models to many language pairs.

Data Augmentation Machine Translation +2

Contextualized Representations for Low-resource Utterance Tagging

no code implementations WS 2019 Bhargavi Paranjape, Graham Neubig

Utterance-level analysis of the speaker's intentions and emotions is a core task in conversational understanding.

Emotion Classification

What Makes A Good Story? Designing Composite Rewards for Visual Storytelling

1 code implementation 11 Sep 2019 Junjie Hu, Yu Cheng, Zhe Gan, Jingjing Liu, Jianfeng Gao, Graham Neubig

Previous storytelling approaches mostly focused on optimizing traditional metrics such as BLEU, ROUGE and CIDEr.

Visual Storytelling

Beyond BLEU: Training Neural Machine Translation with Semantic Similarity

1 code implementation 14 Sep 2019 John Wieting, Taylor Berg-Kirkpatrick, Kevin Gimpel, Graham Neubig

While most neural machine translation (NMT) systems are still trained using maximum likelihood estimation, recent work has demonstrated that optimizing systems to directly improve evaluation metrics such as BLEU can substantially improve final translation accuracy.

Machine Translation NMT +3

Learning to Deceive with Attention-Based Explanations

3 code implementations ACL 2020 Danish Pruthi, Mansi Gupta, Bhuwan Dhingra, Graham Neubig, Zachary C. Lipton

Attention mechanisms are ubiquitous components in neural architectures applied to natural language processing.

Fairness

Regularizing Trajectories to Mitigate Catastrophic Forgetting

no code implementations 25 Sep 2019 Paul Michel, Elisabeth Salesky, Graham Neubig

Regularization-based continual learning approaches generally prevent catastrophic forgetting by augmenting the training loss with an auxiliary objective.

Continual Learning

Towards Zero-resource Cross-lingual Entity Linking

1 code implementation WS 2019 Shuyan Zhou, Shruti Rijhwani, Graham Neubig

Cross-lingual entity linking (XEL) grounds named entities in a source language to an English Knowledge Base (KB), such as Wikipedia.

Cross-Lingual Entity Linking Entity Linking

Simple and Effective Paraphrastic Similarity from Parallel Translations

4 code implementations ACL 2019 John Wieting, Kevin Gimpel, Graham Neubig, Taylor Berg-Kirkpatrick

We present a model and methodology for learning paraphrastic sentence embeddings directly from bitext, removing the time-consuming intermediate step of creating paraphrase corpora.

Sentence Sentence Embeddings

Findings of the Third Workshop on Neural Generation and Translation

no code implementations WS 2019 Hiroaki Hayashi, Yusuke Oda, Alexandra Birch, Ioannis Konstas, Andrew Finch, Minh-Thang Luong, Graham Neubig, Katsuhito Sudoh

This document describes the findings of the Third Workshop on Neural Generation and Translation, held in concert with the Conference on Empirical Methods in Natural Language Processing (EMNLP 2019).

Machine Translation NMT +1

Comparing Top-Down and Bottom-Up Neural Generative Dependency Models

no code implementations CONLL 2019 Austin Matthews, Graham Neubig, Chris Dyer

Recurrent neural network grammars generate sentences using phrase-structure syntax and perform very well on both parsing and language modeling.

Language Modelling

Understanding Knowledge Distillation in Non-autoregressive Machine Translation

no code implementations ICLR 2020 Chunting Zhou, Graham Neubig, Jiatao Gu

We find that knowledge distillation can reduce the complexity of data sets and help NAT to model the variations in the output data.

Knowledge Distillation Machine Translation +1

A Bilingual Generative Transformer for Semantic Sentence Embedding

2 code implementations EMNLP 2020 John Wieting, Graham Neubig, Taylor Berg-Kirkpatrick

Semantic sentence embedding models encode natural language sentences into vectors, such that closeness in embedding space indicates closeness in the semantics between the sentences.

Semantic Similarity Semantic Textual Similarity +3

Generalizing Natural Language Analysis through Span-relation Representations

3 code implementations ACL 2020 Zhengbao Jiang, Wei Xu, Jun Araki, Graham Neubig

Natural language processing covers a wide variety of tasks predicting syntax, semantics, and information content, and usually each type of output is generated with specially designed architectures.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +8

Optimizing Data Usage via Differentiable Rewards

1 code implementation ICML 2020 Xinyi Wang, Hieu Pham, Paul Michel, Antonios Anastasopoulos, Jaime Carbonell, Graham Neubig

To acquire a new skill, humans learn better and faster if a tutor, based on their current knowledge level, informs them of how much attention they should pay to particular content or practice problems.

Image Classification Machine Translation

How Can We Know What Language Models Know?

1 code implementation TACL 2020 Zhengbao Jiang, Frank F. Xu, Jun Araki, Graham Neubig

Recent work has presented intriguing results examining the knowledge contained in language models (LM) by having the LM fill in the blanks of prompts such as "Obama is a _ by profession".
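
An illustration of this fill-in-the-blank style of probing (assuming the HuggingFace transformers package; the paper goes further by mining and paraphrasing better prompts, this is only the base query):

    from transformers import pipeline  # assumes transformers is installed

    # Query a masked LM with a relational prompt and inspect its top guesses.
    unmasker = pipeline("fill-mask", model="bert-base-uncased")
    for pred in unmasker("Obama is a [MASK] by profession.")[:3]:
        print(pred["token_str"], round(pred["score"], 3))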

Merging Weak and Active Supervision for Semantic Parsing

1 code implementation 29 Nov 2019 Ansong Ni, Pengcheng Yin, Graham Neubig

Experiments on WikiTableQuestions with human annotators show that our method can improve the performance with only 100 active queries, especially for weakly-supervised parsers learnt from a cold start.

Active Learning Semantic Parsing

A Probabilistic Formulation of Unsupervised Text Style Transfer

5 code implementations ICLR 2020 Junxian He, Xinyi Wang, Graham Neubig, Taylor Berg-Kirkpatrick

Across all style transfer tasks, our approach yields substantial gains over state-of-the-art non-generative baselines, including the state-of-the-art unsupervised machine translation techniques that our approach generalizes.

Decipherment Language Modelling +6

Differentiable Reasoning over a Virtual Knowledge Base

1 code implementation ICLR 2020 Bhuwan Dhingra, Manzil Zaheer, Vidhisha Balachandran, Graham Neubig, Ruslan Salakhutdinov, William W. Cohen

In particular, we describe a neural module, DrKIT, that traverses textual data like a KB, softly following paths of relations between mentions of entities in the corpus.

Re-Ranking

Improving Candidate Generation for Low-resource Cross-lingual Entity Linking

1 code implementation TACL 2020 Shuyan Zhou, Shruti Rijhwani, John Wieting, Jaime Carbonell, Graham Neubig

Cross-lingual entity linking (XEL) is the task of finding referents in a target-language knowledge base (KB) for mentions extracted from source-language texts.

Cross-Lingual Entity Linking Entity Linking +1

XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalization

4 code implementations 24 Mar 2020 Junjie Hu, Sebastian Ruder, Aditya Siddhant, Graham Neubig, Orhan Firat, Melvin Johnson

However, these broad-coverage benchmarks have been mostly limited to English, and despite an increasing interest in multilingual models, a benchmark that enables the comprehensive evaluation of such methods on a diverse range of languages and tasks is still missing.

Cross-Lingual Transfer Retrieval +1

A Set of Recommendations for Assessing Human-Machine Parity in Language Translation

1 code implementation 3 Apr 2020 Samuel Läubli, Sheila Castilho, Graham Neubig, Rico Sennrich, Qinlan Shen, Antonio Toral

The quality of machine translation has increased remarkably over the past years, to the degree that it was found to be indistinguishable from professional human translation in a number of empirical investigations.

Machine Translation Translation

Dynamic Data Selection and Weighting for Iterative Back-Translation

1 code implementation EMNLP 2020 Zi-Yi Dou, Antonios Anastasopoulos, Graham Neubig

Back-translation has proven to be an effective method to utilize monolingual data in neural machine translation (NMT), and iteratively conducting back-translation can further improve the model performance.

Domain Adaptation Machine Translation +3

Weight Poisoning Attacks on Pre-trained Models

2 code implementations 14 Apr 2020 Keita Kurita, Paul Michel, Graham Neubig

We show that by applying a regularization method, which we call RIPPLe, and an initialization procedure, which we call Embedding Surgery, such attacks are possible even with limited knowledge of the dataset and fine-tuning procedure.

Sentiment Analysis Sentiment Classification +1

Balancing Training for Multilingual Neural Machine Translation

2 code implementations ACL 2020 Xinyi Wang, Yulia Tsvetkov, Graham Neubig

When training multilingual machine translation (MT) models that can translate to/from multiple languages, we are faced with imbalanced training sets: some languages have much more training data than others.

Machine Translation Translation

AlloVera: A Multilingual Allophone Database

no code implementations LREC 2020 David R. Mortensen, Xinjian Li, Patrick Littell, Alexis Michaud, Shruti Rijhwani, Antonios Anastasopoulos, Alan W. Black, Florian Metze, Graham Neubig

While phonemic representations are language specific, phonetic representations (stated in terms of (allo)phones) are much closer to a universal (language-independent) transcription.

speech-recognition Speech Recognition

Practical Comparable Data Collection for Low-Resource Languages via Images

1 code implementation 24 Apr 2020 Aman Madaan, Shruti Rijhwani, Antonios Anastasopoulos, Yiming Yang, Graham Neubig

We propose a method of curating high-quality comparable training data for low-resource languages with monolingual annotators.

Machine Translation Translation

Politeness Transfer: A Tag and Generate Approach

2 code implementations ACL 2020 Aman Madaan, Amrith Setlur, Tanmay Parekh, Barnabas Poczos, Graham Neubig, Yiming Yang, Ruslan Salakhutdinov, Alan W. Black, Shrimai Prabhumoye

This paper introduces a new task of politeness transfer which involves converting non-polite sentences to polite sentences while preserving the meaning.

Sentence Style Transfer +1

Predicting Performance for Natural Language Processing Tasks

1 code implementation ACL 2020 Mengzhou Xia, Antonios Anastasopoulos, Ruochen Xu, Yiming Yang, Graham Neubig

Given the complexity of combinations of tasks, languages, and domains in natural language processing (NLP) research, it is computationally prohibitive to exhaustively test newly proposed models on each possible experimental setting.

Soft Gazetteers for Low-Resource Named Entity Recognition

1 code implementation ACL 2020 Shruti Rijhwani, Shuyan Zhou, Graham Neubig, Jaime Carbonell

However, designing such features for low-resource languages is challenging, because exhaustive entity gazetteers do not exist in these languages.

Cross-Lingual Entity Linking Entity Linking +4

TaBERT: Pretraining for Joint Understanding of Textual and Tabular Data

1 code implementation ACL 2020 Pengcheng Yin, Graham Neubig, Wen-tau Yih, Sebastian Riedel

Recent years have witnessed the burgeoning of pretrained language models (LMs) for text-based natural language (NL) understanding tasks.

Ranked #10 on Text-To-SQL on Spider (Exact Match Accuracy (Dev) metric)

Semantic Parsing Text-To-SQL

Learning Sparse Prototypes for Text Generation

1 code implementation NeurIPS 2020 Junxian He, Taylor Berg-Kirkpatrick, Graham Neubig

While effective, these methods are inefficient at test time as a result of needing to store and index the entire training corpus.

Language Modelling Prototype Selection +4

Findings of the Fourth Workshop on Neural Generation and Translation

no code implementations WS 2020 Kenneth Heafield, Hiroaki Hayashi, Yusuke Oda, Ioannis Konstas, Andrew Finch, Graham Neubig, Xi-An Li, Alexandra Birch

We describe the findings of the Fourth Workshop on Neural Generation and Translation, held in concert with the annual conference of the Association for Computational Linguistics (ACL 2020).

Machine Translation NMT +1

Transliteration for Cross-Lingual Morphological Inflection

no code implementations WS 2020 Nikitha Murikinati, Antonios Anastasopoulos, Graham Neubig

Cross-lingual transfer between typologically related languages has been proven successful for the task of morphological inflection.

Cross-Lingual Transfer Morphological Inflection +1

The Return of Lexical Dependencies: Neural Lexicalized PCFGs

3 code implementations 29 Jul 2020 Hao Zhu, Yonatan Bisk, Graham Neubig

In this paper we demonstrate that context-free grammar (CFG) based methods for grammar induction benefit from modeling lexical dependencies.

Automatic Extraction of Rules Governing Morphological Agreement

1 code implementation EMNLP 2020 Aditi Chaudhary, Antonios Anastasopoulos, Adithya Pratapa, David R. Mortensen, Zaid Sheikh, Yulia Tsvetkov, Graham Neubig

Using cross-lingual transfer, even with no expert annotations in the language of interest, our framework extracts a grammatical specification which is nearly equivalent to those created with large amounts of gold-standard annotated data.

Cross-Lingual Transfer Descriptive

Improving Target-side Lexical Transfer in Multilingual Neural Machine Translation

no code implementations Findings of the Association for Computational Linguistics 2020 Luyu Gao, Xinyi Wang, Graham Neubig

To improve the performance of Neural Machine Translation (NMT) for low-resource languages (LRL), one effective strategy is to leverage parallel data from a related high-resource language (HRL).

Machine Translation NMT +1

X-FACTR: Multilingual Factual Knowledge Retrieval from Pretrained Language Models

1 code implementation EMNLP 2020 Zhengbao Jiang, Antonios Anastasopoulos, Jun Araki, Haibo Ding, Graham Neubig

We further propose a code-switching-based method to improve the ability of multilingual LMs to access knowledge, and verify its effectiveness on several benchmark languages.

Retrieval

Re-evaluating Evaluation in Text Summarization

1 code implementation EMNLP 2020 Manik Bhandari, Pranav Gour, Atabak Ashfaq, PengFei Liu, Graham Neubig

Automated evaluation metrics as a stand-in for manual evaluation are an essential part of the development of text-generation tasks such as text summarization.

Text Generation Text Summarization

GSum: A General Framework for Guided Neural Abstractive Summarization

1 code implementation NAACL 2021 Zi-Yi Dou, PengFei Liu, Hiroaki Hayashi, Zhengbao Jiang, Graham Neubig

Neural abstractive summarization models are flexible and can produce coherent summaries, but they are sometimes unfaithful and can be difficult to control.

Abstractive Text Summarization

Explicit Alignment Objectives for Multilingual Bidirectional Encoders

no code implementations NAACL 2021 Junjie Hu, Melvin Johnson, Orhan Firat, Aditya Siddhant, Graham Neubig

Pre-trained cross-lingual encoders such as mBERT (Devlin et al., 2019) and XLMR (Conneau et al., 2020) have proven to be impressively effective at enabling transfer-learning of NLP systems from high-resource languages to low-resource languages.

Retrieval Sentence +3

On Learning Text Style Transfer with Direct Rewards

1 code implementation NAACL 2021 Yixin Liu, Graham Neubig, John Wieting

In most cases, the lack of parallel corpora makes it impossible to directly train supervised models for the text style transfer task.

Machine Translation Semantic Similarity +4

Reducing Confusion in Active Learning for Part-Of-Speech Tagging

no code implementations 2 Nov 2020 Aditi Chaudhary, Antonios Anastasopoulos, Zaid Sheikh, Graham Neubig

Active learning (AL) uses a data selection algorithm to select useful training samples to minimize annotation cost.

Active Learning Part-Of-Speech Tagging +1

Weakly- and Semi-supervised Evidence Extraction

1 code implementation Findings of the Association for Computational Linguistics 2020 Danish Pruthi, Bhuwan Dhingra, Graham Neubig, Zachary C. Lipton

For many prediction tasks, stakeholders desire not only predictions but also supporting evidence that a human can use to verify its correctness.

Detecting Hallucinated Content in Conditional Neural Sequence Generation

2 code implementations Findings (ACL) 2021 Chunting Zhou, Graham Neubig, Jiatao Gu, Mona Diab, Paco Guzman, Luke Zettlemoyer, Marjan Ghazvininejad

Neural sequence models can generate highly fluent sentences, but recent studies have shown that they are also prone to hallucinate additional content not supported by the input.

Abstractive Text Summarization Hallucination +1

OCR Post Correction for Endangered Language Texts

2 code implementations EMNLP 2020 Shruti Rijhwani, Antonios Anastasopoulos, Graham Neubig

There is little to no data available to build natural language processing models for most endangered languages.

Optical Character Recognition (OCR)
