Search Results for author: Colin Cherry

Found 57 papers, 6 papers with code

Machine-learned solutions for three stages of clinical information extraction: the state of the art at i2b2 2010

no code implementations • JAMIA 2011 • Berry de Bruijn, Colin Cherry, Svetlana Kiritchenko, Joel Martin, Xiaodan Zhu

Objective: As clinical text mining continues to mature, its potential as an enabling technology for innovations in patient care and clinical research is becoming a reality.

Ranked #5 on Clinical Concept Extraction on 2010 i2b2/VA

BIG-bench Machine Learning Clinical Concept Extraction +1

Batch Tuning Strategies for Statistical Machine Translation

no code implementations • HLT 2012 • Colin Cherry, George Foster

Machine Translation Sentence +1

MSR SPLAT, a language analysis toolkit

no code implementations • NAACL 2012 • Chris Quirk, Pallavi Choudhury, Jianfeng Gao, Hisami Suzuki, Kristina Toutanova, Michael Gamon, Wen-tau Yih, Colin Cherry, Lucy Vanderwende

Constituency Parsing

On Hierarchical Re-ordering and Permutation Parsing for Phrase-based Decoding

no code implementations • WS 2012 • Colin Cherry, Robert C. Moore, Chris Quirk

Machine Translation

Paraphrasing for Style

no code implementations • COLING 2012 • Wei Xu, Alan Ritter, Bill Dolan, Ralph Grishman, Colin Cherry

Improved Reordering for Phrase-Based Translation using Sparse Features

no code implementations • NAACL 2013 • Colin Cherry

Machine Translation Translation

Reversing Morphological Tokenization in English-to-Arabic SMT

no code implementations • NAACL 2013 • Mohammad Salameh, Colin Cherry, Grzegorz Kondrak

Machine Translation Word Alignment

Regularized Minimum Error Rate Training

no code implementations • EMNLP 2013 • Michel Galley, Chris Quirk, Colin Cherry, Kristina Toutanova

Machine Translation

Lattice Desegmentation for Statistical Machine Translation

no code implementations • ACL 2014 • Mohammad Salameh, Colin Cherry, Grzegorz Kondrak

Language Modelling Machine Translation +2

A Systematic Comparison of Smoothing Techniques for Sentence-Level BLEU

no code implementations • WS 2014 • Boxing Chen, Colin Cherry

Machine Translation Sentence

NRC-Canada-2014: Detecting Aspects and Sentiment in Customer Reviews

no code implementations • SEMEVAL 2014 • Svetlana Kiritchenko, Xiaodan Zhu, Colin Cherry, Saif Mohammad

Aspect-Based Sentiment Analysis (ABSA)

Inflection Generation as Discriminative String Transduction

no code implementations • HLT 2015 • Grzegorz Kondrak, Colin Cherry, Garrett Nicolai

LEMMA Machine Translation +3

The Unreasonable Effectiveness of Word Representations for Twitter Named Entity Recognition

no code implementations • HLT 2015 • Colin Cherry, Hongyu Guo

Domain Adaptation Entity Linking +4

Morpho-syntactic Regularities in Continuous Word Representations: A multilingual study.

no code implementations • WS 2015 • Garrett Nicolai, Colin Cherry, Grzegorz Kondrak

Language Modelling

What Matters Most in Morphologically Segmented SMT Models?

no code implementations • WS 2015 • Mohammad Salameh, Colin Cherry, Grzegorz Kondrak

Transliteration

NRC: Infused Phrase Vectors for Named Entity Recognition in Twitter

no code implementations • WS 2015 • Colin Cherry, Hongyu Guo, Chengbi Dai

named-entity-recognition Named Entity Recognition +2

A Dataset for Detecting Stance in Tweets

no code implementations • LREC 2016 • Saif Mohammad, Svetlana Kiritchenko, Parinaz Sobhani, Xiaodan Zhu, Colin Cherry

Apart from stance, the tweets are also annotated for whether the target of interest is the target of opinion in the tweet.

Integrating Morphological Desegmentation into Phrase-based Decoding

no code implementations • NAACL 2016 • Mohammad Salameh, Colin Cherry, Grzegorz Kondrak

Language Modelling

An Empirical Evaluation of Noise Contrastive Estimation for the Neural Network Joint Model of Translation

no code implementations • NAACL 2016 • Colin Cherry

Domain Adaptation Language Modelling +2

SemEval-2016 Task 6: Detecting Stance in Tweets

no code implementations • SEMEVAL 2016 • Saif Mohammad, Svetlana Kiritchenko, Parinaz Sobhani, Xiaodan Zhu, Colin Cherry

Information Retrieval Natural Language Inference +2

NRC Russian-English Machine Translation System for WMT 2016

no code implementations • WS 2016 • Chi-kiu Lo, Colin Cherry, George Foster, Darlene Stewart, Rabib Islam, Anna Kazantseva, Roland Kuhn

Lemmatization Translation +2

End-to-End Multi-View Networks for Text Classification

no code implementations • 19 Apr 2017 • Hongyu Guo, Colin Cherry, Jiang Su

For a bag-of-words representation, each view focuses on a different subset of the text's words.

General Classification text-classification +1

A Challenge Set Approach to Evaluating Machine Translation

no code implementations • EMNLP 2017 • Pierre Isabelle, Colin Cherry, George Foster

We address these questions with a challenge set approach to translation evaluation and error analysis.

Machine Translation Translation

Cost Weighting for Neural Machine Translation Domain Adaptation

no code implementations • WS 2017 • Boxing Chen, Colin Cherry, George Foster, Samuel Larkin

We compare cost weighting to two traditional domain adaptation techniques developed for statistical machine translation: data selection and sub-corpus weighting.

Domain Adaptation Machine Translation +1

NRC Machine Translation System for WMT 2017

no code implementations • WS 2017 • Chi-kiu Lo, Boxing Chen, Colin Cherry, George Foster, Samuel Larkin, Darlene Stewart, Roland Kuhn

Domain Adaptation Machine Translation +1

Revisiting Character-Based Neural Machine Translation with Capacity and Compression

no code implementations • EMNLP 2018 • Colin Cherry, George Foster, Ankur Bapna, Orhan Firat, Wolfgang Macherey

Translating characters instead of words or word-fragments has the potential to simplify the processing pipeline for neural machine translation (NMT), and improve results by eliminating hyper-parameters and manual feature engineering.

Feature Engineering Machine Translation +2

Efficient Sequence Labeling with Actor-Critic Training

1 code implementation • 30 Sep 2018 • Saeed Najafi, Colin Cherry, Grzegorz Kondrak

We set out to establish RNNs as an attractive alternative to CRFs for sequence labeling.

Decision Making NER +1

Shaping the Narrative Arc: An Information-Theoretic Approach to Collaborative Dialogue

no code implementations • 31 Jan 2019 • Kory W. Mathewson, Pablo Samuel Castro, Colin Cherry, George Foster, Marc G. Bellemare

We consider the problem of designing an artificial agent capable of interacting with humans in collaborative dialogue to produce creative, engaging narratives.

Specificity

Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling

2 code implementations • 21 Feb 2019 • Jonathan Shen, Patrick Nguyen, Yonghui Wu, Zhifeng Chen, Mia X. Chen, Ye Jia, Anjuli Kannan, Tara Sainath, Yuan Cao, Chung-Cheng Chiu, Yanzhang He, Jan Chorowski, Smit Hinsu, Stella Laurenzo, James Qin, Orhan Firat, Wolfgang Macherey, Suyog Gupta, Ankur Bapna, Shuyuan Zhang, Ruoming Pang, Ron J. Weiss, Rohit Prabhavalkar, Qiao Liang, Benoit Jacob, Bowen Liang, HyoukJoong Lee, Ciprian Chelba, Sébastien Jean, Bo Li, Melvin Johnson, Rohan Anil, Rajat Tibrewal, Xiaobing Liu, Akiko Eriguchi, Navdeep Jaitly, Naveen Ari, Colin Cherry, Parisa Haghani, Otavio Good, Youlong Cheng, Raziel Alvarez, Isaac Caswell, Wei-Ning Hsu, Zongheng Yang, Kuan-Chieh Wang, Ekaterina Gonina, Katrin Tomanek, Ben Vanik, Zelin Wu, Llion Jones, Mike Schuster, Yanping Huang, Dehao Chen, Kazuki Irie, George Foster, John Richardson, Klaus Macherey, Antoine Bruguier, Heiga Zen, Colin Raffel, Shankar Kumar, Kanishka Rao, David Rybach, Matthew Murray, Vijayaditya Peddinti, Maxim Krikun, Michiel A. U. Bacchiani, Thomas B. Jablin, Rob Suderman, Ian Williams, Benjamin Lee, Deepti Bhatia, Justin Carlson, Semih Yavuz, Yu Zhang, Ian McGraw, Max Galkin, Qi Ge, Golan Pundak, Chad Whipkey, Todd Wang, Uri Alon, Dmitry Lepikhin, Ye Tian, Sara Sabour, William Chan, Shubham Toshniwal, Baohua Liao, Michael Nirschl, Pat Rondon

Lingvo is a Tensorflow framework offering a complete solution for collaborative deep learning research, with a particular focus towards sequence-to-sequence models.

Sequence-To-Sequence Speech Recognition

Reinforcement Learning based Curriculum Optimization for Neural Machine Translation

no code implementations • NAACL 2019 • Gaurav Kumar, George Foster, Colin Cherry, Maxim Krikun

We consider the problem of making efficient use of heterogeneous training data in neural machine translation (NMT).

Machine Translation NMT +4

Thinking Slow about Latency Evaluation for Simultaneous Machine Translation

no code implementations • 31 May 2019 • Colin Cherry, George Foster

Simultaneous machine translation attempts to translate a source sentence before it is finished being spoken, with applications to translation of spoken language for live streaming and conversation.

Machine Translation Sentence +1

Monotonic Infinite Lookback Attention for Simultaneous Machine Translation

no code implementations • ACL 2019 • Naveen Arivazhagan, Colin Cherry, Wolfgang Macherey, Chung-Cheng Chiu, Semih Yavuz, Ruoming Pang, Wei Li, Colin Raffel

Simultaneous machine translation begins to translate each source sentence before the source speaker is finished speaking, with applications to live and streaming scenarios.

Machine Translation NMT +2

Massively Multilingual Neural Machine Translation in the Wild: Findings and Challenges

no code implementations • 11 Jul 2019 • Naveen Arivazhagan, Ankur Bapna, Orhan Firat, Dmitry Lepikhin, Melvin Johnson, Maxim Krikun, Mia Xu Chen, Yuan Cao, George Foster, Colin Cherry, Wolfgang Macherey, Zhifeng Chen, Yonghui Wu

We introduce our efforts towards building a universal neural machine translation (NMT) system capable of translating between any language pair.

Machine Translation NMT +2

Re-Translation Strategies For Long Form, Simultaneous, Spoken Language Translation

1 code implementation • 6 Dec 2019 • Naveen Arivazhagan, Colin Cherry, Te I, Wolfgang Macherey, Pallavi Baljekar, George Foster

As this scenario allows for revisions to our incremental translations, we adopt a re-translation approach to simultaneous translation, where the source is repeatedly translated from scratch as it grows.

Machine Translation speech-recognition +2

Re-translation versus Streaming for Simultaneous Translation

no code implementations • WS 2020 • Naveen Arivazhagan, Colin Cherry, Wolfgang Macherey, George Foster

There has been great progress in improving streaming machine translation, a simultaneous paradigm where the system appends to a growing hypothesis as more source content becomes available.

Attribute Data Augmentation +2

Inference Strategies for Machine Translation with Conditional Masking

no code implementations • EMNLP 2020 • Julia Kreutzer, George Foster, Colin Cherry

Conditional masked language model (CMLM) training has proven successful for non-autoregressive and semi-autoregressive sequence generation tasks, such as machine translation.

Language Modelling Machine Translation +1

Human-Paraphrased References Improve Neural Machine Translation

1 code implementation • WMT (EMNLP) 2020 • Markus Freitag, George Foster, David Grangier, Colin Cherry

When used in place of original references, the paraphrased versions produce metric scores that correlate better with human judgment.

Machine Translation NMT +1

Sentence Boundary Augmentation For Neural Machine Translation Robustness

no code implementations • 21 Oct 2020 • Daniel Li, Te I, Naveen Arivazhagan, Colin Cherry, Dirk Padfield

Specifically, in the context of long-form speech translation systems, where the input transcripts come from Automatic Speech Recognition (ASR), the NMT models have to handle errors including phoneme substitutions, grammatical structure, and sentence boundaries, all of which pose challenges to NMT robustness.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +7

Simultaneous Translation

no code implementations • EMNLP 2020 • Liang Huang, Colin Cherry, Mingbo Ma, Naveen Arivazhagan, Zhongjun He

Simultaneous translation, which performs translation concurrently with the source speech, is widely useful in many scenarios such as international conferences, negotiations, press releases, legal proceedings, and medicine.

Machine Translation speech-recognition +3

Assessing Reference-Free Peer Evaluation for Machine Translation

no code implementations • NAACL 2021 • Sweta Agrawal, George Foster, Markus Freitag, Colin Cherry

Reference-free evaluation has the potential to make machine translation evaluation substantially more scalable, allowing us to pivot easily to new languages or domains.

Machine Translation Translation

Scaling Laws for Neural Machine Translation

no code implementations • ICLR 2022 • Behrooz Ghorbani, Orhan Firat, Markus Freitag, Ankur Bapna, Maxim Krikun, Xavier Garcia, Ciprian Chelba, Colin Cherry

We show that cross-entropy loss as a function of model size follows a certain scaling law.

Decoder Machine Translation +2

Can Multilinguality benefit Non-autoregressive Machine Translation?

no code implementations • 16 Dec 2021 • Sweta Agrawal, Julia Kreutzer, Colin Cherry

Non-autoregressive (NAR) machine translation has recently achieved significant improvements, and now outperforms autoregressive (AR) models on some benchmarks, providing an efficient alternative to AR inference.

Machine Translation Translation

mSLAM: Massively multilingual joint pre-training for speech and text

no code implementations • 3 Feb 2022 • Ankur Bapna, Colin Cherry, Yu Zhang, Ye Jia, Melvin Johnson, Yong Cheng, Simran Khanuja, Jason Riesa, Alexis Conneau

We present mSLAM, a multilingual Speech and LAnguage Model that learns cross-lingual cross-modal representations of speech and text by pre-training jointly on large amounts of unlabeled speech and text in multiple languages.

intent-classification Intent Classification +4

Data Scaling Laws in NMT: The Effect of Noise and Architecture

no code implementations • 4 Feb 2022 • Yamini Bansal, Behrooz Ghorbani, Ankush Garg, Biao Zhang, Maxim Krikun, Colin Cherry, Behnam Neyshabur, Orhan Firat

In this work, we study the effect of varying the architecture and training data quality on the data scaling properties of Neural Machine Translation (NMT).

Decoder Language Modelling +2

XTREME-S: Evaluating Cross-lingual Speech Representations

no code implementations • 21 Mar 2022 • Alexis Conneau, Ankur Bapna, Yu Zhang, Min Ma, Patrick von Platen, Anton Lozhkov, Colin Cherry, Ye Jia, Clara Rivera, Mihir Kale, Daan van Esch, Vera Axelrod, Simran Khanuja, Jonathan H. Clark, Orhan Firat, Michael Auli, Sebastian Ruder, Jason Riesa, Melvin Johnson

Covering 102 languages from 10+ language families, 3 different domains and 4 task families, XTREME-S aims to simplify multilingual speech representation evaluation, as well as catalyze research in "universal" speech representation learning.

Representation Learning Retrieval +4

Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translation

no code implementations • 24 Mar 2022 • Ye Jia, Yifan Ding, Ankur Bapna, Colin Cherry, Yu Zhang, Alexis Conneau, Nobuyuki Morioka

End-to-end speech-to-speech translation (S2ST) without relying on intermediate text representations is a rapidly emerging frontier of research.

Representation Learning Speech-to-Speech Translation +1

Prompting PaLM for Translation: Assessing Strategies and Performance

no code implementations • 16 Nov 2022 • David Vilar, Markus Freitag, Colin Cherry, Jiaming Luo, Viresh Ratnakar, George Foster

Large language models (LLMs) that have been trained on multilingual but not parallel text exhibit a remarkable ability to translate between languages.

Language Modelling Machine Translation +1

The unreasonable effectiveness of few-shot learning for machine translation

no code implementations • 2 Feb 2023 • Xavier Garcia, Yamini Bansal, Colin Cherry, George Foster, Maxim Krikun, Fangxiaoyu Feng, Melvin Johnson, Orhan Firat

We demonstrate the potential of few-shot translation systems, trained with unpaired language data, for both high and low-resource language pairs.

Decoder Few-Shot Learning +3

PaLM 2 Technical Report

1 code implementation • 17 May 2023 • Rohan Anil, Andrew M. Dai, Orhan Firat, Melvin Johnson, Dmitry Lepikhin, Alexandre Passos, Siamak Shakeri, Emanuel Taropa, Paige Bailey, Zhifeng Chen, Eric Chu, Jonathan H. Clark, Laurent El Shafey, Yanping Huang, Kathy Meier-Hellstern, Gaurav Mishra, Erica Moreira, Mark Omernick, Kevin Robinson, Sebastian Ruder, Yi Tay, Kefan Xiao, Yuanzhong Xu, Yujing Zhang, Gustavo Hernandez Abrego, Junwhan Ahn, Jacob Austin, Paul Barham, Jan Botha, James Bradbury, Siddhartha Brahma, Kevin Brooks, Michele Catasta, Yong Cheng, Colin Cherry, Christopher A. Choquette-Choo, Aakanksha Chowdhery, Clément Crepy, Shachi Dave, Mostafa Dehghani, Sunipa Dev, Jacob Devlin, Mark Díaz, Nan Du, Ethan Dyer, Vlad Feinberg, Fangxiaoyu Feng, Vlad Fienber, Markus Freitag, Xavier Garcia, Sebastian Gehrmann, Lucas Gonzalez, Guy Gur-Ari, Steven Hand, Hadi Hashemi, Le Hou, Joshua Howland, Andrea Hu, Jeffrey Hui, Jeremy Hurwitz, Michael Isard, Abe Ittycheriah, Matthew Jagielski, Wenhao Jia, Kathleen Kenealy, Maxim Krikun, Sneha Kudugunta, Chang Lan, Katherine Lee, Benjamin Lee, Eric Li, Music Li, Wei Li, Yaguang Li, Jian Li, Hyeontaek Lim, Hanzhao Lin, Zhongtao Liu, Frederick Liu, Marcello Maggioni, Aroma Mahendru, Joshua Maynez, Vedant Misra, Maysam Moussalem, Zachary Nado, John Nham, Eric Ni, Andrew Nystrom, Alicia Parrish, Marie Pellat, Martin Polacek, Alex Polozov, Reiner Pope, Siyuan Qiao, Emily Reif, Bryan Richter, Parker Riley, Alex Castro Ros, Aurko Roy, Brennan Saeta, Rajkumar Samuel, Renee Shelby, Ambrose Slone, Daniel Smilkov, David R. So, Daniel Sohn, Simon Tokumine, Dasha Valter, Vijay Vasudevan, Kiran Vodrahalli, Xuezhi Wang, Pidong Wang, ZiRui Wang, Tao Wang, John Wieting, Yuhuai Wu, Kelvin Xu, Yunhan Xu, Linting Xue, Pengcheng Yin, Jiahui Yu, Qiao Zhang, Steven Zheng, Ce Zheng, Weikang Zhou, Denny Zhou, Slav Petrov, Yonghui Wu

Through extensive evaluations on English and multilingual language, and reasoning tasks, we demonstrate that PaLM 2 has significantly improved quality on downstream tasks across different model sizes, while simultaneously exhibiting faster and more efficient inference compared to PaLM.

Ranked #1 on Question Answering on StrategyQA

Code Generation Common Sense Reasoning +6

Searching for Needles in a Haystack: On the Role of Incidental Bilingualism in PaLM's Translation Capability

no code implementations • 17 May 2023 • Eleftheria Briakou, Colin Cherry, George Foster

We investigate the role of incidental bilingualism -- the unintentional consumption of bilingual signals, including translation examples -- in explaining the translation capabilities of large language models, taking the Pathways Language Model (PaLM) as a case study.

Language Modelling Machine Translation +1

XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages

1 code implementation • 19 May 2023 • Sebastian Ruder, Jonathan H. Clark, Alexander Gutkin, Mihir Kale, Min Ma, Massimo Nicosia, Shruti Rijhwani, Parker Riley, Jean-Michel A. Sarr, Xinyi Wang, John Wieting, Nitish Gupta, Anna Katanova, Christo Kirov, Dana L. Dickinson, Brian Roark, Bidisha Samanta, Connie Tao, David I. Adelani, Vera Axelrod, Isaac Caswell, Colin Cherry, Dan Garrette, Reeve Ingle, Melvin Johnson, Dmitry Panteleev, Partha Talukdar

We evaluate commonly used models on the benchmark.

In-Context Learning Multilingual NLP +3

Quality-Aware Translation Models: Efficient Generation and Quality Estimation in a Single Model

no code implementations • 10 Oct 2023 • Christian Tomani, David Vilar, Markus Freitag, Colin Cherry, Subhajit Naskar, Mara Finkelstein, Xavier Garcia, Daniel Cremers

Maximum-a-posteriori (MAP) decoding is the most widely used decoding strategy for neural machine translation (NMT) models.

Machine Translation NMT +1

To Diverge or Not to Diverge: A Morphosyntactic Perspective on Machine Translation vs Human Translation

no code implementations • 2 Jan 2024 • Jiaming Luo, Colin Cherry, George Foster

We conduct a large-scale fine-grained comparative analysis of machine translations (MT) against human translations (HT) through the lens of morphosyntactic divergence.

Attribute Machine Translation +1

When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method

no code implementations • 27 Feb 2024 • Biao Zhang, Zhongtao Liu, Colin Cherry, Orhan Firat

While large language models (LLMs) often adopt finetuning to unlock their capabilities for downstream applications, our understanding on the inductive biases (especially the scaling properties) of different finetuning methods is still limited.

Machine Translation

Inverted Projection for Robust Speech Translation

no code implementations • ACL (IWSLT) 2021 • Dirk Padfield, Colin Cherry

Traditional translation systems trained on written documents perform well for text-based translation but not as well for speech-based applications.

Translation

A Natural Diet: Towards Improving Naturalness of Machine Translation Output

no code implementations • Findings (ACL) 2022 • Markus Freitag, David Vilar, David Grangier, Colin Cherry, George Foster

In this work we propose a method for training MT systems to achieve a more natural style, i.e., mirroring the style of text originally written in the target language.

Machine Translation Sentence +1

Bilingual Methods for Adaptive Training Data Selection for Machine Translation

no code implementations • AMTA 2016 • Boxing Chen, Roland Kuhn, George Foster, Colin Cherry, Fei Huang

In this paper, we propose a new data selection method which uses semi-supervised convolutional neural networks based on bitokens (Bi-SSCNNs) for training machine translation systems from a large bilingual corpus.

Machine Translation NMT +2
