Search Results for author: Phillippe Langlais

Found 17 papers, 1 papers with code

About Evaluating Bilingual Lexicon Induction

no code implementations • LREC (BUCC) 2022 • Martin Laville, Emmanuel Morin, Phillippe Langlais

With numerous new methods proposed recently, the evaluation of Bilingual Lexicon Induction have been quite hazardous and inconsistent across works.

Bilingual Lexicon Induction

Paper
Add Code

Data Selection for Bilingual Lexicon Induction from Specialized Comparable Corpora

no code implementations • COLING 2020 • Martin Laville, Amir Hazem, Emmanuel Morin, Phillippe Langlais

In this paper, we contrast several data selection techniques to improve bilingual lexicon induction from specialized comparable corpora.

Bilingual Lexicon Induction Data Augmentation +1

Paper
Add Code

Human or Neural Translation?

no code implementations • COLING 2020 • Shivendra Bhardwaj, David Alfonso Hermelo, Phillippe Langlais, Gabriel Bernier-Colborne, Cyril Goutte, Michel Simard

Deep neural models tremendously improved machine translation.

Machine Translation Translation

Paper
Add Code

SEDAR: a Large Scale French-English Financial Domain Parallel Corpus

1 code implementation • LREC 2020 • Abbas Ghaddar, Phillippe Langlais

This paper describes the acquisition, preprocessing and characteristics of SEDAR, a large scale English-French parallel corpus for the financial domain.

Domain Adaptation Machine Translation +2

Paper
Code

HardEval: Focusing on Challenging Tokens to Assess Robustness of NER

no code implementations • LREC 2020 • Gabriel Bernier-Colborne, Phillippe Langlais

To assess the robustness of NER systems, we propose an evaluation method that focuses on subsets of tokens that represent specific sources of errors: unknown words and label shift or ambiguity.

NER

Paper
Add Code

Contextualized Word Representations from Distant Supervision with and for NER

no code implementations • WS 2019 • Abbas Ghaddar, Phillippe Langlais

We describe a special type of deep contextualized word representation that is learned from distant supervision annotations and dedicated to named entity recognition.

named-entity-recognition Named Entity Recognition +1

Paper
Add Code

WiNER: A Wikipedia Annotated Corpus for Named Entity Recognition

no code implementations • IJCNLP 2017 • Abbas Ghaddar, Phillippe Langlais

We revisit the idea of mining Wikipedia in order to generate named-entity annotations.

named-entity-recognition Named Entity Recognition +1

Paper
Add Code

Users and Data: The Two Neglected Children of Bilingual Natural Language Processing Research

no code implementations • WS 2017 • Phillippe Langlais

Despite numerous studies devoted to mining parallel material from bilingual data, we have yet to see the resulting technologies wholeheartedly adopted by professional translators and terminologists alike.

Machine Translation

Paper
Add Code

Reranking Translation Candidates Produced by Several Bilingual Word Similarity Sources

no code implementations • EACL 2017 • Laurent Jakubina, Phillippe Langlais

We investigate the reranking of the output of several distributional approaches on the Bilingual Lexicon Induction task.

Bilingual Lexicon Induction Translation +2

Paper
Add Code

Coreference in Wikipedia: Main Concept Resolution

no code implementations • CONLL 2016 • Abbas Ghaddar, Phillippe Langlais

Coreference Resolution Open Information Extraction

Paper
Add Code

BAD LUC@WMT 2016: a Bilingual Document Alignment Platform Based on Lucene

no code implementations • WS 2016 • Laurent Jakubina, Phillippe Langlais

Information Retrieval Machine Translation

Paper
Add Code

WikiCoref: An English Coreference-annotated Corpus of Wikipedia Articles

no code implementations • LREC 2016 • Abbas Ghaddar, Phillippe Langlais

This paper presents WikiCoref, an English corpus annotated for anaphoric relations, where all documents are from the English version of Wikipedia.

coreference-resolution

Paper
Add Code

Projective methods for mining missing translations in DBpedia

no code implementations • WS 2015 • Laurent Jakubina, Phillippe Langlais

Paper
Add Code

Fourteen Light Tasks for comparing Analogical and Phrase-based Machine Translation

no code implementations • COLING 2014 • Rafik Rhouma, Phillippe Langlais

Translation Transliteration

Paper
Add Code

An Iterative Approach for Mining Parallel Sentences in a Comparable Corpus

no code implementations • LREC 2014 • Lise Rebout, Phillippe Langlais

We describe an approach for mining parallel sentences in a collection of documents in two languages.

Information Retrieval Machine Translation +2

Paper
Add Code

Hashtag Occurrences, Layout and Translation: A Corpus-driven Analysis of Tweets Published by the Canadian Government

no code implementations • LREC 2014 • Fabrizio Gotti, Phillippe Langlais, Atefeh Farzindar

A manual analysis of the bilingual alignment of 5000 hashtags shows that 5{\%} (French) to 18{\%} (English) of them don{'}t have a counterpart in their containing tweet{'}s translation.

Information Retrieval Machine Translation +2

Paper
Add Code

Mapping Source to Target Strings without Alignment by Analogical Learning: A Case Study with Transliteration

no code implementations • ACL 2013 • Phillippe Langlais

Information Retrieval Transliteration

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.