Search Results for author: Olivier Kraif

Found 8 papers, 2 papers with code

Multi-Task Sequence Prediction For Tunisian Arabizi Multi-Level Annotation

1 code implementation COLING (WANLP) 2020 Elisa Gugliotta, Marco Dinarelli, Olivier Kraif

We show also how we used the system in order to annotate a Tunisian Arabizi corpus, which has been afterwards manually corrected and used to further evaluate sequence models on Tunisian data.

POS POS Tagging +2

Augmenting Librispeech with French Translations: A Multimodal Corpus for Direct Speech Translation Evaluation

1 code implementation LREC 2018 Ali Can Kocabiyikoglu, Laurent Besacier, Olivier Kraif

However, while large quantities of parallel texts (such as Europarl, OpenSubtitles) are available for training machine translation systems, there are no large (100h) and open source parallel corpora that include speech in a source language aligned to text in a target language.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

Comparing Recurring Lexico-Syntactic Trees (RLTs) and Ngram Techniques for Extended Phraseology Extraction

no code implementations WS 2017 Agn{\`e}s Tutin, Olivier Kraif

This paper aims at assessing to what extent a syntax-based method (Recurring Lexico-syntactic Trees (RLT) extraction) allows us to extract large phraseological units such as prefabricated routines, e. g. {``}as previously said{''} or {``}as far as we/I know{''} in scientific writing.

Multialignement vs bialignement : \`a plusieurs, c'est mieux !

no code implementations JEPTALNRECITAL 2015 Olivier Kraif

Dans cet article, nous proposons une m{\'e}thode originale destin{\'e}e {\`a} effectuer l{'}alignement d{'}un corpus multiparall{\`e}le, i. e. comportant plus de deux langues, en prenant en compte toutes les langues simultan{\'e}ment (et non en composant une s{\'e}rie de bialignements ind{\'e}pendants).

Cannot find the paper you are looking for? You can Submit a new open access paper.