no code implementations • 18 Mar 2024 • Olivier Kraif
In this paper, we present an adaptive bitextual alignment system called AIlign.
1 code implementation • COLING (WANLP) 2020 • Elisa Gugliotta, Marco Dinarelli, Olivier Kraif
We show also how we used the system in order to annotate a Tunisian Arabizi corpus, which has been afterwards manually corrected and used to further evaluate sequence models on Tunisian data.
1 code implementation • LREC 2018 • Ali Can Kocabiyikoglu, Laurent Besacier, Olivier Kraif
However, while large quantities of parallel texts (such as Europarl, OpenSubtitles) are available for training machine translation systems, there are no large (100h) and open source parallel corpora that include speech in a source language aligned to text in a target language.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +5
no code implementations • WS 2017 • Agn{\`e}s Tutin, Olivier Kraif
This paper aims at assessing to what extent a syntax-based method (Recurring Lexico-syntactic Trees (RLT) extraction) allows us to extract large phraseological units such as prefabricated routines, e. g. {``}as previously said{''} or {``}as far as we/I know{''} in scientific writing.
no code implementations • JEPTALNRECITAL 2015 • Olivier Kraif
Dans cet article, nous proposons une m{\'e}thode originale destin{\'e}e {\`a} effectuer l{'}alignement d{'}un corpus multiparall{\`e}le, i. e. comportant plus de deux langues, en prenant en compte toutes les langues simultan{\'e}ment (et non en composant une s{\'e}rie de bialignements ind{\'e}pendants).
no code implementations • JEPTALNRECITAL 2012 • Olivier Kraif, Sascha Diwersy