no code implementations • WS (NoDaLiDa) 2019 • José Carlos Rosales Núñez, Djamé Seddah, Guillaume Wisniewski
This work compares the performances achieved by Phrase-Based Statistical Machine Translation systems (PB-SMT) and attention-based Neuronal Machine Translation systems (NMT) when translating User Generated Content (UGC), as encountered in social medias, from French to English.
no code implementations • WMT (EMNLP) 2020 • Sadaf Abdul Rauf, José Carlos Rosales Núñez, Minh Quang Pham, François Yvon
This paper describes LIMSI’s submissions to the translation shared tasks at WMT’20.
1 code implementation • WNUT (ACL) 2021 • José Carlos Rosales Núñez, Guillaume Wisniewski, Djamé Seddah
This work explores the capacities of character-based Neural Machine Translation to translate noisy User-Generated Content (UGC) with a strong focus on exploring the limits of such approaches to handle productive UGC phenomena, which almost by definition, cannot be seen at training time.
no code implementations • WNUT (ACL) 2021 • José Carlos Rosales Núñez, Djamé Seddah, Guillaume Wisniewski
This work takes a critical look at the evaluation of user-generated content automatic translation, the well-known specificities of which raise many challenges for MT.