1 code implementation • 12 Apr 2024 • Marta Bañón, Jaume Zaragoza-Bernabeu, Gema Ramírez-Sánchez, Sergio Ortiz-Rojas
Language identification is a crucial component in the automated production of language resources, particularly in multilingual and big data contexts.
1 code implementation • WS 2018 • V{\'\i}ctor M. S{\'a}nchez-Cartagena, Marta Ba{\~n}{\'o}n, Sergio Ortiz-Rojas, Gema Ram{\'\i}rez
This paper describes Prompsit Language Engineering{'}s submissions to the WMT 2018 parallel corpus filtering shared task.
no code implementations • WS 2014 • Raphael Rubino, Antonio Toral, Victor M. S{\'a}nchez-Cartagena, Jorge Ferr{\'a}ndez-Tordera, Sergio Ortiz-Rojas, Gema Ram{\'\i}rez-S{\'a}nchez, Felipe S{\'a}nchez-Mart{\'\i}nez, Andy Way
no code implementations • LREC 2014 • Miquel Espl{\`a}-Gomis, Filip Klubi{\v{c}}ka, Nikola Ljube{\v{s}}i{\'c}, Sergio Ortiz-Rojas, Vassilis Papavassiliou, Prokopis Prokopidis
We used both tools for crawling 21 multilingual websites from the tourism domain to build a domain-specific English―Croatian parallel corpus.