2 code implementations • ACL 2020 • Marta Bañón, Pin-zhen Chen, Barry Haddow, Kenneth Heafield, Hieu Hoang, Miquel Esplà-Gomis, Mikel L. Forcada, Amir Kamran, Faheem Kirefu, Philipp Koehn, Sergio Ortiz Rojas, Leopoldo Pla Sempere, Gema Ramírez-Sánchez, Elsa Sarrías, Marek Strelec, Brian Thompson, William Waites, Dion Wiggins, Jaume Zaragoza
We report on methods to create the largest publicly available parallel corpora by crawling the web, using open source software.
no code implementations • WS 2019 • Sheila Castilho, Natália Resende, Federico Gaspari, Andy Way, Tony O'Dowd, Marek Mazur, Manuel Herranz, Alex Helle, Gema Ramírez-Sánchez, Víctor Sánchez-Cartagena, Mārcis Pinnis, Valters Šics
no code implementations • EAMT 2016 • Antonio Toral, Tommi A. Pirinen, Andy Way, Gema Ramírez-Sánchez, Sergio Ortiz Rojas, Raphael Rubino, Miquel Esplà, Mikel L. Forcada, Vassilis Papavassiliou, Prokopis Prokopidis, Nikola Ljubešić
no code implementations • WS 2014 • Raphael Rubino, Antonio Toral, Victor M. Sánchez-Cartagena, Jorge Ferrández-Tordera, Sergio Ortiz-Rojas, Gema Ramírez-Sánchez, Felipe Sánchez-Martínez, Andy Way
no code implementations • LREC 2014 • Raphael Rubino, Antonio Toral, Nikola Ljubešić, Gema Ramírez-Sánchez
This paper presents a novel approach for parallel data generation using machine translation and quality estimation.