1 code implementation • 12 Aug 2020 • Pin-zhen Chen, Kenneth Heafield
Chinese word segmentation has entered the deep learning era, which greatly reduces the hassle of feature engineering.
Chinese Word Segmentation • Low-Resource Neural Machine Translation • +2
1 code implementation • ACL 2020 • Pin-zhen Chen, Nikolay Bogoychev, Kenneth Heafield, Faheem Kirefu
We present a novel method to extract parallel sentences from two monolingual corpora, using neural machine translation.
2 code implementations • ACL 2020 • Marta Bañón, Pin-zhen Chen, Barry Haddow, Kenneth Heafield, Hieu Hoang, Miquel Esplà-Gomis, Mikel L. Forcada, Amir Kamran, Faheem Kirefu, Philipp Koehn, Sergio Ortiz Rojas, Leopoldo Pla Sempere, Gema Ramírez-Sánchez, Elsa Sarrías, Marek Strelec, Brian Thompson, William Waites, Dion Wiggins, Jaume Zaragoza
We report on methods to create the largest publicly available parallel corpora by crawling the web, using open source software.
1 code implementation • WS 2020 • Pin-zhen Chen, Nikolay Bogoychev, Ulrich Germann
This paper describes the University of Edinburgh's neural machine translation systems submitted to the IWSLT 2020 open domain Japanese↔Chinese translation task.
no code implementations • WS 2019 • Nianheng Wu, Eric DeMattos, Kwok Him So, Pin-zhen Chen, Çağrı Çöltekin
This paper describes the work done by team tearsofjoy participating in the VarDial 2019 Evaluation Campaign.