no code implementations • MTSummit 2021 • Seamus Lankford, Haithem Alfi, Andy Way
The Transformer model is the state-of-the-art in Machine Translation.
no code implementations • GWC 2016 • Arefeh Kazemi, Antonio Toral, Andy Way
We propose the use of WordNet synsets in a syntax-based reordering model for hierarchical statistical machine translation (HPB-SMT) to enable the model to generalize to phrases not seen in the training data but that have equivalent meaning.
1 code implementation • LREC 2022 • Séamus Lankford, Haithem Afli, Órla Ní Loinsigh, Andy Way
However in the context of low-resource languages, there is a paucity of parallel data datasets available for developing translation models.
no code implementations • AMTA 2022 • Wandri Jooste, Andy Way, Rejwanul Haque, Riccardo Superbo
Knowledge distillation (KD) can be used to reduce model size and training time, without significant loss in performance.
no code implementations • EAMT 2022 • Pintu Lohar, Guodong Xie, Andy Way
It is often a challenging task to build Machine Translation (MT) engines for a specific domain due to the lack of parallel data in that area.
no code implementations • EAMT 2022 • Itziar Aldabe, Jane Dunne, Aritz Farwell, Owen Gallagher, Federico Gaspari, Maria Giagkou, Jan Hajic, Jens Peter Kückens, Teresa Lynn, Georg Rehm, German Rigau, Katrin Marheinecke, Stelios Piperidis, Natalia Resende, Tea Vojtěchová, Andy Way
This paper provides an overview of the ongoing European Language Equality(ELE) project, an 18-month action funded by the European Commission which involves 52 partners.
no code implementations • EAMT 2022 • Petra Bago, Sheila Castilho, Jane Dunne, Federico Gaspari, Andre K, Gauti Kristmannsson, Jon Arild Olsen, Natalia Resende, Níels Rúnar Gíslason, Dana D. Sheridan, Páraic Sheridan, John Tinsley, Andy Way
This paper provides an overview of the main achievements of the completed PRINCIPLE project, a 2-year action funded by the European Commission under the Connecting Europe Facility (CEF) programme.
no code implementations • ICON 2020 • Rejwanul Haque, Yasmin Moslem, Andy Way
This paper describes the ADAPT Centre’s submission to the Adap-MT 2020 AI Translation Shared Task for English-to-Hindi.
1 code implementation • ICON 2020 • Raghvendra Pratap Singh, Rejwanul Haque, Mohammed Hasanuzzaman, Andy Way
Automatic recognition of customer complaints on products or services that they purchase can be crucial for the organisations, multinationals and online retailers since they can exploit this information to fulfil their customers’ expectations including managing and resolving the complaints.
no code implementations • AACL (WAT) 2020 • Wandri Jooste, Rejwanul Haque, Andy Way
In this paper we describe the ADAPT Centre’s submissions to the WAT 2020 document-level Business Scene Dialogue (BSD) Translation task.
no code implementations • AACL (WAT) 2020 • Prashanth Nayak, Rejwanul Haque, Andy Way
This paper describes the ADAPT Centre sub-missions to WAT 2020 for the English-to-Odia translation task.
no code implementations • EAMT 2020 • Natália Resende, Andy Way
The literature on MT and Computer Assisted Language Learning (CALL) shows that, over the years, MT systems have been facilitating language teaching and also language learning (Nin ̃o, 2006).
no code implementations • EAMT 2020 • Andy Way, Petra Bago, Jane Dunne, Federico Gaspari, Andre Kåsen, Gauti Kristmannsson, Helen McHugh, Jon Arild Olsen, Dana Davis Sheridan, Páraic Sheridan, John Tinsley
This paper updates the progress made on the PRINCIPLE project, a 2-year action funded by the European Commission under the Connecting Europe Facility (CEF) programme.
no code implementations • EAMT 2020 • Meghan Dowling, Sheila Castilho, Joss Moorkens, Teresa Lynn, Andy Way
With official status in both Ireland and the EU, there is a need for high-quality English-Irish (EN-GA) machine translation (MT) systems which are suitable for use in a professional translation environment.
no code implementations • EAMT 2020 • Natália Resende, Benjamin Cowan, Andy Way
In this paper, we tested 20 Brazilian Portuguese speakers at intermediate and advanced English proficiency levels to investigate the influence of Google Translate’s MT system on the mental processing of English as a second language.
no code implementations • EAMT 2020 • Kamal Kumar Gupta, Rejwanul Haque, Asif Ekbal, Pushpak Bhattacharyya, Andy Way
In this study, we model source-language syntactic constituency parse and target-language syntactic descriptions in the form of supertags as conditional context for interactive prediction in neural MT (NMT).
no code implementations • loresmt (AACL) 2020 • Akshai Ramesh, Venkatesh Balavadhani parthasa, Rejwanul Haque, Andy Way
Statistical machine translation (SMT) which was the dominant paradigm in machine translation (MT) research for nearly three decades has recently been superseded by the end-to-end deep learning approaches to MT.
no code implementations • TDLE (LREC) 2022 • Federico Gaspari, Owen Gallagher, Georg Rehm, Maria Giagkou, Stelios Piperidis, Jane Dunne, Andy Way
The paper situates this ongoing work with a strong European focus in the broader context of related efforts, and explains how the DLE Metric can help track the progress towards DLE for all languages of Europe, focusing in particular on the role played by the TFs.
1 code implementation • WMT (EMNLP) 2021 • Sheila Castilho, João Lucas Cavalheiro Camargo, Miguel Menezes, Andy Way
Recently, the Machine Translation (MT) community has become more interested in document-level evaluation especially in light of reactions to claims of “human parity”, since examining the quality at the level of the document rather than at the sentence level allows for the assessment of suprasentential context, providing a more reliable evaluation.
no code implementations • WMT (EMNLP) 2020 • Prashant Nayak, Rejwanul Haque, Andy Way
This paper describes the ADAPT Centre’s submissions to the WMT20 Biomedical Translation Shared Task for English-to-Basque.
no code implementations • WMT (EMNLP) 2020 • Venkatesh Parthasarathy, Akshai Ramesh, Rejwanul Haque, Andy Way
This paper describes the ADAPT Centre’s submissions to the WMT20 News translation shared task for English-to-Tamil and Tamil-to-English.
no code implementations • EAMT 2016 • Valia Kordoni, Lexi Birch, Ioana Buliga, Kostadin Cholakov, Markus Egg, Federico Gaspari, Yota Georgakopolou, Maria Gialama, Iris Hendrickx, Mitja Jermol, Katia Kermanidis, Joss Moorkens, Davor Orlic, Michael Papadopoulos, Maja Popović, Rico Sennrich, Vilelmini Sosoni, Dimitrios Tsoumakos, Antal Van den Bosch, Menno van Zaanen, Andy Way
no code implementations • VarDial (COLING) 2020 • Maja Popović, Alberto Poncelas, Marija Brkic, Andy Way
Furthermore, translation performance from English is much better than from German, partly because German is morphologically more complex and partly because the corpus consists mostly of parallel human translations instead of original text and its human translation.
no code implementations • AACL (WAT) 2020 • Akshai Ramesh, Venkatesh Balavadhani parthasa, Rejwanul Haque, Andy Way
Statistical machine translation (SMT) was the state-of-the-art in machine translation (MT) research for more than two decades, but has since been superseded by neural MT (NMT).
1 code implementation • NoDaLiDa 2021 • Steinþór Steingrímsson, Hrafn Loftsson, Andy Way
Being able to generate accurate word alignments is useful for a variety of tasks.
no code implementations • gwll (LREC) 2022 • Steinþór Steingrímsson, Luke O’Brien, Finnur Ingimundarson, Hrafn Loftsson, Andy Way
By combining the most promising approaches and data sets, using confidence scores calculated from the data and the results of manually evaluating samples from our manual evaluation as indicators, we are able to induce lists of translations with a very high acceptance rate.
no code implementations • RANLP (BUCC) 2021 • Steinþór Steingrímsson, Pintu Lohar, Hrafn Loftsson, Andy Way
Parallel sentences extracted from comparable corpora can be useful to supplement parallel corpora when training machine translation (MT) systems.
no code implementations • RANLP 2021 • Maja Popović, Alberto Poncelas, Marija Brkic, Andy Way
This work investigates neural machine translation (NMT) systems for translating English user reviews into Croatian and Serbian, two similar morphologically complex languages.
no code implementations • 31 Oct 2024 • Séamus Lankford, Andy Way
By focusing on the unique challenges posed by crisis situations where speed, accuracy, and the ability to handle a wide range of languages are paramount, this research outlines a novel approach that combines the cutting-edge capabilities of LLMs with fine-tuning techniques and community-driven corpus development strategies.
no code implementations • 5 Sep 2024 • Inacio Vieira, Will Allred, Séamus Lankford, Sheila Castilho, Andy Way
We analyse diverse sizes of training datasets (1k to 207k segments) to evaluate their influence on translation quality.
1 code implementation • 6 Mar 2024 • Séamus Lankford, Haithem Afli, Órla Ní Loinsigh, Andy Way
However in the context of low-resource languages, there is a paucity of parallel data datasets available for developing translation models.
no code implementations • 6 Mar 2024 • Séamus Lankford, Haithem Afli, Andy Way
adaptNMT is an open-source application that offers a streamlined approach to the development and deployment of Recurrent Neural Networks and Transformer models.
1 code implementation • 4 Mar 2024 • Séamus Lankford, Haithem Afli, Andy Way
As a multilingual tool, we used adaptMLLM to fine-tune models for two low-resource language pairs: English to Irish (EN$\leftrightarrow$GA) and English to Marathi (EN$\leftrightarrow$MR).
no code implementations • 4 Mar 2024 • Séamus Lankford, Haithem Afli, Andy Way
Our findings show the best-performing Transformer system significantly reduces both accuracy and fluency errors when compared with an RNN-based model.
no code implementations • 4 Mar 2024 • Séamus Lankford, Haithem Afli, Andy Way
adaptNMT streamlines all processes involved in the development and deployment of RNN and Transformer neural translation models.
no code implementations • 4 Mar 2024 • Séamus Lankford, Haithem Afli, Andy Way
The Transformer model is the state-of-the-art in Machine Translation.
no code implementations • MTSummit 2021 • Séamus Lankford, Haithem Afli, Andy Way
Translation models for the specific domain of translating Covid data from English to Irish were developed for the LoResMT 2021 shared task.
2 code implementations • 20 Dec 2023 • Yasmin Moslem, Rejwanul Haque, Andy Way
These findings emphasise the significance of fine-tuning efficient LLMs like Mistral 7B to yield high-quality zero-shot translations comparable to task-oriented models like NLLB 3. 3B.
1 code implementation • 15 Nov 2023 • Steinþór Steingrímsson, Hrafn Loftsson, Andy Way
We present SentAlign, an accurate sentence alignment tool designed to handle very large parallel document pairs.
no code implementations • 6 Nov 2023 • Longyue Wang, Zhaopeng Tu, Yan Gu, Siyou Liu, Dian Yu, Qingsong Ma, Chenyang Lyu, Liting Zhou, Chao-Hong Liu, Yufeng Ma, WeiYu Chen, Yvette Graham, Bonnie Webber, Philipp Koehn, Andy Way, Yulin Yuan, Shuming Shi
To foster progress in this domain, we hold a new shared task at WMT 2023, the first edition of the Discourse-Level Literary Translation.
1 code implementation • 30 Jan 2023 • Yasmin Moslem, Rejwanul Haque, John D. Kelleher, Andy Way
By feeding an LLM at inference time with a prompt that consists of a list of translation pairs, it can then simulate the domain and style characteristics.
1 code implementation • 23 Oct 2022 • Yasmin Moslem, Rejwanul Haque, Andy Way
Research on Machine Translation (MT) has achieved important breakthroughs in several areas.
2 code implementations • AMTA 2022 • Yasmin Moslem, Rejwanul Haque, John D. Kelleher, Andy Way
Preservation of domain knowledge from the source to target is crucial in any translation workflow.
no code implementations • 20 Mar 2021 • Liangyou Li, Andy Way, Qun Liu
We present graph-based translation models which translate source graphs into target strings.
1 code implementation • 1 Dec 2020 • Yasmin Moslem, Rejwanul Haque, Andy Way
Accordingly, we made use of a bidirectional LSTM language model (LM) for our context-sensitive spelling detection and correction model which is shown to have much control over the correction process.
no code implementations • loresmt (AACL) 2020 • Alberto Poncelas, Jan Buts, James Hadley, Andy Way
As an additional contribution, we made available a set of English-Esperanto parallel data in the literary domain.
no code implementations • AMTA 2020 • Alberto Poncelas, Pintu Lohar, Andy Way, James Hadley
Furthermore, as performing a direct translation is not always possible, we explore the performance of automatic classifiers on sentences that have been translated using a pivot MT system.
no code implementations • ACL 2020 • Stein{\th}{\'o}r Steingr{\'\i}msson, Hrafn Loftsson, Andy Way
When rich morphology exacerbates the data sparsity problem, it is imperative to have accurate alignment and filtering methods that can help make the most of what is available by maximising the number of correctly translated segments in a corpus and minimising noise by removing incorrect translations and segments containing extraneous data.
no code implementations • WS 2020 • Rejwanul Haque, Yasmin Moslem, Andy Way
This paper describes the ADAPT Centre{'}s submission to STAPLE (Simultaneous Translation and Paraphrase for Language Education) 2020, a shared task of the 4th Workshop on Neural Generation and Translation (WNGT), for the English-to-Portuguese translation task.
no code implementations • LREC 2020 • Thierry Etchegoyhen, Borja Anza Porras, Andoni Azpeitia, Eva Mart{\'\i}nez Garcia, Jos{\'e} Luis Fonseca, Patricia Fonseca, Paulo Vale, Jane Dunne, Federico Gaspari, Teresa Lynn, Helen McHugh, Andy Way, Victoria Arranz, Khalid Choukri, Herv{\'e} Pusset, Alex Sicard, re, Rui Neto, Maite Melero, David Perez, Ant{\'o}nio Branco, Ruben Branco, Lu{\'\i}s Gomes
We describe the European Language Resource Infrastructure (ELRI), a decentralised network to help collect, prepare and share language resources.
no code implementations • LREC 2020 • Sheila Castilho, Maja Popovi{\'c}, Andy Way
Despite increasing efforts to improve evaluation of machine translation (MT) by going beyond the sentence level to the document level, the definition of what exactly constitutes a {``}document level{''} is still not clear.
no code implementations • ACL 2020 • Xabier Soto, Dimitar Shterionov, Alberto Poncelas, Andy Way
Machine translation (MT) has benefited from using synthetic training data originating from translating monolingual corpora, a technique known as backtranslation.
no code implementations • 1 May 2020 • Andy Way, Rejwanul Haque, Guodong Xie, Federico Gaspari, Maja Popovic, Alberto Poncelas
Every day, more people are becoming infected and dying from exposure to COVID-19.
1 code implementation • LREC 2020 • Alberto Poncelas, Mohammad Aboomar, Jan Buts, James Hadley, Andy Way
This paper reports on a tool built for postediting the output of Tesseract, more specifically for correcting common errors in digitized historical documents.
no code implementations • LREC 2020 • Alberto Poncelas, Wichaya Pidchamook, Chao-Hong Liu, James Hadley, Andy Way
Thai is a low-resource language, so it is often the case that data is not available in sufficient quantities to train an Neural Machine Translation (NMT) model which perform to a high level of quality.
no code implementations • LREC 2020 • Georg Rehm, Katrin Marheinecke, Stefanie Hegele, Stelios Piperidis, Kalina Bontcheva, Jan Hajič, Khalid Choukri, Andrejs Vasiļjevs, Gerhard Backfried, Christoph Prinz, José Manuel Gómez Pérez, Luc Meertens, Paul Lukowicz, Josef van Genabith, Andrea Lösch, Philipp Slusallek, Morten Irgens, Patrick Gatellier, Joachim köhler, Laure Le Bars, Dimitra Anastasiou, Albina Auksoriūtė, Núria Bel, António Branco, Gerhard Budin, Walter Daelemans, Koenraad De Smedt, Radovan Garabík, Maria Gavriilidou, Dagmar Gromann, Svetla Koeva, Simon Krek, Cvetana Krstev, Krister Lindén, Bernardo Magnini, Jan Odijk, Maciej Ogrodniczuk, Eiríkur Rögnvaldsson, Mike Rosner, Bolette Sandford Pedersen, Inguna Skadiņa, Marko Tadić, Dan Tufiş, Tamás Váradi, Kadri Vider, Andy Way, François Yvon
Multilingualism is a cultural cornerstone of Europe and firmly anchored in the European treaties including full language equality.
no code implementations • WS 2019 • Alberto Poncelas, Andy Way
Neural Machine Translation (NMT) models tend to achieve best performance when larger sets of parallel sentences are provided for training.
no code implementations • EMNLP 2018 • Eva Vanmassenhove, Christian Hardmeier, Andy Way
Our contribution is two-fold: (1) the compilation of large datasets with speaker information for 20 language pairs, and (2) a simple set of experiments that incorporate gender information into NMT for multiple language pairs.
no code implementations • 9 Sep 2019 • Alberto Poncelas, Maja Popovic, Dimitar Shterionov, Gideon Maillette de Buy Wenniger, Andy Way
Neural Machine Translation (NMT) models achieve their best performance when large sets of parallel data are used for training.
no code implementations • RANLP 2019 • Rejwanul Haque, Md Hasanuzzaman, Andy Way
Terminology translation plays a critical role in domain-specific machine translation (MT).
no code implementations • RANLP 2019 • Alberto Poncelas, Maja Popovi{\'c}, Dimitar Shterionov, Gideon Maillette de Buy Wenniger, Andy Way
Neural Machine Translation (NMT) models achieve their best performance when large sets of parallel data are used for training.
no code implementations • WS 2019 • Alberto Poncelas, Gideon Maillette de Buy Wenniger, Andy Way
Machine Translation models are trained to translate a variety of documents from one language into another.
no code implementations • WS 2019 • Sheila Castilho, Nat{\'a}lia Resende, Federico Gaspari, Andy Way, Tony O{'}Dowd, Marek Mazur, Manuel Herranz, Alex Helle, Gema Ram{\'\i}rez-S{\'a}nchez, V{\'\i}ctor S{\'a}nchez-Cartagena, M{\=a}rcis Pinnis, Valters {\v{S}}ics
1 code implementation • WS 2019 • Pintu Lohar, Maja Popovi{\'c}, Andy Way
This paper reports the results of the first experiment dealing with the challenges of building a machine translation system for user-generated content involving a complex South Slavic language.
no code implementations • WS 2019 • Eva Vanmassenhove, Dimitar Shterionov, Andy Way
This work presents an empirical approach to quantifying the loss of lexical richness in Machine Translation (MT) systems compared to Human Translation (HT).
no code implementations • 18 Jun 2019 • Alberto Poncelas, Gideon Maillette de Buy Wenniger, Andy Way
These methods ensure that selected sentences share n-grams with the test set so the NMT model can be adapted to translate it.
1 code implementation • 28 Feb 2019 • Gideon Maillette de Buy Wenniger, Lambert Schomaker, Andy Way
Neural handwriting recognition (NHR) is the recognition of handwritten text with deep learning models, such as multi-dimensional long short-term memory (MDLSTM) recurrent neural networks.
Ranked #16 on Handwritten Text Recognition on IAM
no code implementations • 23 Feb 2019 • Eva Vanmassenhove, Amit Moryossef, Alberto Poncelas, Andy Way, Dimitar Shterionov
In contradiction with the results described in previous comparable shared tasks, our neural models performed better than our best traditional approaches with our best feature set-up.
no code implementations • IWSLT (EMNLP) 2018 • Alberto Poncelas, Andy Way, Kepa Sarasola
In this paper we present the ADAPT system built for the Basque to English Low Resource MT Evaluation Campaign.
no code implementations • IWSLT (EMNLP) 2018 • Alberto Poncelas, Gideon Maillette de Buy Wenniger, Andy Way
A limitation of these methods to date is that using the source-side test set does not by itself guarantee that sentences are selected with correct translations, or translations that are suitable given the test-set domain.
no code implementations • EMNLP 2018 • Long-Yue Wang, Zhaopeng Tu, Andy Way, Qun Liu
Pronouns are frequently omitted in pro-drop languages, such as Chinese, generally leading to significant challenges with respect to the production of complete translations.
no code implementations • WS 2018 • Catarina Cruz Silva, Chao-Hong Liu, Alberto Poncelas, Andy Way
Data selection is a process used in selecting a subset of parallel data for the training of machine translation (MT) systems, so that 1) resources for training might be reduced, 2) trained models could perform better than those trained with the whole corpus, and/or 3) trained models are more tailored to specific domains.
no code implementations • EMNLP 2018 • Jinhua Du, Jingguang Han, Andy Way, Dadong Wan
Targeting the MIL issue, the structured sentence-level attention learns a 2-D matrix where each row vector represents a weight distribution on selection of different valid in-stances.
1 code implementation • WS 2018 • Antonio Toral, Sheila Castilho, Ke Hu, Andy Way
We reassess a recent study (Hassan et al., 2018) that claimed that machine translation (MT) has reached human parity for the translation of news from Chinese into English, using pairwise ranking and considering three variables that were not taken into account in that previous study: the language in which the source side of the test set was originally written, the translation proficiency of the evaluators, and the provision of inter-sentential context.
no code implementations • COLING 2018 • Peyman Passban, Andy Way, Qun Liu
A morphologically complex word (MCW) is a hierarchical constituent with meaning-preserving subunits, so word-based models which rely on surface forms might not be powerful enough to translate such structures.
no code implementations • COLING 2018 • Sayantan Mitra, Mohammed Hasanuzzaman, Sriparna Saha, Andy Way
Current paper explores the use of multi-view learning for search result clustering.
no code implementations • ACL 2018 • Eva Vanmassenhove, Andy Way
In this paper we incorporate semantic supersensetags and syntactic supertag features into EN{--}FR and EN{--}DE factored NMT systems.
no code implementations • NAACL 2018 • Sabyasachi Kamila, Mohammed Hasanuzzaman, Asif Ekbal, Pushpak Bhattacharyya, Andy Way
In this paper, we propose a very first study to demonstrate the association between the sentiment view of the temporal orientation of the users and their different psycho-demographic attributes by analyzing their tweets.
no code implementations • 17 Apr 2018 • Alberto Poncelas, Dimitar Shterionov, Andy Way, Gideon Maillette de Buy Wenniger, Peyman Passban
A prerequisite for training corpus-based machine translation (MT) systems -- either Statistical MT (SMT) or Neural MT (NMT) -- is the availability of high-quality parallel data.
no code implementations • NAACL 2018 • Peyman Passban, Qun Liu, Andy Way
Recently, neural machine translation (NMT) has emerged as a powerful alternative to conventional statistical approaches.
no code implementations • 22 Mar 2018 • Andy Way
Machine Translation (MT) is being deployed for a range of use-cases by millions of people on a daily basis.
no code implementations • 15 Jan 2018 • Antonio Toral, Andy Way
Given the rise of a new approach to MT, Neural MT (NMT), and its promising performance on different text types, we assess the translation quality it can attain on what is perceived to be the greatest challenge for MT: literary text.
no code implementations • IJCNLP 2017 • Pintu Lohar, Koel Dutta Chowdhury, Haithem Afli, Mohammed Hasanuzzaman, Andy Way
In this paper, we analyse the real world samples of customer feedback from Microsoft Office customers in four languages, i. e., English, French, Spanish and Japanese and conclude a five-plus-one-classes categorisation (comment, request, bug, complaint, meaningless and undetermined) for meaning classification.
no code implementations • IJCNLP 2017 • Long-Yue Wang, Jinhua Du, Liangyou Li, Zhaopeng Tu, Andy Way, Qun Liu
We showcase TODAY, a semantics-enhanced task-oriented dialogue translation system, whose novelties are: (i) task-oriented named entity (NE) definition and a hybrid strategy for NE recognition and translation; and (ii) a novel grounded semantic method for dialogue understanding and task-order management.
no code implementations • WS 2017 • Haithem Afli, Pintu Lohar, Andy Way
Integrating Natural Language Processing (NLP) and computer vision is a promising effort.
Content-Based Image Retrieval Multimodal Machine Translation
no code implementations • IJCNLP 2017 • Mohammed Hasanuzzaman, Ga{\"e}l Dias, Andy Way
Most social media platforms grant users freedom of speech by allowing them to freely express their thoughts, beliefs, and opinions.
1 code implementation • EMNLP 2017 • Long-Yue Wang, Zhaopeng Tu, Andy Way, Qun Liu
In translation, considering the document as a whole can help to resolve ambiguities and inconsistencies.
no code implementations • EACL 2017 • Liangyou Li, Andy Way, Qun Liu
In this paper, we present an improved graph-based translation model which segments an input graph into node-induced subgraphs by taking source context into consideration.
no code implementations • EACL 2017 • Iacer Calixto, Daniel Stein, Evgeny Matusov, Pintu Lohar, Sheila Castilho, Andy Way
We evaluate our models quantitatively using BLEU and TER and find that (i) additional synthetic data has a general positive impact on text-only and multi-modal NMT models, and that (ii) using a multi-modal NMT model for re-ranking n-best lists improves TER significantly across different n-best list sizes.
no code implementations • WS 2017 • Ahmad Khwileh, Haithem Afli, Gareth Jones, Andy Way
Cross Language Information Retrieval (CLIR) systems are a valuable tool to enable speakers of one language to search for content of interest expressed in a different language.
no code implementations • WS 2017 • Iacer Calixto, Daniel Stein, Evgeny Matusov, Sheila Castilho, Andy Way
Nonetheless, human evaluators ranked translations from a multi-modal NMT model as better than those of a text-only NMT over 88{\%} of the time, which suggests that images do help NMT in this use-case.
no code implementations • WS 2017 • Carla Parra Escart{\'\i}n, Wessel Reijers, Teresa Lynn, Joss Moorkens, Andy Way, Chao-Hong Liu
Shared tasks are increasingly common in our field, and new challenges are suggested at almost every conference and workshop.
no code implementations • COLING 2016 • Jian Zhang, Liangyou Li, Andy Way, Qun Liu
In recent years, neural machine translation (NMT) has demonstrated state-of-the-art machine translation (MT) performance.
no code implementations • COLING 2016 • Jian Zhang, Xiaofeng Wu, Andy Way, Qun Liu
We show that the neural LM perplexity can be reduced by 7. 395 and 12. 011 using the proposed domain adaptation mechanism on the Penn Treebank and News data, respectively.
no code implementations • COLING 2016 • Peyman Passban, Qun Liu, Andy Way
PBSMT engines by default provide four probability scores in phrase tables which are considered as the main set of bilingual features.
no code implementations • WS 2016 • Haithem Afli, Andy Way
Such documents must be passed through a process of Optical Character Recognition (OCR) to render the text suitable for MT.
no code implementations • LREC 2016 • Long-Yue Wang, Xiaojun Zhang, Zhaopeng Tu, Andy Way, Qun Liu
Then tags such as speaker and discourse boundary from the script data are projected to its subtitle data via an information retrieval approach in order to map monolingual discourse to bilingual texts.
no code implementations • LREC 2016 • Haithem Afli, Zhengwei Qiu, Andy Way, P{\'a}raic Sheridan
A trend to digitize historical paper-based archives has emerged in recent years, with the advent of digital optical scanners.
no code implementations • LREC 2016 • Xiaofeng Wu, Jinhua Du, Qun Liu, Andy Way
This paper presents ProphetMT, a tree-based SMT-driven Controlled Language (CL) authoring and post-editing tool.
no code implementations • LREC 2016 • Valia Kordoni, Antal Van den Bosch, Katia Lida Kermanidis, Vilelmini Sosoni, Kostadin Cholakov, Iris Hendrickx, Matthias Huck, Andy Way
The present work is an overview of the TraMOOC (Translation for Massive Open Online Courses) research and innovation project, a machine translation approach for online educational content.
no code implementations • LREC 2016 • Jinhua Du, Andy Way, Andrzej Zydron
Out-of-vocabulary words (OOVs) are a ubiquitous and difficult problem in statistical machine translation (SMT).
no code implementations • NAACL 2016 • Long-Yue Wang, Zhaopeng Tu, Xiaojun Zhang, Hang Li, Andy Way, Qun Liu
Finally, we integrate the above outputs into our translation system to recall missing pronouns by both extracting rules from the DP-labelled training data and translating the DP-generated input sentences.
no code implementations • EAMT 2016 • Antonio Toral, Tommi A. Pirinen, Andy Way, Gema Ram{\'\i}rez-S{\'a}nchez, Sergio Ortiz Rojas, Raphael Rubino, Miquel Espl{\`a}, Mikel L. Forcada, Vassilis Papavassiliou, Prokopis Prokopidis, Nikola Ljube{\v{s}}i{\'c}
no code implementations • WS 2015 • Valia Kordoni, Kostadin Cholakov, Markus Egg, Andy Way, Lexi Birch, Katia Kermanidis, Vilelmini Sosoni, Dimitrios Tsoumakos, Antal Van den Bosch, Iris Hendrickx, Michael Papadopoulos, Panayota Georgakopoulou, Maria Gialama, Menno van Zaanen, Ioana Buliga, Mitja Jermol, Davor Orlic
no code implementations • WS 2014 • Raphael Rubino, Antonio Toral, Victor M. S{\'a}nchez-Cartagena, Jorge Ferr{\'a}ndez-Tordera, Sergio Ortiz-Rojas, Gema Ram{\'\i}rez-S{\'a}nchez, Felipe S{\'a}nchez-Mart{\'\i}nez, Andy Way
no code implementations • LREC 2012 • Volha Petukhova, Rodrigo Agerri, Mark Fishel, Sergio Penkale, Arantza del Pozo, Mirjam Sepesy Mau{\v{c}}ec, Andy Way, Panayota Georgakopoulou, Martin Volk
Subtitling and audiovisual translation have been recognized as areas that could greatly benefit from the introduction of Statistical Machine Translation (SMT) followed by post-editing, in order to increase efficiency of subtitle production process.