Search Results for author: Ondřej Bojar

Found 81 papers, 2 papers with code

ELITR Multilingual Live Subtitling: Demo and Strategy

no code implementations • EACL 2021 • Ondřej Bojar, Dominik Macháček, Sangeet Sagar, Otakar Smrž, Jonáš Kratochvíl, Peter Polák, Ebrahim Ansari, Mohammad Mahmoudi, Rishu Kumar, Dario Franceschini, Chiara Canton, Ivan Simonini, Thai-Son Nguyen, Felix Schneider, Sebastian Stüker, Alex Waibel, Barry Haddow, Rico Sennrich, Philip Williams

This paper presents an automatic speech translation system aimed at live subtitling of conference presentations.

Translation

SLTEV: Comprehensive Evaluation of Spoken Language Translation

1 code implementation • EACL 2021 • Ebrahim Ansari, Ondřej Bojar, Barry Haddow, Mohammad Mahmoudi

SLTev reports the quality, latency, and stability of an SLT candidate output based on the time-stamped transcript and reference translation into a target language.

Machine Translation Translation

FINDINGS OF THE IWSLT 2020 EVALUATION CAMPAIGN

no code implementations • WS 2020 • Ebrahim Ansari, Amittai Axelrod, Nguyen Bach, Ondřej Bojar, Roldano Cattoni, Fahim Dalvi, Nadir Durrani, Marcello Federico, Christian Federmann, Jiatao Gu, Fei Huang, Kevin Knight, Xutai Ma, Ajay Nagesh, Matteo Negri, Jan Niehues, Juan Pino, Elizabeth Salesky, Xing Shi, Sebastian Stüker, Marco Turchi, Alexander Waibel, Changhan Wang

The evaluation campaign of the International Conference on Spoken Language Translation (IWSLT 2020) featured this year six challenge tracks: (i) Simultaneous speech translation, (ii) Video speech translation, (iii) Offline speech translation, (iv) Conversational speech translation, (v) Open domain translation, and (vi) Non-native speech translation.

Translation

CUNI Neural ASR with Phoneme-Level Intermediate Step for Non-Native SLT at IWSLT 2020

no code implementations • WS 2020 • Peter Polák, Sangeet Sagar, Dominik Macháček, Ondřej Bojar

We complement this ASR with off-the-shelf MT systems to take part also in the speech translation track.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Removing European Language Barriers with Innovative Machine Translation Technology

no code implementations • LREC 2020 • Dario Franceschini, Chiara Canton, Ivan Simonini, Armin Schweinfurth, Adelheid Glott, Sebastian Stüker, Thai-Son Nguyen, Felix Schneider, Thanh-Le Ha, Alex Waibel, Barry Haddow, Philip Williams, Rico Sennrich, Ondřej Bojar, Sangeet Sagar, Dominik Macháček, Otakar Smrž

This paper presents our progress towards deploying a versatile communication platform in the task of highly multilingual live speech translation for conferences and remote meetings live subtitling.

Machine Translation Translation

Outbound Translation User Interface Ptakopět: A Pilot Study

no code implementations • LREC 2020 • Vilém Zouhar, Ondřej Bojar

It is not uncommon for Internet users to have to produce a text in a foreign language they have very little knowledge of and are unable to verify the translation quality.

Translation

Two Huge Title and Keyword Generation Corpora of Research Articles

no code implementations • LREC 2020 • Erion Çano, Ondřej Bojar

Recent developments in sequence-to-sequence learning with neural networks have considerably improved the quality of automatically generated text summaries and document keywords, stipulating the need for even bigger training corpora.

Text Summarization Vocal Bursts Valence Prediction

OdiEnCorp 2.0: Odia-English Parallel Corpus for Machine Translation

no code implementations • LREC 2020 • Shantipriya Parida, Satya Ranjan Dash, Ondřej Bojar, Petr Motlicek, Priyanka Pattnaik, Debasish Kumar Mallick

The preparation of parallel corpora is a challenging task, particularly for languages that suffer from under-representation in the digital world.

Machine Translation NMT +3

Idiap NMT System for WAT 2019 Multimodal Translation Task

no code implementations • WS 2019 • Shantipriya Parida, Ondřej Bojar, Petr Motlicek

This paper describes the Idiap submission to WAT 2019 for the English-Hindi Multi-Modal Translation Task.

NMT Translation

Overview of the 6th Workshop on Asian Translation

no code implementations • WS 2019 • Toshiaki Nakazawa, Nobushige Doi, Shohei Higashiyama, Chenchen Ding, Raj Dabre, Hideya Mino, Isao Goto, Win Pa Pa, Anoop Kunchukuttan, Yusuke Oda, Shantipriya Parida, Ondřej Bojar, Sadao Kurohashi

This paper presents the results of the shared tasks from the 6th workshop on Asian translation (WAT2019) including Ja↔En, Ja↔Zh scientific paper translation subtasks, Ja↔En, Ja↔Ko, Ja↔En patent translation subtasks, Hi↔En, My↔En, Km↔En, Ta↔En mixed domain subtasks and Ru↔Ja news commentary translation task.

Translation

Efficiency Metrics for Data-Driven Models: A Text Summarization Case Study

no code implementations • WS 2019 • Erion Çano, Ondřej Bojar

Using data-driven models for solving text summarization or similar tasks has become very common in the last years.

Text Summarization

Findings of the 2019 Conference on Machine Translation (WMT19)

no code implementations • WS 2019 • Loïc Barrault, Ondřej Bojar, Marta R. Costa-jussà, Christian Federmann, Mark Fishel, Yvette Graham, Barry Haddow, Matthias Huck, Philipp Koehn, Shervin Malmasi, Christof Monz, Mathias Müller, Santanu Pal, Matt Post, Marcos Zampieri

This paper presents the results of the premier shared task organized alongside the Conference on Machine Translation (WMT) 2019.

Machine Translation Translation

Results of the WMT19 Metrics Shared Task: Segment-Level and Strong MT Systems Pose Big Challenges

no code implementations • WS 2019 • Qingsong Ma, Johnny Wei, Ondřej Bojar, Yvette Graham

This paper presents the results of the WMT19 Metrics Shared Task.

Translation

A Test Suite and Manual Evaluation of Document-Level NMT at WMT19

no code implementations • WS 2019 • Kateřina Rysová, Magdaléna Rysová, Tomáš Musil, Lucie Poláková, Ondřej Bojar

As the quality of machine translation rises and neural machine translation (NMT) is moving from sentence to document level translations, it is becoming increasingly difficult to evaluate the output of translation systems.

Machine Translation NMT +2

SAO WMT19 Test Suite: Machine Translation of Audit Reports

1 code implementation • WS 2019 • Tereza Vojtěchová, Michal Novák, Miloš Klouček, Ondřej Bojar

This paper describes a machine translation test set of documents from the auditing domain and its use as one of the "test suites" in the WMT19 News Translation Task for translation directions involving Czech, English and German.

Machine Translation Translation

CUNI Submission for Low-Resource Languages in WMT News 2019

no code implementations • WS 2019 • Tom Kocmi, Ondřej Bojar

This paper describes the CUNI submission to the WMT 2019 News Translation Shared Task for the low-resource languages: Gujarati-English and Kazakh-English.

Transfer Learning Translation

CUNI Systems for the Unsupervised News Translation Task in WMT 2019

no code implementations • WS 2019 • Ivana Kvapilíková, Dominik Macháček, Ondřej Bojar

In this paper we describe the CUNI translation system used for the unsupervised news shared task of the ACL 2019 Fourth Conference on Machine Translation (WMT19).

Machine Translation Translation

Unsupervised Pretraining for Neural Machine Translation Using Elastic Weight Consolidation

no code implementations • ACL 2019 • Dušan Variš, Ondřej Bojar

In our method, we initialize the weights of the encoder and decoder with two language models that are trained with monolingual data and then fine-tune the model on parallel data using Elastic Weight Consolidation (EWC) to avoid forgetting of the original language modeling task.

Language Modelling Machine Translation +2

Keyphrase Generation: A Text Summarization Struggle

no code implementations • NAACL 2019 • Erion Çano, Ondřej Bojar

Most of the proposed supervised and unsupervised methods for keyphrase generation are unable to produce terms that are valuable but do not appear in the text.

Keyphrase Generation Text Summarization

Findings of the 2018 Conference on Machine Translation (WMT18)

no code implementations • WS 2018 • Ondřej Bojar, Christian Federmann, Mark Fishel, Yvette Graham, Barry Haddow, Philipp Koehn, Christof Monz

This paper presents the results of the premier shared task organized alongside the Conference on Machine Translation (WMT) 2018.

Automatic Post-Editing Multimodal Machine Translation +1

Testsuite on Czech–English Grammatical Contrasts

no code implementations • WS 2018 • Silvie Cinková, Ondřej Bojar

We present a pilot study of machine translation of selected grammatical contrasts between Czech and English in WMT18 News Translation Task.

Machine Translation Translation

Proceedings of the Third Conference on Machine Translation: Shared Task Papers

no code implementations • EMNLP 2018 • Ondřej Bojar, Rajen Chatterjee, Christian Federmann, Mark Fishel, Yvette Graham, Barry Haddow, Matthias Huck, Antonio Jimeno Yepes, Philipp Koehn, Christof Monz, Matteo Negri, Aurélie Névéol, Mariana Neves, Matt Post, Lucia Specia, Marco Turchi, Karin Verspoor

Machine Translation Translation

EvalD Reference-Less Discourse Evaluation for WMT18

no code implementations • WS 2018 • Ondřej Bojar, Jiří Mírovský, Kateřina Rysová, Magdaléna Rysová

We present the results of automatic evaluation of discourse in machine translation (MT) outputs using the EVALD tool.

Machine Translation Translation

Results of the WMT18 Metrics Shared Task: Both characters and embeddings achieve good performance

no code implementations • WS 2018 • Qingsong Ma, Ondřej Bojar, Yvette Graham

We asked participants of this task to score the outputs of the MT systems involved in the WMT18 News Translation Task with automatic metrics.

Machine Translation Sentence +1

The WMT'18 Morpheval test suites for English-Czech, English-German, English-Finnish and Turkish-English

no code implementations • WS 2018 • Franck Burlot, Yves Scherrer, Vinit Ravishankar, Ondřej Bojar, Stig-Arne Grönroos, Maarit Koponen, Tommi Nieminen, François Yvon

Progress in the quality of machine translation output calls for new automatic evaluation procedures and metrics.

Machine Translation Translation

CUNI Submissions in WMT18

no code implementations • WS 2018 • Tom Kocmi, Roman Sudarikov, Ondřej Bojar

Our main focus was the low-resource language pair of Estonian and English for which we utilized Finnish parallel data in a simple method.

Machine Translation Translation

Are BLEU and Meaning Representation in Opposition?

no code implementations • ACL 2018 • Ondřej Cífka, Ondřej Bojar

One possible way of obtaining continuous-space sentence representations is by training neural machine translation (NMT) systems.

General Classification Machine Translation +4

Neural Monkey: The Current State and Beyond

no code implementations • WS 2018 • Jindřich Helcl, Jindřich Libovický, Tom Kocmi, Tomáš Musil, Ondřej Cífka, Dušan Variš, Ondřej Bojar

Image Captioning Machine Translation +3

CUNI NMT System for WAT 2017 Translation Tasks

no code implementations • WS 2017 • Tom Kocmi, Dušan Variš, Ondřej Bojar

The paper presents this year's CUNI submissions to the WAT 2017 Translation Task focusing on the Japanese-English translation, namely Scientific papers subtask, Patents subtask and Newswire subtask.

Machine Translation NMT +2

CUNI System for WMT17 Automatic Post-Editing Task

no code implementations • WS 2017 • Dušan Variš, Ondřej Bojar

Automatic Post-Editing

Variable Mini-Batch Sizing and Pre-Trained Embeddings

no code implementations • WS 2017 • Mostafa Abdou, Vladan Glončák, Ondřej Bojar

Machine Translation Word Embeddings

The QT21 Combined Machine Translation System for English to Latvian

no code implementations • WS 2017 • Jan-Thorsten Peter, Hermann Ney, Ondřej Bojar, Ngoc-Quan Pham, Jan Niehues, Alex Waibel, Franck Burlot, François Yvon, Mārcis Pinnis, Valters Šics, Jasmijn Bastings, Miguel Rios, Wilker Aziz, Philip Williams, Frédéric Blain, Lucia Specia

Machine Translation Translation

CUNI Experiments for WMT17 Metrics Task

no code implementations • WS 2017 • David Mareček, Ondřej Bojar, Ondřej Hübsch, Rudolf Rosa, Dušan Variš

Dependency Parsing Machine Translation +1

Findings of the 2017 Conference on Machine Translation (WMT17)

no code implementations • WS 2017 • Ondřej Bojar, Rajen Chatterjee, Christian Federmann, Yvette Graham, Barry Haddow, Shu-Jian Huang, Matthias Huck, Philipp Koehn, Qun Liu, Varvara Logacheva, Christof Monz, Matteo Negri, Matt Post, Raphael Rubino, Lucia Specia, Marco Turchi

Automatic Post-Editing Multimodal Machine Translation +1

Results of the WMT17 Neural MT Training Task

no code implementations • WS 2017 • Ondřej Bojar, Jindřich Helcl, Tom Kocmi, Jindřich Libovický, Tomáš Musil

Machine Translation

CUNI submission in WMT17: Chimera goes neural

no code implementations • WS 2017 • Roman Sudarikov, David Mareček, Tom Kocmi, Dušan Variš, Ondřej Bojar

Machine Translation

Findings of the WMT 2017 Biomedical Translation Shared Task

no code implementations • WS 2017 • Antonio Jimeno Yepes, Aurélie Névéol, Mariana Neves, Karin Verspoor, Ondřej Bojar, Arthur Boyer, Cristian Grozea, Barry Haddow, Madeleine Kittner, Yvonne Lichtblau, Pavel Pecina, Roland Roller, Rudolf Rosa, Amy Siu, Philippe Thomas, Saskia Trescher

Machine Translation Translation

Results of the WMT17 Metrics Shared Task

no code implementations • WS 2017 • Ondřej Bojar, Yvette Graham, Amir Kamran

Machine Translation

Producing Unseen Morphological Variants in Statistical Machine Translation

no code implementations • EACL 2017 • Matthias Huck, Aleš Tamchyna, Ondřej Bojar, Alexander Fraser

Translating into morphologically rich languages is difficult.

Machine Translation Translation

Enriching Source for English-to-Urdu Machine Translation

no code implementations • WS 2016 • Bushra Jawaid, Amir Kamran, Ondřej Bojar

This paper focuses on the generation of case markers for free word order languages that use case markers as phrasal clitics for marking the relationship between the dependent-noun and its head.

Machine Translation Translation

Verb sense disambiguation in Machine Translation

no code implementations • WS 2016 • Roman Sudarikov, Ondřej Dušek, Martin Holub, Ondřej Bojar, Vincent Kríž

We describe experiments in Machine Translation using word sense disambiguation (WSD) information.

Machine Translation Translation +1

Moses & Treex Hybrid MT Systems Bestiary

no code implementations • WS 2016 • Rudolf Rosa, Martin Popel, Ondřej Bojar, David Mareček, Ondřej Dušek

Language Modelling Machine Translation +1

Findings of the 2016 Conference on Machine Translation

no code implementations • WS 2016 • Ondřej Bojar, Rajen Chatterjee, Christian Federmann, Yvette Graham, Barry Haddow, Matthias Huck, Antonio Jimeno Yepes, Philipp Koehn, Varvara Logacheva, Christof Monz, Matteo Negri, Aurélie Névéol, Mariana Neves, Martin Popel, Matt Post, Raphael Rubino, Carolina Scarton, Lucia Specia, Marco Turchi, Karin Verspoor, Marcos Zampieri

Automatic Post-Editing Multimodal Machine Translation +1

CUNI-LMU Submissions in WMT2016: Chimera Constrained and Beaten

no code implementations • WS 2016 • Aleš Tamchyna, Roman Sudarikov, Ondřej Bojar, Alexander Fraser

Machine Translation

Bilingual Embeddings and Word Alignments for Translation Quality Estimation

no code implementations • WS 2016 • Amal Abdelsalam, Ondřej Bojar, Samhaa El-Beltagy

Machine Translation Translation +2

Particle Swarm Optimization Submission for WMT16 Tuning Task

no code implementations • WS 2016 • Viktor Kocur, Ondřej Bojar

Machine Translation

Dictionary-based Domain Adaptation of MT Systems without Retraining

no code implementations • WS 2016 • Rudolf Rosa, Roman Sudarikov, Michal Novák, Martin Popel, Ondřej Bojar

Domain Adaptation Machine Translation

Edinburgh's Statistical Machine Translation Systems for WMT16

no code implementations • WS 2016 • Philip Williams, Rico Sennrich, Maria Nădejde, Matthias Huck, Barry Haddow, Ondřej Bojar

Language Modelling Machine Translation +2

Results of the WMT16 Tuning Shared Task

no code implementations • WS 2016 • Bushra Jawaid, Amir Kamran, Miloš Stanojević, Ondřej Bojar

Machine Translation

Using Term Position Similarity and Language Modeling for Bilingual Document Alignment

no code implementations • WS 2016 • Thanh C. Le, Hoa Trong Vu, Jonathan Oberländer, Ondřej Bojar

Information Retrieval Language Modelling +2

Proceedings of the First Conference on Machine Translation: Volume 2, Shared Task Papers

no code implementations • WS 2016 • Ondřej Bojar, Christian Buck, Rajen Chatterjee, Christian Federmann, Liane Guillou, Barry Haddow, Matthias Huck, Antonio Jimeno Yepes, Aurélie Névéol, Mariana Neves, Pavel Pecina, Martin Popel, Philipp Koehn, Christof Monz, Matteo Negri, Matt Post, Lucia Specia, Karin Verspoor, Jörg Tiedemann, Marco Turchi

Machine Translation Translation

The QT21/HimL Combined Machine Translation System

no code implementations • WS 2016 • Jan-Thorsten Peter, Tamer Alkhouli, Hermann Ney, Matthias Huck, Fabienne Braune, Alexander Fraser, Aleš Tamchyna, Ondřej Bojar, Barry Haddow, Rico Sennrich, Frédéric Blain, Lucia Specia, Jan Niehues, Alex Waibel, Alexandre Allauzen, Lauriane Aufrant, Franck Burlot, Elena Knyazeva, Thomas Lavergne, François Yvon, Mārcis Pinnis, Stella Frank

Ranked #12 on Machine Translation on WMT2016 English-Romanian

Machine Translation Translation

Results of the WMT16 Metrics Shared Task

no code implementations • WS 2016 • Ondřej Bojar, Yvette Graham, Amir Kamran, Miloš Stanojević

Machine Translation

CUNI in WMT15: Chimera Strikes Again

no code implementations • WS 2015 • Ondřej Bojar, Aleš Tamchyna

Machine Translation

Results of the WMT15 Tuning Shared Task

no code implementations • WS 2015 • Miloš Stanojević, Amir Kamran, Ondřej Bojar

Language Modelling Machine Translation

Findings of the 2015 Workshop on Statistical Machine Translation

no code implementations • WS 2015 • Ondřej Bojar, Rajen Chatterjee, Christian Federmann, Barry Haddow, Matthias Huck, Chris Hokamp, Philipp Koehn, Varvara Logacheva, Christof Monz, Matteo Negri, Matt Post, Carolina Scarton, Lucia Specia, Marco Turchi

Automatic Post-Editing Translation

Results of the WMT15 Metrics Shared Task

no code implementations • WS 2015 • Miloš Stanojević, Amir Kamran, Philipp Koehn, Ondřej Bojar

Machine Translation

What a Transfer-Based System Brings to the Combination with PBMT

no code implementations • WS 2015 • Aleš Tamchyna, Ondřej Bojar

TeamUFAL: WSD+EL as Document Retrieval

no code implementations • SEMEVAL 2015 • Petr Fanta, Roman Sudarikov, Ondřej Bojar

Entity Linking Information Retrieval +4

English to Urdu Statistical Machine Translation: Establishing a Baseline

no code implementations • WS 2014 • Bushra Jawaid, Amir Kamran, Ondřej Bojar

Machine Translation Translation

Comparing Czech and English AMRs

no code implementations • WS 2014 • Jan Hajič, Ondřej Bojar, Zdeňka Urešová

Results of the WMT14 Metrics Shared Task

no code implementations • WS 2014 • Matouš Macháček, Ondřej Bojar

Machine Translation

CUNI in WMT14: Chimera Still Awaits Bellerophon

no code implementations • WS 2014 • Aleš Tamchyna, Martin Popel, Rudolf Rosa, Ondřej Bojar

Automatic Post-Editing Language Modelling

Two-Step Machine Translation with Lattices

no code implementations • LREC 2014 • Bushra Jawaid, Ondřej Bojar

The idea of two-step machine translation was introduced to divide the complexity of the search space into two independent steps: (1) lexical translation and reordering, and (2) conjugation and declination in the target language.

Language Modelling Lemmatization +3

Not an Interlingua, But Close: Comparison of English AMRs to Chinese and Czech

no code implementations • LREC 2014 • Nianwen Xue, Ondřej Bojar, Jan Hajič, Martha Palmer, Zdeňka Urešová, Xiuhong Zhang

Abstract Meaning Representations (AMRs) are rooted, directional and labeled graphs that abstract away from morpho-syntactic idiosyncrasies such as word category (verbs and nouns), word order, and function words (determiners, some prepositions).

Machine Translation Semantic Parsing +2

HindEnCorp - Hindi-English and Hindi-only Corpus for Machine Translation

no code implementations • LREC 2014 • Ondřej Bojar, Vojtěch Diatka, Pavel Rychlý, Pavel Straňák, Vít Suchomel, Aleš Tamchyna, Daniel Zeman

HindEnCorp consists of 274k parallel sentences (3.9 million Hindi and 3.8 million English tokens).

Machine Translation Translation

A Tagged Corpus and a Tagger for Urdu

no code implementations • LREC 2014 • Bushra Jawaid, Amir Kamran, Ondřej Bojar

In this paper, we describe a release of a sizeable monolingual Urdu corpus automatically tagged with part-of-speech tags.

Results of the WMT13 Metrics Shared Task

no code implementations • WS 2013 • Matouš Macháček, Ondřej Bojar

PhraseFix: Statistical Post-Editing of TectoMT

no code implementations • WS 2013 • Petra Galuščáková, Martin Popel, Ondřej Bojar

Machine Translation

Findings of the 2013 Workshop on Statistical Machine Translation

no code implementations • WS 2013 • Ondřej Bojar, Christian Buck, Chris Callison-Burch, Christian Federmann, Barry Haddow, Philipp Koehn, Christof Monz, Matt Post, Radu Soricut, Lucia Specia

Machine Translation Translation

Chimera – Three Heads for English-to-Czech Translation

no code implementations • WS 2013 • Ondřej Bojar, Rudolf Rosa, Aleš Tamchyna

Lemmatization Machine Translation +1

Morphological Processing for English-Tamil Statistical Machine Translation

no code implementations • WS 2012 • Loganathan Ramasamy, Ondřej Bojar, Zdeněk Žabokrtský

Machine Translation Translation

Tagger Voting for Urdu

no code implementations • WS 2012 • Bushra Jawaid, Ondřej Bojar

Part-Of-Speech Tagging

Towards a Predicate-Argument Evaluation for MT

no code implementations • WS 2012 • Ondřej Bojar, Dekai Wu

Machine Translation

Selecting Data for English-to-Czech Machine Translation

no code implementations • WS 2012 • Aleš Tamchyna, Petra Galuščáková, Amir Kamran, Miloš Stanojević, Ondřej Bojar

Domain Adaptation Language Modelling +2

TerrorCat: a Translation Error Categorization-based MT Quality Metric

no code implementations • WS 2012 • Mark Fishel, Rico Sennrich, Maja Popović, Ondřej Bojar

Machine Translation Translation

Probes in a Taxonomy of Factored Phrase-Based Models

no code implementations • WS 2012 • Ondřej Bojar, Bushra Jawaid, Amir Kamran

Machine Translation

Terra: a Collection of Translation Error-Annotated Corpora

no code implementations • LREC 2012 • Mark Fishel, Ondřej Bojar, Maja Popović

Recently the first methods of automatic diagnostics of machine translation have emerged; since this area of research is relatively young, the efforts are not coordinated.

Machine Translation Translation

Automatic MT Error Analysis: Hjerson Helping Addicter

no code implementations • LREC 2012 • Jan Berka, Ondřej Bojar, Mark Fishel, Maja Popović, Daniel Zeman

We present a complex, open source tool for detailed machine translation error analysis providing the user with automatic error detection and classification, several monolingual alignment algorithms as well as with training and test corpus browsing.

General Classification Machine Translation +1

Announcing Prague Czech-English Dependency Treebank 2.0

no code implementations • LREC 2012 • Jan Hajič, Eva Hajičová, Jarmila Panevová, Petr Sgall, Ondřej Bojar, Silvie Cinková, Eva Fučíková, Marie Mikulová, Petr Pajas, Jan Popelka, Jiří Semecký, Jana Šindlerová, Jan Štěpánek, Josef Toman, Zdeňka Urešová, Zdeněk Žabokrtský

We introduce a substantial update of the Prague Czech-English Dependency Treebank, a parallel corpus manually annotated at the deep syntactic layer of linguistic representation.

Coreference Resolution Sentence

The Joy of Parallelism with CzEng 1.0

no code implementations • LREC 2012 • Ondřej Bojar, Zdeněk Žabokrtský, Ondřej Dušek, Petra Galuščáková, Martin Majliš, David Mareček, Jiří Maršík, Michal Novák, Martin Popel, Aleš Tamchyna

CzEng 1.0 is automatically aligned at the level of sentences as well as words.

Machine Translation Sentence
