1 code implementation • 8 Nov 2019 • Jindřich Libovický, Rudolf Rosa, Alexander Fraser
Multilingual BERT (mBERT) provides sentence representations for 104 languages, which are useful for many multi-lingual tasks.
1 code implementation • Findings of the Association for Computational Linguistics 2020 • Jindřich Libovický, Rudolf Rosa, Alexander Fraser
Multilingual contextual embeddings, such as multilingual BERT and XLM-RoBERTa, have proved useful for many multi-lingual tasks.
1 code implementation • 22 Aug 2019 • Rudolf Rosa, Zdeněk Žabokrtský
We focus on the task of unsupervised lemmatization, i.e., grouping together inflected forms of one word under one label (a lemma) without the use of annotated training data.
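The grouping idea can be illustrated with a toy heuristic: cluster word forms that share a sufficiently long common prefix under one label. This greedy stemming sketch is not the paper's method (which works without such hand-set thresholds in the general case); it only shows what "grouping inflected forms under one lemma" means operationally.

```python
# Toy illustration of lemmatization as unsupervised grouping of word forms.
# NOT the paper's algorithm; just a prefix-sharing heuristic for intuition.

def common_prefix_len(a: str, b: str) -> int:
    """Length of the longest shared prefix of two strings."""
    n = 0
    for ca, cb in zip(a, b):
        if ca != cb:
            break
        n += 1
    return n

def group_forms(forms, min_prefix=4):
    """Greedily assign each form to a cluster keyed by a representative form."""
    clusters = {}  # representative form -> list of grouped forms
    for form in sorted(forms):
        for rep in clusters:
            if common_prefix_len(form, rep) >= min_prefix:
                clusters[rep].append(form)
                break
        else:
            clusters[form] = [form]
    return clusters

groups = group_forms(["walked", "walking", "walks", "talked", "talking"])
# e.g. "walked", "walking", "walks" end up under one cluster label
```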
1 code implementation • EMNLP (SIGTYP) 2020 • Martin Vastl, Daniel Zeman, Rudolf Rosa
We present our submission to the SIGTYP 2020 Shared Task on the prediction of typological features.
no code implementations • 16 Jun 2015 • Rudolf Rosa
We present our work on semi-supervised parsing of natural language sentences, focusing on multi-source crosslingual transfer of delexicalized dependency parsers.
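The core preprocessing step behind delexicalized transfer is easy to sketch: word forms are discarded and only universal POS tags are kept, so a parser trained on tag sequences in source languages can be applied to a target language. The tagged-sentence format below is illustrative, not the paper's actual data format.

```python
# Sketch of delexicalization for cross-lingual parser transfer: drop the
# words, keep only universal POS tags. After this step, sentences from
# different languages can look identical to the parser.

def delexicalize(tagged_sentence):
    """Replace each (form, upos) pair with its POS tag alone."""
    return [upos for form, upos in tagged_sentence]

en = [("the", "DET"), ("dog", "NOUN"), ("barks", "VERB")]
cs = [("ten", "DET"), ("pes", "NOUN"), ("štěká", "VERB")]

# Both sentences reduce to the same tag sequence:
assert delexicalize(en) == delexicalize(cs) == ["DET", "NOUN", "VERB"]
```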
no code implementations • CoNLL 2018 • Rudolf Rosa, David Mareček
This is a system description paper for the CUNI x-ling submission to the CoNLL 2018 UD Shared Task.
no code implementations • WS 2018 • David Mare{\v{c}}ek, Rudolf Rosa
This is a work in progress on extracting sentence tree structures from the encoder's self-attention weights when translating into another language with the Transformer neural network architecture.
no code implementations • WS 2017 • Rudolf Rosa, Daniel Zeman, David Mareček, Zdeněk Žabokrtský
We once had a corp, or should we say, it once had us
They showed us its tags, isn't it great, unified tags
They asked us to parse and they told us to use everything
So we looked around and we noticed there was near nothing
We took other langs, bitext aligned: words one-to-one
We played for two weeks, and then they said, here is the test
The parser kept training till morning, just until deadline
So we had to wait and hope what we get would be just fine
And, when we awoke, the results were done, we saw we'd won
So, we wrote this paper, isn't it good, Norwegian wood.
no code implementations • WS 2017 • Antonio Jimeno Yepes, Aurélie Névéol, Mariana Neves, Karin Verspoor, Ondřej Bojar, Arthur Boyer, Cristian Grozea, Barry Haddow, Madeleine Kittner, Yvonne Lichtblau, Pavel Pecina, Roland Roller, Rudolf Rosa, Amy Siu, Philippe Thomas, Saskia Trescher
no code implementations • LREC 2014 • Rudolf Rosa, Jan Mašek, David Mareček, Martin Popel, Daniel Zeman, Zdeněk Žabokrtský
We present HamleDT 2.0 (HArmonized Multi-LanguagE Dependency Treebank).
no code implementations • LREC 2014 • Petra Barančíková, Rudolf Rosa, Aleš Tamchyna
Grammatical correctness of the new reference sentences is ensured by applying Depfix to the newly created paraphrases.
no code implementations • WS 2019 • David Mareček, Rudolf Rosa
We inspect the multi-head self-attention in Transformer NMT encoders for three source languages, looking for patterns that could have a syntactic interpretation.
no code implementations • 27 Jun 2019 • Rudolf Rosa, David Mareček
We use the English model of BERT and explore how a deletion of one word in a sentence changes representations of other words.
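The deletion-probing idea can be sketched without the actual model: embed a sentence, delete one word, re-embed, and measure how far each remaining word's representation moved. Below, a trivial deterministic "embedding" that mixes each word with its neighbours stands in for BERT (the paper uses the English BERT model); only the comparison logic is the point.

```python
# Sketch of deletion probing: compare per-word vectors before and after
# removing one word. The toy_embed function is a stand-in for BERT, built
# so that a word's vector depends on its immediate neighbours.
import hashlib
import math

def word_hash(w: str) -> int:
    """Deterministic 32-bit hash of a word (stable across runs)."""
    return int(hashlib.md5(w.encode()).hexdigest()[:8], 16)

def toy_embed(tokens):
    """Stand-in contextual embedding: each word's 8-d vector sums bit
    patterns of the word itself and its immediate neighbours."""
    vecs = {}
    for i, tok in enumerate(tokens):
        ctx = tokens[max(0, i - 1):i + 2]  # word plus left/right neighbour
        v = [0.0] * 8
        for c in ctx:
            h = word_hash(c)
            for d in range(8):
                v[d] += ((h >> d) & 1) - 0.5
        vecs[tok] = v
    return vecs

def shift_after_deletion(tokens, removed):
    """How far each remaining word's vector moves when `removed` is deleted."""
    before = toy_embed(tokens)
    after = toy_embed([t for t in tokens if t != removed])
    return {t: math.dist(before[t], after[t]) for t in after}

shifts = shift_after_deletion(["the", "cat", "sat", "down"], "cat")
# Words whose context window lost "cat" can move; "down" keeps its context
# and stays put in this toy model.
```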
2 code implementations • Findings of the Association for Computational Linguistics 2020 • Tomasz Limisiewicz, Rudolf Rosa, David Mareček
This work focuses on analyzing the form and extent of syntactic abstraction captured by BERT by extracting labeled dependency trees from self-attentions.
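A minimal version of tree extraction from attention can be sketched as follows: for each word, pick the position it attends to most as its head. The paper extracts labeled trees with more careful head selection and aggregation across heads; this shows only the basic idea, on a made-up attention matrix.

```python
# Sketch: derive dependency heads from a self-attention matrix by taking,
# for each word, the argmax of its attention row. The matrix below is
# invented for illustration; rows sum to 1 as attention weights do.

words = ["ROOT", "the", "cat", "sleeps"]
attn = [
    [1.00, 0.00, 0.00, 0.00],  # ROOT attends to itself
    [0.05, 0.05, 0.80, 0.10],  # "the"    -> mostly attends to "cat"
    [0.10, 0.10, 0.10, 0.70],  # "cat"    -> mostly attends to "sleeps"
    [0.75, 0.05, 0.10, 0.10],  # "sleeps" -> mostly attends to ROOT
]

def extract_heads(attn):
    """Head of word i = index of the maximum in attention row i."""
    return [max(range(len(row)), key=row.__getitem__) for row in attn]

heads = extract_heads(attn)  # heads[i] = index of word i's syntactic head
```

A real extraction would additionally enforce that the result is a well-formed tree (e.g. via a maximum-spanning-tree algorithm) rather than trusting independent argmaxes.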
no code implementations • 25 Jun 2020 • Rudolf Rosa, Ondřej Dušek, Tom Kocmi, David Mareček, Tomáš Musil, Patrícia Schmidtová, Dominik Jurko, Ondřej Bojar, Daniel Hrbek, David Košťák, Martina Kinská, Josef Doležal, Klára Vosecká
We present THEaiTRE, a newly started project aimed at the automatic generation of theatre play scripts.
no code implementations • 29 Jun 2020 • Rudolf Rosa, Tomáš Musil, David Mareček
In classical probing, a classifier is trained on the representations to extract the target linguistic information.
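The classical-probing setup described here can be sketched end to end: freeze some representations, train a small classifier on top to predict a linguistic label, and read above-chance accuracy as evidence the information is present. Toy 2-D vectors and a perceptron stand in for real model representations and the probe.

```python
# Sketch of classical probing: a tiny perceptron "probe" is trained on
# frozen toy "representations" to predict a binary linguistic label.

def train_perceptron(data, epochs=20):
    """data: list of ((x0, x1), label) with label in {0, 1}."""
    w, b = [0.0, 0.0], 0.0
    for _ in range(epochs):
        for (x0, x1), y in data:
            pred = 1 if w[0] * x0 + w[1] * x1 + b > 0 else 0
            err = y - pred          # perceptron update on mistakes only
            w[0] += err * x0
            w[1] += err * x1
            b += err
    return w, b

# Toy "representations": label 0 (say, nouns) clusters apart from label 1.
data = [([1.0, 0.2], 0), ([0.9, 0.1], 0), ([0.1, 1.0], 1), ([0.2, 0.9], 1)]
w, b = train_perceptron(data)

# Probe accuracy on the (linearly separable) toy data:
accuracy = sum(
    (1 if w[0] * x0 + w[1] * x1 + b > 0 else 0) == y
    for (x0, x1), y in data
) / len(data)
```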
no code implementations • 17 Feb 2021 • Rudolf Rosa, Tomáš Musil, Ondřej Dušek, Dominik Jurko, Patrícia Schmidtová, David Mareček, Ondřej Bojar, Tom Kocmi, Daniel Hrbek, David Košťák, Martina Kinská, Marie Nováková, Josef Doležal, Klára Vosecká, Tomáš Studeník, Petr Žabka
We present the first version of a system for interactive generation of theatre play scripts.
no code implementations • LANTERN (COLING) 2020 • Abhishek Agrawal, Rudolf Rosa
We also augment a graph-based parser with eye-tracking features and parse the Dundee Corpus to corroborate our findings from the sequence labelling parser.
no code implementations • 16 Jun 2022 • Patrícia Schmidtová, Dávid Javorský, Christián Mikláš, Tomáš Musil, Rudolf Rosa, Ondřej Dušek
We present a novel approach to generating scripts by using agents with different personality types.
no code implementations • NAACL (WNU) 2022 • Rudolf Rosa, Patrícia Schmidtová, Ondřej Dušek, Tomáš Musil, David Mareček, Saad Obaid, Marie Nováková, Klára Vosecká, Josef Doležal
We experiment with adapting generative language models for the generation of long coherent narratives in the form of theatre plays.
no code implementations • COLING (CreativeSumm) 2022 • Rishu Kumar, Rudolf Rosa
This system description paper details TEAM UFAL’s approach for the SummScreen, TVMegasite subtask of the CreativeSumm shared task.
no code implementations • 13 Apr 2024 • Tomáš Sourada, Jana Straková, Rudolf Rosa
For testing in OOV conditions, we automatically extracted a large dataset of nouns in the morphologically rich Czech language, with lemma-disjoint data splits, and we further manually annotated a real-world OOV dataset of neologisms.
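The lemma-disjoint split mentioned above has a simple invariant worth making explicit: all inflected forms of a lemma must land in the same partition, so test lemmas are genuinely unseen at training time. The data and the choice of held-out lemmas below are made up for illustration.

```python
# Sketch of a lemma-disjoint data split: route whole lemmas (with all
# their inflected forms) to either train or test, never both.

def lemma_disjoint_split(pairs, test_lemmas):
    """pairs: (form, lemma) tuples; test_lemmas: set of held-out lemmas."""
    train = [(f, l) for f, l in pairs if l not in test_lemmas]
    test = [(f, l) for f, l in pairs if l in test_lemmas]
    return train, test

pairs = [
    ("hradu", "hrad"), ("hradem", "hrad"),  # Czech "castle", two forms
    ("ženy", "žena"), ("ženou", "žena"),    # Czech "woman", two forms
]
train, test = lemma_disjoint_split(pairs, {"žena"})

# No lemma appears on both sides of the split:
assert {l for _, l in train}.isdisjoint({l for _, l in test})
```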