Search Results for author: Tomáš Musil

Found 11 papers, 2 papers with code

A Test Suite and Manual Evaluation of Document-Level NMT at WMT19

1 code implementation8 Aug 2019 Kateřina Rysová, Magdaléna Rysová, Tomáš Musil, Lucie Poláková, Ondřej Bojar

As the quality of machine translation rises and neural machine translation (NMT) is moving from sentence to document level translations, it is becoming increasingly difficult to evaluate the output of translation systems.

Machine Translation NMT +2

Debiasing Algorithm through Model Adaptation

1 code implementation29 Oct 2023 Tomasz Limisiewicz, David Mareček, Tomáš Musil

Large language models are becoming the go-to solution for the ever-growing number of tasks.

Examining Structure of Word Embeddings with PCA

no code implementations31 May 2019 Tomáš Musil

We also show that further examining histograms of classes along the principal component is important to understand the structure of representation of information in embeddings.

Machine Translation NMT +4

Derivational Morphological Relations in Word Embeddings

no code implementations6 Jun 2019 Tomáš Musil, Jonáš Vidra, David Mareček

Derivation is a type of a word-formation process which creates new words from existing ones by adding, changing or deleting affixes.

Clustering Word Embeddings

Semantic Holism and Word Representations in Artificial Neural Networks

no code implementations11 Mar 2020 Tomáš Musil

Taking Tugendhat's formal reinterpretation of Frege's work as a starting point, we demonstrate that it is analogical to the process of training the Skip-gram model and offers a possible explanation of its semantic properties.

Measuring Memorization Effect in Word-Level Neural Networks Probing

no code implementations29 Jun 2020 Rudolf Rosa, Tomáš Musil, David Mareček

In classical probing, a classifier is trained on the representations to extract the target linguistic information.

Machine Translation Memorization +1

Independent Components of Word Embeddings Represent Semantic Features

no code implementations19 Dec 2022 Tomáš Musil, David Mareček

Independent Component Analysis (ICA) is an algorithm originally developed for finding separate sources in a mixed signal, such as a recording of multiple people in the same room speaking at the same time.

Word Embeddings

Cannot find the paper you are looking for? You can Submit a new open access paper.