Search Results for author: Tania Avgustinova

Found 11 papers, 2 papers with code

CLIMB grammars: three projects using metagrammar engineering

no code implementations LREC 2012 Antske Fokkens, Tania Avgustinova, Yi Zhang

This paper introduces the CLIMB (Comparative Libraries of Implementations with Matrix Basis) methodology and grammars.

Code Generation

Machine Translation from an Intercomprehension Perspective

no code implementations WS 2019 Yu Chen, Tania Avgustinova

Within the first shared task on machine translation between similar languages, we present our first attempts on Czech to Polish machine translation from an intercomprehension perspective.

Machine Translation, Translation

The INCOMSLAV Platform: Experimental Website with Integrated Methods for Measuring Linguistic Distances and Asymmetries in Receptive Multilingualism

no code implementations LREC 2020 Irina Stenger, Klara Jagrova, Tania Avgustinova

We report on a web-based resource for conducting intercomprehension experiments with native speakers of Slavic languages and present our methods for measuring linguistic distances and asymmetries in receptive multilingualism.

Language Modelling

Cross-Domain Adaptation of Spoken Language Identification for Related Languages: The Curious Case of Slavic Languages

1 code implementation 2 Aug 2020 Badr M. Abdullah, Tania Avgustinova, Bernd Möbius, Dietrich Klakow

State-of-the-art spoken language identification (LID) systems, which are based on end-to-end deep neural networks, have shown remarkable success not only in discriminating between distant languages but also between closely-related languages or even different spoken varieties of the same language.

Language Identification, Spoken Language Identification

Rediscovering the Slavic Continuum in Representations Emerging from Neural Models of Spoken Language Identification

no code implementations VarDial (COLING) 2020 Badr M. Abdullah, Jacek Kudera, Tania Avgustinova, Bernd Möbius, Dietrich Klakow

In this paper, we present a neural model for Slavic language identification in speech signals and analyze its emergent representations to investigate whether they reflect objective measures of language relatedness and/or non-linguists' perception of language similarity.

Language Identification, Spoken Language Identification

Are Language-Agnostic Sentence Representations Actually Language-Agnostic?

no code implementations RANLP 2021 Yu Chen, Tania Avgustinova

With the emergence of pre-trained multilingual models, multilingual embeddings have been widely applied in various natural language processing tasks.

Sentence, Sentence Embeddings

incom.py 2.0 - Calculating Linguistic Distances and Asymmetries in Auditory Perception of Closely Related Languages

no code implementations RANLP 2021 Marius Mosbach, Irina Stenger, Tania Avgustinova, Bernd Möbius, Dietrich Klakow

We present an extended version of a tool developed for calculating linguistic distances and asymmetries in auditory perception of closely related languages.

regression

Modeling the Impact of Syntactic Distance and Surprisal on Cross-Slavic Text Comprehension

no code implementations LREC 2022 Irina Stenger, Philip Georgis, Tania Avgustinova, Bernd Möbius, Dietrich Klakow

We focus on the syntactic variation and measure syntactic distances between nine Slavic languages (Belarusian, Bulgarian, Croatian, Czech, Polish, Slovak, Slovene, Russian, and Ukrainian) using symmetric measures of insertion, deletion and movement of syntactic units in the parallel sentences of the fable “The North Wind and the Sun”.

Cloze Test, Reading Comprehension
