Search Results for author: Tania Avgustinova

Found 11 papers, 2 papers with code

CLIMB grammars: three projects using metagrammar engineering

no code implementations • LREC 2012 • Antske Fokkens, Tania Avgustinova, Yi Zhang

This paper introduces the CLIMB (Comparative Libraries of Implementations with Matrix Basis) methodology and grammars.

Paper
Add Code

Orthographic and Morphological Correspondences between Related Slavic Languages as a Base for Modeling of Mutual Intelligibility

no code implementations • LREC 2016 • Andrea Fischer, Kl{\'a}ra J{\'a}grov{\'a}, Irina Stenger, Tania Avgustinova, Dietrich Klakow, Rol Marti,

In an intercomprehension scenario, typically a native speaker of language L1 is confronted with output from an unknown, but related language L2.

Paper
Add Code

Machine Translation from an Intercomprehension Perspective

no code implementations • WS 2019 • Yu Chen, Tania Avgustinova

Within the first shared task on machine translation between similar languages, we present our first attempts on Czech to Polish machine translation from an intercomprehension perspective.

Machine Translation Translation

Paper
Add Code

incom.py - A Toolbox for Calculating Linguistic Distances and Asymmetries between Related Languages

no code implementations • RANLP 2019 • Marius Mosbach, Irina Stenger, Tania Avgustinova, Dietrich Klakow

Languages may be differently distant from each other and their mutual intelligibility may be asymmetric.

Paper
Add Code

The INCOMSLAV Platform: Experimental Website with Integrated Methods for Measuring Linguistic Distances and Asymmetries in Receptive Multilingualism

no code implementations • LREC 2020 • Irina Stenger, Klara Jagrova, Tania Avgustinova

We report on a web-based resource for conducting intercomprehension experiments with native speakers of Slavic languages and present our methods for measuring linguistic distances and asymmetries in receptive multilingualism.

Language Modelling

Paper
Add Code

Cross-Domain Adaptation of Spoken Language Identification for Related Languages: The Curious Case of Slavic Languages

1 code implementation • 2 Aug 2020 • Badr M. Abdullah, Tania Avgustinova, Bernd Möbius, Dietrich Klakow

State-of-the-art spoken language identification (LID) systems, which are based on end-to-end deep neural networks, have shown remarkable success not only in discriminating between distant languages but also between closely-related languages or even different spoken varieties of the same language.

Language Identification Spoken language identification +1

Paper
Code

Rediscovering the Slavic Continuum in Representations Emerging from Neural Models of Spoken Language Identification

no code implementations • VarDial (COLING) 2020 • Badr M. Abdullah, Jacek Kudera, Tania Avgustinova, Bernd Möbius, Dietrich Klakow

In this paper, we present a neural model for Slavic language identification in speech signals and analyze its emergent representations to investigate whether they reflect objective measures of language relatedness and/or non-linguists' perception of language similarity.

Language Identification Spoken language identification

Paper
Add Code

How Familiar Does That Sound? Cross-Lingual Representational Similarity Analysis of Acoustic Word Embeddings

1 code implementation • EMNLP (BlackboxNLP) 2021 • Badr M. Abdullah, Iuliia Zaitova, Tania Avgustinova, Bernd Möbius, Dietrich Klakow

We further discuss the implications of our work on modeling speech processing and language similarity with neural networks.

Experimental Design Word Embeddings

Paper
Code

Are Language-Agnostic Sentence Representations Actually Language-Agnostic?

no code implementations • RANLP 2021 • Yu Chen, Tania Avgustinova

With the emergence of pre-trained multilingual models, multilingual embeddings have been widely applied in various natural language processing tasks.

Sentence Sentence Embeddings +1

Paper
Add Code

incom.py 2.0 - Calculating Linguistic Distances and Asymmetries in Auditory Perception of Closely Related Languages

no code implementations • RANLP 2021 • Marius Mosbach, Irina Stenger, Tania Avgustinova, Bernd Möbius, Dietrich Klakow

We present an extended version of a tool developed for calculating linguistic distances and asymmetries in auditory perception of closely related languages.

regression

Paper
Add Code

Modeling the Impact of Syntactic Distance and Surprisal on Cross-Slavic Text Comprehension

no code implementations • LREC 2022 • Irina Stenger, Philip Georgis, Tania Avgustinova, Bernd Möbius, Dietrich Klakow

We focus on the syntactic variation and measure syntactic distances between nine Slavic languages (Belarusian, Bulgarian, Croatian, Czech, Polish, Slovak, Slovene, Russian, and Ukrainian) using symmetric measures of insertion, deletion and movement of syntactic units in the parallel sentences of the fable “The North Wind and the Sun”.

Cloze Test Reading Comprehension

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.