Search Results for author: Martin Jansche

Found 10 papers, 3 papers with code

Towards Induction of Structured Phoneme Inventories

no code implementations12 Oct 2020 Alexander Gutkin, Martin Jansche, Lucy Skidmore

This extended abstract surveying the work on phonological typology was prepared for "SIGTYP 2020: The Second Workshop on Computational Research in Linguistic Typology" to be held at EMNLP 2020.

Burmese Speech Corpus, Finite-State Text Normalization and Pronunciation Grammars with an Application to Text-to-Speech

no code implementations LREC 2020 Yin May Oo, Theeraphol Wattanavekin, Chenfang Li, Pasindu De Silva, Supheakmungkol Sarin, Knot Pipatsrisawat, Martin Jansche, Oddur Kjartansson, Alex Gutkin, er

This paper introduces an open-source crowd-sourced multi-speaker speech corpus along with the comprehensive set of finite-state transducer (FST) grammars for performing text normalization for the Burmese (Myanmar) language.

Open-source Multi-speaker Speech Corpora for Building Gujarati, Kannada, Malayalam, Marathi, Tamil and Telugu Speech Synthesis Systems

no code implementations LREC 2020 Fei He, Shan-Hui Cathy Chu, Oddur Kjartansson, Clara Rivera, Anna Katanova, Alex Gutkin, er, Isin Demirsahin, Cibu Johny, Martin Jansche, Supheakmungkol Sarin, Knot Pipatsrisawat

We present free high quality multi-speaker speech corpora for Gujarati, Kannada, Malayalam, Marathi, Tamil and Telugu, which are six of the twenty two official languages of India spoken by 374 million native speakers.

Speech Synthesis

Linguistic Typology Features from Text: Inferring the Sparse Features of World Atlas of Language Structures

no code implementations30 Apr 2020 Alexander Gutkin, Tatiana Merkulova, Martin Jansche

In this paper we investigate whether the various linguistic features from World Atlas of Language Structures (WALS) can be reliably inferred from multi-lingual text.

Multi-Label Classification

Sampling from Stochastic Finite Automata with Applications to CTC Decoding

2 code implementations21 May 2019 Martin Jansche, Alexander Gutkin

We consider the problem of efficient sampling: drawing random string variates from the probability distribution represented by stochastic automata and transformations of those.

TTS for Low Resource Languages: A Bangla Synthesizer

no code implementations LREC 2016 Alex Gutkin, er, Linne Ha, Martin Jansche, Knot Pipatsrisawat, Richard Sproat

We present a text-to-speech (TTS) system designed for the dialect of Bengali spoken in Bangladesh.

Cannot find the paper you are looking for? You can Submit a new open access paper.