Acoustic Unit Discovery

9 papers with code • 1 benchmarks • 0 datasets

This task has no description! Would you like to contribute one?

Latest papers with no code

Revisiting speech segmentation and lexicon learning with better features

no code yet • 31 Jan 2024

We revisit a self-supervised method that segments unlabelled speech into word-like segments.

LUPET: Incorporating Hierarchical Information Path into Multilingual ASR

no code yet • 8 Jan 2024

Many factors have separately shown their effectiveness on improving multilingual ASR.

Regularizing Contrastive Predictive Coding for Speech Applications

no code yet • 12 Apr 2023

These representations significantly reduce the amount of labeled data needed for downstream task performance, such as automatic speech recognition.

Self-supervised language learning from raw audio: Lessons from the Zero Resource Speech Challenge

no code yet • 27 Oct 2022

Recent progress in self-supervised or unsupervised machine learning has opened the possibility of building a full speech processing system from raw audio without using any textual representations or expert labels such as phonemes, dictionaries or parse trees.

Learning Phone Recognition from Unpaired Audio and Phone Sequences Based on Generative Adversarial Network

no code yet • 29 Jul 2022

GAN training is adopted in the first stage to find the mapping relationship between unpaired speech and phone sequence.

A Temporal Extension of Latent Dirichlet Allocation for Unsupervised Acoustic Unit Discovery

no code yet • 23 Jun 2022

In this paper, we present an extension to LDA that uses a Markov chain to model temporal information.

Voice Conversion Based Speaker Normalization for Acoustic Unit Discovery

no code yet • 4 May 2021

Discovering speaker independent acoustic units purely from spoken input is known to be a hard problem.

A Hierarchical Subspace Model for Language-Attuned Acoustic Unit Discovery

no code yet • 4 Nov 2020

In the target language, we infer both the language and unit embeddings in an unsupervised manner, and in so doing, we simultaneously learn a subspace of units specific to that language and the units that dwell on it.

Unsupervised acoustic unit discovery for speech synthesis using discrete latent-variable neural networks

no code yet • 16 Apr 2019

For our submission to the ZeroSpeech 2019 challenge, we apply discrete latent-variable neural networks to unlabelled speech and use the discovered units for speech synthesis.

Unsupervised Word Segmentation from Speech with Attention

no code yet • 18 Jun 2018

We present a first attempt to perform attentional word segmentation directly from the speech signal, with the final goal to automatically identify lexical units in a low-resource, unwritten language (UL).