Speech

Unsupervised Speech Recognition

6 papers with code • 0 benchmarks • 0 datasets

This task has no description! Would you like to contribute one?

Benchmarks

Add a Result

These leaderboards are used to track progress in Unsupervised Speech Recognition

No evaluation results yet. Help compare methods by submitting evaluation metrics.

Libraries

Use these libraries to find Unsupervised Speech Recognition models and implementations

pytorch/fairseq

2 papers

29,233

Latest papers with no code

Most implemented Social Latest No code

Enhancing Unsupervised Speech Recognition with Diffusion GANs

no code yet • 23 Mar 2023

We enhance the vanilla adversarial training method for unsupervised Automatic Speech Recognition (ASR) by a diffusion-GAN.

Paper
Add Code

Simple and Effective Unsupervised Speech Translation

no code yet • 18 Oct 2022

The amount of labeled data to train models for speech tasks is limited for most languages, however, the data scarcity is exacerbated for speech translation which requires labeled data covering two different languages.

Paper
Add Code

Simple and Effective Unsupervised Speech Synthesis

no code yet • 6 Apr 2022

We introduce the first unsupervised speech synthesis system based on a simple, yet effective recipe.

Paper
Add Code

Analyzing the Robustness of Unsupervised Speech Recognition

no code yet • 7 Oct 2021

In this work, we further analyze the training robustness of unsupervised ASR on the domain mismatch scenarios in which the domains of unpaired speech and text are different.

Paper
Add Code

Dynamic Gradient Aggregation for Federated Domain Adaptation

no code yet • 14 Jun 2021

The proposed scheme is based on a weighted gradient aggregation using two-step optimization to offer a flexible training pipeline.

Paper
Add Code

Towards Unsupervised Speech Recognition and Synthesis with Quantized Speech Representation Learning

no code yet • 28 Oct 2019

In this paper we propose a Sequential Representation Quantization AutoEncoder (SeqRQ-AE) to learn from primarily unpaired audio data and produce sequences of representations very close to phoneme sequences of speech utterances.

Paper
Add Code

From Semi-supervised to Almost-unsupervised Speech Recognition with Very-low Resource by Jointly Learning Phonetic Structures from Audio and Text Embeddings

no code yet • 10 Apr 2019

However, we note human babies start to learn the language by the sounds (or phonetic structures) of a small number of exemplar words, and "generalize" such knowledge to other words without hearing a large amount of data.

Paper
Add Code

Completely Unsupervised Speech Recognition By A Generative Adversarial Network Harmonized With Iteratively Refined Hidden Markov Models

no code yet • 8 Apr 2019

Producing a large annotated speech corpus for training ASR systems remains difficult for more than 95% of languages all over the world which are low-resourced, but collecting a relatively big unlabeled data set for such languages is more achievable.

Paper
Add Code

Unsupervised Speech Recognition via Segmental Empirical Output Distribution Matching

no code yet • ICLR 2019

We consider the problem of training speech recognition systems without using any labeled data, under the assumption that the learner can only access to the input utterances and a phoneme language model estimated from a non-overlapping corpus.

Paper
Add Code

Almost-unsupervised Speech Recognition with Close-to-zero Resource Based on Phonetic Structures Learned from Very Small Unpaired Speech and Text Data

no code yet • 30 Oct 2018

This can be learned by aligning a small number of spoken words and the corresponding text words in the embedding spaces.

Paper
Add Code

Unsupervised Speech Recognition

Benchmarks Add a Result

Libraries

Latest papers with no code

Content

Benchmarks

Add a Result