Resynthesis

16 papers with code • 2 benchmarks • 2 datasets

Benchmarks

These leaderboards are used to track progress in Resynthesis

Dataset	Best Model
LJSpeech	CPC
LibriSpeech	CPC

Most implemented papers

textless-lib: a Library for Textless Spoken Language Processing

facebookresearch/textlesslib • NAACL (ACL) 2022

Textless spoken language processing research aims to extend the applicability of the standard NLP toolset to spoken language and to languages with few or no textual resources.

A Perceptual Measure for Evaluating the Resynthesis of Automatic Music Transcriptions

limunimi/perceptualevaluation • 24 Feb 2022

This study focuses on the perception of music performances when contextual factors, such as room acoustics and instrument, change.

Analysing Discrete Self Supervised Speech Representation for Spoken Language Modeling

slp-rl/slm-discrete-representations • 2 Jan 2023

Following the findings of such an analysis, we propose practical improvements to the discrete unit for the GSLM.

Speaker-Independent Acoustic-to-Articulatory Speech Inversion

articulatory/articulatory • 14 Feb 2023

To build speech processing methods that can handle speech as naturally as humans, researchers have explored multiple ways of building an invertible mapping from speech to an interpretable space.

Weakly-supervised Contrastive Learning for Unsupervised Object Discovery

npucvr/wscuod • 7 Jul 2023

Unsupervised object discovery (UOD) refers to the task of discriminating the whole region of objects from the background within a scene without relying on labeled datasets, which benefits the task of bounding-box-level localization and pixel-level segmentation.

EmphAssess : a Prosodic Benchmark on Assessing Emphasis Transfer in Speech-to-Speech Models

facebookresearch/emphassess • 21 Dec 2023

We introduce EmphAssess, a prosodic benchmark designed to evaluate the capability of speech-to-speech models to encode and reproduce prosodic emphasis.