Audio Super-Resolution

14 papers with code • 4 benchmarks • 3 datasets

AUDIO SUPER-RESOLUTION or speech bandwidth extension (Upsampling Ratio = 2)

Benchmarks

Add a Result

These leaderboards are used to track progress in Audio Super-Resolution

Dataset	Best Model	Compare
VCTK Multi-Speaker	CMGAN	See all
Voice Bank corpus (VCTK)	U-Net + AFiLM	See all
Piano	U-Net + AFiLM	See all
DSD100	U-Net and ResNet	See all

Datasets

Latest papers

Most implemented Social Latest No code

AudioSR: Versatile Audio Super-resolution at Scale

haoheliu/versatile_audio_super_resolution • • 13 Sep 2023

Audio super-resolution is a fundamental task that predicts high-frequency components for low-resolution audio, enhancing audio quality in digital applications.

875

13 Sep 2023

Paper
Code

AERO: Audio Super Resolution in the Spectral Domain

slp-rl/aero • • 22 Nov 2022

We optimize the model using both time and frequency domain loss functions.

172

22 Nov 2022

Paper
Code

Nonparallel High-Quality Audio Super Resolution with Domain Adaptation and Resampling CycleGANs

chomeyama/DualCycleGAN • • 28 Oct 2022

Neural audio super-resolution models are typically trained on low- and high-resolution audio signal pairs.

28 Oct 2022

Paper
Code

CMGAN: Conformer-Based Metric-GAN for Monaural Speech Enhancement

ruizhecao96/cmgan • • 22 Sep 2022

Convolution-augmented transformers (Conformers) are recently proposed in various speech-domain applications, such as automatic speech recognition (ASR) and speech separation, as they can capture both local and global dependencies.

253

22 Sep 2022

Paper
Code

NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates

maum-ai/nuwave • • 17 Jun 2022

Conventionally, audio super-resolution models fixed the initial and the target sampling rates, which necessitate the model to be trained for each pair of sampling rates.

274

17 Jun 2022

Paper
Code

Neural Vocoder is All You Need for Speech Super-resolution

haoheliu/ssr_eval • • 28 Mar 2022

In this paper, we propose a neural vocoder based speech super-resolution method (NVSR) that can handle a variety of input resolution and upsampling ratios.

119

28 Mar 2022

Paper
Code

Learning Continuous Representation of Audio for Arbitrary Scale Super Resolution

ml-postech/lisa • • 30 Oct 2021

To obtain a continuous representation of audio and enable super resolution for arbitrary scale factor, we propose a method of implicit neural representation, coined Local Implicit representation for Super resolution of Arbitrary scale (LISA).

30 Oct 2021

Paper
Code

TUNet: A Block-online Bandwidth Extension Model based on Transformers and Self-supervised Pretraining

nxtproduct/tunet • • 26 Oct 2021

We introduce a block-online variant of the temporal feature-wise linear modulation (TFiLM) model to achieve bandwidth extension.

26 Oct 2021

Paper
Code

Self-Attention for Audio Super-Resolution

ncarraz/AFILM • • 26 Aug 2021

Convolutions operate only locally, thus failing to model global interactions.

26 Aug 2021

Paper
Code

NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling

maum-ai/nuwave • • 6 Apr 2021

In this work, we introduce NU-Wave, the first neural audio upsampling model to produce waveforms of sampling rate 48kHz from coarse 16kHz or 24kHz inputs, while prior works could generate only up to 16kHz.

274

06 Apr 2021

Paper
Code

Audio Super-Resolution

Benchmarks Add a Result

Datasets

Latest papers

Content

Benchmarks

Add a Result