Audio Super-Resolution

14 papers with code • 4 benchmarks • 3 datasets

AUDIO SUPER-RESOLUTION or speech bandwidth extension (Upsampling Ratio = 2)

AudioSR: Versatile Audio Super-resolution at Scale

haoheliu/versatile_audio_super_resolution 13 Sep 2023

Audio super-resolution is a fundamental task that predicts high-frequency components for low-resolution audio, enhancing audio quality in digital applications.

875
13 Sep 2023

AERO: Audio Super Resolution in the Spectral Domain

slp-rl/aero 22 Nov 2022

We optimize the model using both time and frequency domain loss functions.

172
22 Nov 2022

Nonparallel High-Quality Audio Super Resolution with Domain Adaptation and Resampling CycleGANs

chomeyama/DualCycleGAN 28 Oct 2022

Neural audio super-resolution models are typically trained on low- and high-resolution audio signal pairs.

47
28 Oct 2022

CMGAN: Conformer-Based Metric-GAN for Monaural Speech Enhancement

ruizhecao96/cmgan 22 Sep 2022

Convolution-augmented transformers (Conformers) are recently proposed in various speech-domain applications, such as automatic speech recognition (ASR) and speech separation, as they can capture both local and global dependencies.

253
22 Sep 2022

NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates

maum-ai/nuwave 17 Jun 2022

Conventionally, audio super-resolution models fixed the initial and the target sampling rates, which necessitate the model to be trained for each pair of sampling rates.

274
17 Jun 2022

Neural Vocoder is All You Need for Speech Super-resolution

haoheliu/ssr_eval 28 Mar 2022

In this paper, we propose a neural vocoder based speech super-resolution method (NVSR) that can handle a variety of input resolution and upsampling ratios.

119
28 Mar 2022

Learning Continuous Representation of Audio for Arbitrary Scale Super Resolution

ml-postech/lisa 30 Oct 2021

To obtain a continuous representation of audio and enable super resolution for arbitrary scale factor, we propose a method of implicit neural representation, coined Local Implicit representation for Super resolution of Arbitrary scale (LISA).

12
30 Oct 2021

TUNet: A Block-online Bandwidth Extension Model based on Transformers and Self-supervised Pretraining

nxtproduct/tunet 26 Oct 2021

We introduce a block-online variant of the temporal feature-wise linear modulation (TFiLM) model to achieve bandwidth extension.

46
26 Oct 2021

Self-Attention for Audio Super-Resolution

ncarraz/AFILM 26 Aug 2021

Convolutions operate only locally, thus failing to model global interactions.

27
26 Aug 2021

NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling

maum-ai/nuwave 6 Apr 2021

In this work, we introduce NU-Wave, the first neural audio upsampling model to produce waveforms of sampling rate 48kHz from coarse 16kHz or 24kHz inputs, while prior works could generate only up to 16kHz.

274
06 Apr 2021