Browse > Audio > Audio Generation

Audio Generation

12 papers with code · Audio

Audio generation (synthesis) is the task of generating raw audio such as speech.

State-of-the-art leaderboards

Latest papers with code

MelNet: A Generative Model for Audio in the Frequency Domain

4 Jun 2019fatchord/MelNet

Capturing high-level structure in audio waveforms is challenging because a single second of audio spans tens of thousands of timesteps.

AUDIO GENERATION MUSIC GENERATION SPEECH SYNTHESIS TEXT-TO-SPEECH SYNTHESIS

169
04 Jun 2019

Blow: a single-scale hyperconditioned flow for non-parallel raw-audio voice conversion

3 Jun 2019liusongxiang/StarGAN-Voice-Conversion

End-to-end models for raw audio generation are a challenge, specially if they have to work with non-parallel data, which is a desirable setup in many situations.

AUDIO GENERATION VOICE CONVERSION

162
03 Jun 2019

Assisted Sound Sample Generation with Musical Conditioning in Adversarial Auto-Encoders

12 Apr 2019acids-ircam/Expressive_WAE_FADER

Its training data subsets can directly be visualized in the 3D latent representation.

AUDIO GENERATION

2
12 Apr 2019

GANSynth: Adversarial Neural Audio Synthesis

ICLR 2019 tensorflow/magenta

Efficient audio synthesis is an inherently difficult machine learning task, as human perception is sensitive to both global structure and fine-scale waveform coherence.

AUDIO GENERATION

14,068
23 Feb 2019

Conditional WaveGAN

27 Sep 2018acheketa/cwavegan

Generative models are successfully used for image synthesis in the recent years.

AUDIO GENERATION

81
27 Sep 2018

Smoothed Dilated Convolutions for Improved Dense Prediction

27 Aug 2018divelab/dilated

Unlike existing models, which explore solutions by focusing on a block of cascaded dilated convolutional layers, our methods address the gridding artifacts by smoothing the dilated convolution itself.

AUDIO GENERATION MACHINE TRANSLATION OBJECT DETECTION SEMANTIC SEGMENTATION

57
27 Aug 2018

Generative timbre spaces: regularizing variational auto-encoders with perceptual metrics

Conference 2018 acids-ircam/variational-timbre

Based on this, we introduce a method for descriptor-based synthesis and show that we can control the descriptors of an instrument while keeping its timbre structure.

AUDIO CLASSIFICATION AUDIO GENERATION MUSIC INFORMATION RETRIEVAL MUSIC MODELING

20
22 May 2018

Generating Long Sequences with Sparse Transformers

Preprint 2019 openai/sparse_attention

Transformers are powerful sequence models, but require time and memory that grows quadratically with the sequence length.

AUDIO GENERATION IMAGE GENERATION LANGUAGE MODELLING

860
23 Apr 2018

Adversarial Audio Synthesis

ICLR 2019 chrisdonahue/wavegan

Audio signals are sampled at high temporal resolutions, and learning to synthesize audio requires capturing structure across a range of timescales.

AUDIO GENERATION IMAGE GENERATION

660
12 Feb 2018

Audio Super Resolution using Neural Networks

2 Aug 2017kuleshov/audio-super-res

We introduce a new audio processing technique that increases the sampling rate of signals such as speech or music using deep convolutional neural networks.

AUDIO SUPER-RESOLUTION

256
02 Aug 2017