Browse > Audio > Audio Generation

Audio Generation

16 papers with code ยท Audio

Audio generation (synthesis) is the task of generating raw audio such as speech.

( Image credit: MelNet )

Leaderboards

Latest papers without code

High-Fidelity Audio Generation and Representation Learning with Guided Adversarial Autoencoder

1 Jun 2020

Hence, with the extensive experimental results, we have demonstrated that by harnessing the power of the high-fidelity audio generation, the proposed GAAE model can learn powerful representation from unlabelled dataset leveraging a fewer percentage of labelled data as supervision/guidance.

AUDIO GENERATION REPRESENTATION LEARNING

Cross-modal variational inference for bijective signal-symbol translation

10 Feb 2020

Extraction of symbolic information from signals is an active field of research enabling numerous applications especially in the Musical Information Retrieval domain.

AUDIO GENERATION DENSITY ESTIMATION INFORMATION RETRIEVAL

FastWave: Accelerating Autoregressive Convolutional Neural Networks on FPGA

9 Feb 2020

While WaveNet produces state-of-the art audio generation results, the naive inference implementation is quite slow; it takes a few minutes to generate just one second of audio on a high-end GPU.

AUDIO GENERATION LANGUAGE MODELLING MACHINE TRANSLATION

Score and Lyrics-Free Singing Voice Generation

ICLR 2020

Generative models for singing voice have been mostly concerned with the task of "singing voice synthesis," i. e., to produce singing voice waveforms given musical scores and text lyrics.

AUDIO GENERATION

Progressive Upsampling Audio Synthesis via Effective Adversarial Training

ICLR 2020

This paper proposes a novel generative model called PUGAN, which progressively synthesizes high-quality audio in a raw waveform.

AUDIO GENERATION

Score and Lyrics-Free Singing Voice Generation

ICLR 2020

Generative models for singing voice have been mostly concerned with the task of "singing voice synthesis," i. e., to produce singing voice waveforms given musical scores and text lyrics.

AUDIO GENERATION

Music Source Separation in the Waveform Domain

ICLR 2020

Experiments on the MusDB dataset show that Demucs beats previously reported results in terms of signal to distortion ratio (SDR), but lower than Conv-Tasnet.

AUDIO GENERATION MUSIC SOURCE SEPARATION

Transferring neural speech waveform synthesizers to musical instrument sounds generation

27 Oct 2019

Recent neural waveform synthesizers such as WaveNet, WaveGlow, and the neural-source-filter (NSF) model have shown good performance in speech synthesis despite their different methods of waveform generation.

AUDIO GENERATION SPEECH SYNTHESIS ZERO-SHOT LEARNING

Adversarial Audio Synthesis

ICLR 2019

While Generative Adversarial Networks (GANs) have seen wide success at the problem of synthesizing realistic images, they have seen little application to audio generation.

AUDIO GENERATION