Audio generation (synthesis) is the task of generating raw audio such as speech.
(Image credit: MelNet)
We introduce a new audio processing technique that increases the sampling rate of signals such as speech or music using deep convolutional neural networks.
Ranked #2 on Audio Super-Resolution on Voice Bank corpus (VCTK)
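To make "increasing the sampling rate" concrete, here is a minimal non-learned sketch that upsamples a signal by linear interpolation. This is not the paper's method: it is only the kind of naive baseline a convolutional network would be expected to improve on, and the function name `upsample_linear` is a hypothetical one chosen for illustration.

```python
def upsample_linear(signal, factor):
    """Upsample a list of samples by an integer factor using linear interpolation.

    Hypothetical helper for illustration only; a learned model would predict
    the missing high-resolution samples instead of interpolating them.
    """
    if factor < 1:
        raise ValueError("factor must be a positive integer")
    if not signal:
        return []
    out = []
    # Between each pair of neighbouring samples, emit `factor` evenly spaced points.
    for i in range(len(signal) - 1):
        a, b = signal[i], signal[i + 1]
        for k in range(factor):
            out.append(a + (b - a) * k / factor)
    # Keep the final original sample so the endpoints are preserved.
    out.append(signal[-1])
    return out


low_res = [0.0, 1.0, 0.0]
high_res = upsample_linear(low_res, 2)
```

Interpolation can only smooth between known samples, so it cannot restore high-frequency content lost at the lower sampling rate. That gap is what a learned super-resolution model aims to fill.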
In this paper we propose a novel model for unconditional audio generation based on generating one audio sample at a time.
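Generating one audio sample at a time can be sketched as an autoregressive loop in which each new sample is conditioned on the samples already produced. The sketch below is a toy under stated assumptions: `toy_next_sample` is a hypothetical stand-in (a fixed two-tap linear predictor plus noise) for whatever learned model the paper uses, and the 16 kHz rate and 440 Hz resonance are arbitrary illustrative values.

```python
import math
import random


def toy_next_sample(history, rng):
    """Hypothetical stand-in for a learned model.

    Predicts the next sample from the two previous ones (a damped resonator)
    and adds a small amount of Gaussian noise.
    """
    r, freq, sample_rate = 0.99, 440.0, 16000  # illustrative values only
    a1 = 2 * r * math.cos(2 * math.pi * freq / sample_rate)
    a2 = -r * r
    prev1 = history[-1] if len(history) >= 1 else 0.0
    prev2 = history[-2] if len(history) >= 2 else 0.0
    return a1 * prev1 + a2 * prev2 + rng.gauss(0.0, 0.01)


def generate(num_samples, seed=0):
    """Unconditional generation: start from nothing and append one sample per step."""
    rng = random.Random(seed)
    samples = []
    for _ in range(num_samples):
        # Each step sees everything generated so far.
        samples.append(toy_next_sample(samples, rng))
    return samples


audio = generate(16000)  # one second at the assumed 16 kHz rate
```

The loop makes the main cost of this approach visible: generation is sequential, so producing one second of audio takes as many model evaluations as there are samples.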
End-to-end models for raw audio generation are challenging to build, especially when they have to work with non-parallel data, which is a desirable setup in many situations.