no code implementations • 24 Jun 2020 • Chelhwon Kim, Andrew Port, Mitesh Patel
Further, we discover that the distance preservation constraint in the generative adversarial model leads to reduced diversity in the translated audio samples, and propose the use of an auxiliary discriminator to enhance the diversity of the translations while using the distance preservation constraint.
no code implementations • 27 May 2020 • Andrew Port, Chelhwon Kim, Mitesh Patel
A generative adversarial network (GAN) is then used to find a distance preserving map from this metric space of feature vectors into the metric space defined by a target audio dataset equipped with either the Euclidean metric or a mel-frequency cepstrum-based psychoacoustic distance metric.