Search Results for author: Shogo Seki

Found 4 papers, 1 paper with code

iSTFTNet2: Faster and More Lightweight iSTFT-Based Neural Vocoder Using 1D-2D CNN

no code implementations • 14 Aug 2023 • Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Shogo Seki

Owing to the difficulty of a 1D CNN to model high-dimensional spectrograms, the frequency dimension is reduced via temporal upsampling.

Speech Synthesis

Wave-U-Net Discriminator: Fast and Lightweight Discriminator for Generative Adversarial Network-Based Speech Synthesis

no code implementations • 24 Mar 2023 • Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Shogo Seki

This architecture provides a generator with sufficiently rich information for the synthesized speech to be closely matched to the real speech.

Generative Adversarial Network, Speech Synthesis

iSTFTNet: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform

1 code implementation • 4 Mar 2022 • Takuhiro Kaneko, Kou Tanaka, Hirokazu Kameoka, Shogo Seki

In recent text-to-speech synthesis and voice conversion systems, a mel-spectrogram is commonly applied as an intermediate representation, and the necessity for a mel-spectrogram vocoder is increasing.

Speech Synthesis, Text-To-Speech Synthesis

Generalized Multichannel Variational Autoencoder for Underdetermined Source Separation

no code implementations • 29 Sep 2018 • Shogo Seki, Hirokazu Kameoka, Li Li, Tomoki Toda, Kazuya Takeda

This paper deals with a multichannel audio source separation problem under underdetermined conditions.

Audio Source Separation
