Search Results for author: Shogo Seki

Found 4 papers, 1 papers with code

iSTFTNet2: Faster and More Lightweight iSTFT-Based Neural Vocoder Using 1D-2D CNN

no code implementations14 Aug 2023 Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Shogo Seki

Owing to the difficulty of a 1D CNN to model high-dimensional spectrograms, the frequency dimension is reduced via temporal upsampling.

Speech Synthesis

iSTFTNet: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform

1 code implementation4 Mar 2022 Takuhiro Kaneko, Kou Tanaka, Hirokazu Kameoka, Shogo Seki

In recent text-to-speech synthesis and voice conversion systems, a mel-spectrogram is commonly applied as an intermediate representation, and the necessity for a mel-spectrogram vocoder is increasing.

Speech Synthesis Text-To-Speech Synthesis +1

Cannot find the paper you are looking for? You can Submit a new open access paper.