no code implementations • SIGDIAL (ACL) 2021 • Koshiro Okano, Yu Suzuki, Masaya Kawamura, Tsuneo Kato, Akihiro Tamura, Jianming Wu
Responses generated by neural conversational models (NCMs) for non-task-oriented systems are difficult to evaluate.
no code implementations • 15 Sep 2023 • Reo Shimizu, Ryuichi Yamamoto, Masaya Kawamura, Yuma Shirahata, Hironori Doi, Tatsuya Komatsu, Kentaro Tachibana
We propose PromptTTS++, a prompt-based text-to-speech (TTS) synthesis system that allows control over speaker identity using natural language descriptions.
1 code implementation • 28 Oct 2022 • Masaya Kawamura, Yuma Shirahata, Ryuichi Yamamoto, Kentaro Tachibana
We propose a lightweight end-to-end text-to-speech model using multi-band generation and inverse short-time Fourier transform.
no code implementations • 1 Feb 2022 • Masaya Kawamura, Tomohiko Nakamura, Daichi Kitamura, Hiroshi Saruwatari, Yu Takahashi, Kazunobu Kondo
A differentiable digital signal processing (DDSP) autoencoder is a musical sound synthesizer that combines a deep neural network (DNN) and spectral modeling synthesis.