Search Results for author: Masaya Kawamura

Found 4 papers, 1 paper with code

Contrastive Response Pairs for Automatic Evaluation of Non-task-oriented Neural Conversational Models

no code implementations • SIGDIAL (ACL) 2021 • Koshiro Okano, Yu Suzuki, Masaya Kawamura, Tsuneo Kato, Akihiro Tamura, Jianming Wu

Responses generated by neural conversational models (NCMs) for non-task-oriented systems are difficult to evaluate.

PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-to-Speech Using Natural Language Descriptions

no code implementations • 15 Sep 2023 • Reo Shimizu, Ryuichi Yamamoto, Masaya Kawamura, Yuma Shirahata, Hironori Doi, Tatsuya Komatsu, Kentaro Tachibana

We propose PromptTTS++, a prompt-based text-to-speech (TTS) synthesis system that allows control over speaker identity using natural language descriptions.

Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform

1 code implementation • 28 Oct 2022 • Masaya Kawamura, Yuma Shirahata, Ryuichi Yamamoto, Kentaro Tachibana

We propose a lightweight end-to-end text-to-speech model using multi-band generation and inverse short-time Fourier transform.

Knowledge Distillation

Differentiable Digital Signal Processing Mixture Model for Synthesis Parameter Extraction from Mixture of Harmonic Sounds

no code implementations • 1 Feb 2022 • Masaya Kawamura, Tomohiko Nakamura, Daichi Kitamura, Hiroshi Saruwatari, Yu Takahashi, Kazunobu Kondo

A differentiable digital signal processing (DDSP) autoencoder is a musical sound synthesizer that combines a deep neural network (DNN) and spectral modeling synthesis.

Audio Source Separation
