no code implementations • 2 Jul 2018 • Akihiro Kato, Tomi Kinnunen
The fundamental frequency (F0) represents pitch in speech that determines prosodic characteristics of speech and is needed in various tasks for speech analysis and synthesis.
no code implementations • 8 May 2018 • Akihiro Kato, Tomi Kinnunen
The latest prior research addresses this problem first as a frame-by-frame-classification problem followed by sequence tracking using deep neural network hidden Markov model (DNN-HMM) hybrid architecture.