no code implementations • 10 Apr 2024 • Kuo-Yu Liao, Cheng-Shang Chang, Y. -W. Peter Hong
Using density evolution analysis, we demonstrate the emergence of learned skills when the ratio of the size of training texts to the number of skills exceeds a certain threshold.
no code implementations • 5 Jun 2021 • Yu-Lin Huang, Bo-Hao Su, Y. -W. Peter Hong, Chi-Chun Lee
Specifically, we propose a layered-representation variational autoencoder (LR-VAE), which factorizes speech representation into attribute-sensitive nodes, to derive an identity-free representation for speech emotion recognition (SER), and an emotionless representation for speaker verification (SV).