no code implementations • 15 Apr 2024 • Yujia Yan, Zhiyao Duan
The neural semi-Markov Conditional Random Field (semi-CRF) framework has demonstrated promise for event-based piano transcription.
1 code implementation • 11 Mar 2023 • Ge Zhu, Yujia Yan, Juan-Pablo Caceres, Zhiyao Duan
Non-linguistic filler words, such as "uh" or "um", are prevalent in spontaneous speech and indicate hesitation or uncertainty.
Automatic Speech Recognition (ASR)
1 code implementation • NeurIPS 2021 • Yujia Yan, Frank Cwitkowitz, Zhiyao Duan
When formulating piano transcription in this way, we eliminate the need to rely on disjoint frame-level estimates for different stages of a note event.