1 code implementation • 23 May 2025 • Ziwei Zhou, Rui Wang, Zuxuan Wu
Recent Multimodal Large Language Models (MLLMs) achieve promising performance on visual and audio benchmarks independently.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
no code implementations • 27 Jan 2023 • Xingwu Guo, Ziwei Zhou, Yueling Zhang, Guy Katz, Min Zhang
The experimental results demonstrate our approach's effectiveness and efficiency in verifying DNNs' robustness against various occlusions, and its ability to generate counterexamples when these DNNs are not robust.
no code implementations • 16 Apr 2018 • Sowmya Vajjala, Ziwei Zhou
This paper describes our experiments with automatically identifying native accents from speech samples of non-native English speakers, using low-level audio features and n-gram features from manual transcriptions.