Search Results for author: Zhixian Zhao

Found 3 papers, 3 papers with code

Steering Language Model to Stable Speech Emotion Recognition via Contextual Perception and Chain of Thought

1 code implementation • 25 Feb 2025 • Zhixian Zhao, Xinfa Zhu, Xinsheng Wang, Shuiyuan Wang, Xuelong Geng, Wenjie Tian, Lei Xie

Large-scale audio language models (ALMs), such as Qwen2-Audio, are capable of comprehending diverse audio signals, performing audio analysis, and generating textual responses.

Language Modeling, Language Modelling, +1

OSUM: Advancing Open Speech Understanding Models with Limited Resources in Academia

1 code implementation • 23 Jan 2025 • Xuelong Geng, Kun Wei, Qijie Shao, Shuiyun Liu, Zhennan Lin, Zhixian Zhao, Guojian Li, Wenjie Tian, Peikun Chen, Yangze Li, Pengcheng Guo, Mingchen Shao, Shuiyuan Wang, Yuang Cao, Chengyou Wang, Tianyi Xu, Yuhang Dai, Xinfa Zhu, Yue Li, Li Zhang, Lei Xie

Large Language Models (LLMs) have made significant progress in various downstream tasks, inspiring the development of Speech Understanding Language Models (SULMs) to enable comprehensive speech-based interactions.

Event Detection, Gender Classification, +3
