Search Results for author: Shuaijiang Zhao

Found 7 papers, 2 papers with code

Technical Report: Competition Solution For BetterMixture

no code implementations20 Mar 2024 Shuaijiang Zhao, Xiaoquan Fang

In the era of flourishing large-scale models, the challenge of selecting and optimizing datasets from the vast and complex sea of data, to enhance the performance of large language models within the constraints of limited computational resources, has become paramount.

ChatHome: Development and Evaluation of a Domain-Specific Language Model for Home Renovation

1 code implementation28 Jul 2023 Cheng Wen, Xianghui Sun, Shuaijiang Zhao, Xiaoquan Fang, Liangyu Chen, Wei Zou

This paper presents the development and evaluation of ChatHome, a domain-specific language model (DSLM) designed for the intricate field of home renovation.

Language Modelling

GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio

2 code implementations13 Jun 2021 Guoguo Chen, Shuzhou Chai, Guanbo Wang, Jiayu Du, Wei-Qiang Zhang, Chao Weng, Dan Su, Daniel Povey, Jan Trmal, Junbo Zhang, Mingjie Jin, Sanjeev Khudanpur, Shinji Watanabe, Shuaijiang Zhao, Wei Zou, Xiangang Li, Xuchen Yao, Yongqing Wang, Yujun Wang, Zhao You, Zhiyong Yan

This paper introduces GigaSpeech, an evolving, multi-domain English speech recognition corpus with 10, 000 hours of high quality labeled audio suitable for supervised training, and 40, 000 hours of total audio suitable for semi-supervised and unsupervised training.

Sentence speech-recognition +1

DiDiSpeech: A Large Scale Mandarin Speech Corpus

no code implementations19 Oct 2020 Tingwei Guo, Cheng Wen, Dongwei Jiang, Ne Luo, Ruixiong Zhang, Shuaijiang Zhao, Wubo Li, Cheng Gong, Wei Zou, Kun Han, Xiangang Li

This paper introduces a new open-sourced Mandarin speech corpus, called DiDiSpeech.

Audio and Speech Processing

Towards End-to-End Code-Switching Speech Recognition

no code implementations31 Oct 2018 Ne Luo, Dongwei Jiang, Shuaijiang Zhao, Caixia Gong, Wei Zou, Xiangang Li

Code-switching speech recognition has attracted an increasing interest recently, but the need for expert linguistic knowledge has always been a big issue.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

A comparable study of modeling units for end-to-end Mandarin speech recognition

no code implementations10 May 2018 Wei Zou, Dongwei Jiang, Shuaijiang Zhao, Xiangang Li

We find that all types of modeling units can achieve approximate character error rate (CER) in CTC model and the performance of Chinese character attention model is better than syllable attention model.

speech-recognition Speech Recognition

Cannot find the paper you are looking for? You can Submit a new open access paper.