Search Results for author: Shiyu Zhou

Found 16 papers, 3 papers with code

Real-time computational powered landing guidance using convex optimization and neural networks

no code implementations • 14 Oct 2022 • Zhipeng Shen, Shiyu Zhou, Jianglong Yu

Combining machine learning and convex optimization, this paper presents a real-time computational guidance method for the 6-degrees-of-freedom powered landing guidance problem.

Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection

1 code implementation • 30 Jan 2022 • Minglun Han, Linhao Dong, Zhenlin Liang, Meng Cai, Shiyu Zhou, Zejun Ma, Bo Xu

Nowadays, most methods in end-to-end contextual speech recognition bias the recognition process towards contextual knowledge.

Speech Recognition

OPT: Omni-Perception Pre-Trainer for Cross-Modal Understanding and Generation

2 code implementations • 1 Jul 2021 • Jing Liu, Xinxin Zhu, Fei Liu, Longteng Guo, Zijia Zhao, Mingzhen Sun, Weining Wang, Hanqing Lu, Shiyu Zhou, Jiajun Zhang, Jinqiao Wang

In this paper, we propose an Omni-perception Pre-Trainer (OPT) for cross-modal understanding and generation, by jointly modeling visual, text and audio resources.

Audio to Text Retrieval, Cross-Modal Retrieval +3

Efficiently Fusing Pretrained Acoustic and Linguistic Encoders for Low-resource Speech Recognition

no code implementations • 17 Jan 2021 • Cheng Yi, Shiyu Zhou, Bo Xu

In this work, we fuse a pre-trained acoustic encoder (wav2vec2.0) and a pre-trained linguistic encoder (BERT) into an end-to-end ASR model.

Automatic Speech Recognition (ASR) +2

Applying Wav2vec2.0 to Speech Recognition in Various Low-resource Languages

no code implementations • 22 Dec 2020 • Cheng Yi, Jianzhong Wang, Ning Cheng, Shiyu Zhou, Bo Xu

To verify its universality over languages, we apply pre-trained models to solve low-resource speech recognition tasks in various spoken languages.

Speech Recognition

CIF-based Collaborative Decoding for End-to-end Contextual Speech Recognition

no code implementations • 17 Dec 2020 • Minglun Han, Linhao Dong, Shiyu Zhou, Bo Xu

End-to-end (E2E) models have achieved promising results on multiple speech recognition benchmarks and have shown the potential to become mainstream.

Speech Recognition

Multi-output Gaussian Process Modulated Poisson Processes for Event Prediction

no code implementations • 6 Nov 2020 • Salman Jahani, Shiyu Zhou, Dharmaraj Veeramani, Jeff Schmidt

Prediction of events such as part replacement and failure events plays a critical role in reliability engineering.

Variational Inference

Unsupervised pre-training for sequence to sequence speech recognition

no code implementations • 28 Oct 2019 • Zhiyun Fan, Shiyu Zhou, Bo Xu

The unsupervised pre-training is performed on the AISHELL-2 dataset, and we apply the pre-trained model to multiple paired data ratios of AISHELL-1 and HKUST.

Automatic Speech Recognition (ASR) +3

Minimizing Negative Transfer of Knowledge in Multivariate Gaussian Processes: A Scalable and Regularized Approach

no code implementations • 31 Jan 2019 • Raed Kontar, Garvesh Raskutti, Shiyu Zhou

The proposed method has excellent scalability when the number of outputs is large and minimizes the negative transfer of knowledge between uncorrelated outputs.

Gaussian Processes

Multilingual End-to-End Speech Recognition with A Single Transformer on Low-Resource Languages

no code implementations • 12 Jun 2018 • Shiyu Zhou, Shuang Xu, Bo Xu

Experiments on CALLHOME datasets demonstrate that the multilingual ASR Transformer with the language symbol at the end performs better and obtains a relative 10.5% average word error rate (WER) reduction compared to SHL-MLSTM with residual learning.

Automatic Speech Recognition (ASR) +3

A Comparison of Modeling Units in Sequence-to-Sequence Speech Recognition with the Transformer on Mandarin Chinese

no code implementations • 16 May 2018 • Shiyu Zhou, Linhao Dong, Shuang Xu, Bo Xu

Experiments on HKUST datasets demonstrate that lexicon-free modeling units can outperform lexicon-related modeling units in terms of character error rate (CER).

Automatic Speech Recognition (ASR) +3

Syllable-Based Sequence-to-Sequence Speech Recognition with the Transformer in Mandarin Chinese

1 code implementation • 28 Apr 2018 • Shiyu Zhou, Linhao Dong, Shuang Xu, Bo Xu

Furthermore, we compare a syllable-based model with a context-independent phoneme (CI-phoneme) based model, both using the Transformer, in Mandarin Chinese.

Automatic Speech Recognition (ASR) +6
