Search Results for author: Shen Huang

Found 12 papers, 1 papers with code

CSP:Code-Switching Pre-training for Neural Machine Translation

no code implementations EMNLP 2020 Zhen Yang, Bojie Hu, Ambyera Han, Shen Huang, Qi Ju

Unlike traditional pre-training method which randomly masks some fragments of the input sentence, the proposed CSP randomly replaces some words in the source sentence with their translation words in the target language.

Machine Translation Translation

Stacked Acoustic-and-Textual Encoding: Integrating the Pre-trained Models into Speech Translation Encoders

no code implementations ACL 2021 Chen Xu, Bojie Hu, Yanyang Li, Yuhao Zhang, Shen Huang, Qi Ju, Tong Xiao, Jingbo Zhu

To our knowledge, we are the first to develop an end-to-end ST system that achieves comparable or even better BLEU performance than the cascaded ST counterpart when large-scale ASR and MT data is available.

Automatic Speech Recognition Knowledge Distillation +2

Code-switching pre-training for neural machine translation

no code implementations17 Sep 2020 Zhen Yang, Bojie Hu, Ambyera Han, Shen Huang, Qi Ju

Unlike traditional pre-training method which randomly masks some fragments of the input sentence, the proposed CSP randomly replaces some words in the source sentence with their translation words in the target language.

Machine Translation Translation

Cognitive Representation Learning of Self-Media Online Article Quality

no code implementations13 Aug 2020 Yiru Wang, Shen Huang, Gongfu Li, Qiang Deng, Dongliang Liao, Pengda Si, Yujiu Yang, Jin Xu

The automatic quality assessment of self-media online articles is an urgent and new issue, which is of great value to the online recommendation and search.

Representation Learning

Utterance-level end-to-end language identification using attention-based CNN-BLSTM

no code implementations20 Feb 2019 Weicheng Cai, Danwei Cai, Shen Huang, Ming Li

In this paper, we present an end-to-end language identification framework, the attention-based Convolutional Neural Network-Bidirectional Long-short Term Memory (CNN-BLSTM).

Language Identification

TencentFmRD Neural Machine Translation for WMT18

no code implementations WS 2018 Bojie Hu, Ambyer Han, Shen Huang

Our systems are neural machine translation systems trained with our original system TenTrans.

Machine Translation Translation

Addressing Domain Adaptation for Chinese Word Segmentation with Global Recurrent Structure

no code implementations IJCNLP 2017 Shen Huang, Xu sun, Houfeng Wang

Boundary features are widely used in traditional Chinese Word Segmentation (CWS) methods as they can utilize unlabeled data to help improve the Out-of-Vocabulary (OOV) word recognition performance.

Chinese Word Segmentation Domain Adaptation +1

Bi-LSTM Neural Networks for Chinese Grammatical Error Diagnosis

no code implementations WS 2016 Shen Huang, Houfeng Wang

Grammatical Error Diagnosis for Chinese has always been a challenge for both foreign learners and NLP researchers, for the variousity of grammar and the flexibility of expression.

Grammatical Error Detection Word Embeddings

Cannot find the paper you are looking for? You can Submit a new open access paper.