Search Results for author: Chao Xing

Found 14 papers, 3 papers with code

Multimodal Audio-textual Architecture for Robust Spoken Language Understanding

no code implementations 12 Jun 2023 Anderson R. Avila, Mehdi Rezagholizadeh, Chao Xing

In this work, we investigate the impact of ASR error propagation on state-of-the-art NLU systems based on pre-trained language models (PLMs), such as BERT and RoBERTa.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

DenseShift: Towards Accurate and Efficient Low-Bit Power-of-Two Quantization

1 code implementation ICCV 2023 Xinlin Li, Bang Liu, Rui Heng Yang, Vanessa Courville, Chao Xing, Vahid Partovi Nia

We further propose a sign-scale decomposition design to enhance training efficiency and a low-variance random initialization strategy to improve the model's transfer learning performance.

Quantization Transfer Learning

JABER and SABER: Junior and Senior Arabic BERt

1 code implementation 8 Dec 2021 Abbas Ghaddar, Yimeng Wu, Ahmad Rashid, Khalil Bibi, Mehdi Rezagholizadeh, Chao Xing, Yasheng Wang, Duan Xinyu, Zhefeng Wang, Baoxing Huai, Xin Jiang, Qun Liu, Philippe Langlais

Language-specific pre-trained models have proven to be more accurate than multilingual ones in a monolingual evaluation setting, and Arabic is no exception.

Language Modelling NER

Speech-MLP: a simple MLP architecture for speech processing

no code implementations 29 Sep 2021 Chao Xing, Dong Wang, LiRong Dai, Qun Liu, Anderson Avila

Overparameterized transformer-based architectures have shown remarkable performance in recent years, achieving state-of-the-art results in speech processing tasks such as speech recognition, speech synthesis, keyword spotting, and speech enhancement.

Keyword Spotting Speech Enhancement +3

A Streaming End-to-End Framework For Spoken Language Understanding

no code implementations 20 May 2021 Nihal Potdar, Anderson R. Avila, Chao Xing, Dong Wang, Yiran Cao, Xiao Chen

In this paper, we propose a streaming end-to-end framework that can process multiple intentions in an online and incremental way.

Intent Detection Keyword Spotting +3

Deep Label Distribution Learning with Label Ambiguity

2 code implementations 6 Nov 2016 Bin-Bin Gao, Chao Xing, Chen-Wei Xie, Jianxin Wu, Xin Geng

However, it is difficult to collect sufficient training images with precise labels in some domains such as apparent age estimation, head pose estimation, multi-label classification and semantic segmentation.

Age Estimation Classification +4

Logistic Boosting Regression for Label Distribution Learning

no code implementations CVPR 2016 Chao Xing, Xin Geng, Hui Xue

To learn this general model family, this paper uses a method called Logistic Boosting Regression (LogitBoost), which can be seen as an additive weighted function regression from the statistical viewpoint.

Age Estimation Facial Expression Recognition +3

Max-margin Metric Learning for Speaker Recognition

no code implementations 20 Oct 2015 Lantian Li, Dong Wang, Chao Xing, Thomas Fang Zheng

Probabilistic linear discriminant analysis (PLDA) is a popular normalization approach for the i-vector model, and has delivered state-of-the-art performance in speaker recognition.

Metric Learning Speaker Recognition

Binary Speaker Embedding

no code implementations 20 Oct 2015 Lantian Li, Dong Wang, Chao Xing, Kaimin Yu, Thomas Fang Zheng

The popular i-vector model represents speakers as low-dimensional continuous vectors (i-vectors), and hence is a form of continuous speaker embedding.

Binarization Speaker Verification
