no code implementations • 12 Jun 2023 • Anderson R. Avila, Mehdi Rezagholizadeh, Chao Xing
In this work, we investigate the impact of ASR error propagation on state-of-the-art NLU systems based on pre-trained language models (PLMs), such as BERT and RoBERTa.
Automatic Speech Recognition (ASR) +3
1 code implementation • ICCV 2023 • Xinlin Li, Bang Liu, Rui Heng Yang, Vanessa Courville, Chao Xing, Vahid Partovi Nia
We further propose a sign-scale decomposition design to enhance training efficiency and a low-variance random initialization strategy to improve the model's transfer learning performance.
no code implementations • 15 Jul 2022 • Anderson R. Avila, Khalil Bibi, Rui Heng Yang, Xinlin Li, Chao Xing, Xiao Chen
Deep neural networks (DNNs) have achieved impressive success in multiple domains.
no code implementations • 21 May 2022 • Abbas Ghaddar, Yimeng Wu, Sunyam Bagga, Ahmad Rashid, Khalil Bibi, Mehdi Rezagholizadeh, Chao Xing, Yasheng Wang, Duan Xinyu, Zhefeng Wang, Baoxing Huai, Xin Jiang, Qun Liu, Philippe Langlais
There is a growing body of work in recent years to develop pre-trained language models (PLMs) for the Arabic language.
1 code implementation • 8 Dec 2021 • Abbas Ghaddar, Yimeng Wu, Ahmad Rashid, Khalil Bibi, Mehdi Rezagholizadeh, Chao Xing, Yasheng Wang, Duan Xinyu, Zhefeng Wang, Baoxing Huai, Xin Jiang, Qun Liu, Philippe Langlais
Language-specific pre-trained models have proven to be more accurate than multilingual ones in monolingual evaluation settings, and Arabic is no exception.
no code implementations • 29 Sep 2021 • Chao Xing, Dong Wang, LiRong Dai, Qun Liu, Anderson Avila
Overparameterized transformer-based architectures have shown remarkable performance in recent years, achieving state-of-the-art results in speech processing tasks such as speech recognition, speech synthesis, keyword spotting, and speech enhancement, among others.
no code implementations • 20 May 2021 • Nihal Potdar, Anderson R. Avila, Chao Xing, Dong Wang, Yiran Cao, Xiao Chen
In this paper, we propose a streaming end-to-end framework that can process multiple intentions in an online and incremental way.
no code implementations • 17 Mar 2021 • Md Akmal Haidar, Chao Xing, Mehdi Rezagholizadeh
End-to-end automatic speech recognition (ASR), unlike conventional ASR, lacks dedicated modules for learning semantic representations from the speech encoder.
Ranked #12 on Speech Recognition on LibriSpeech test-clean
Automatic Speech Recognition (ASR) +3
2 code implementations • 6 Nov 2016 • Bin-Bin Gao, Chao Xing, Chen-Wei Xie, Jianxin Wu, Xin Geng
However, it is difficult to collect sufficient training images with precise labels in domains such as apparent age estimation, head pose estimation, multi-label classification, and semantic segmentation.
Ranked #1 on Head Pose Estimation on BJUT-3D
no code implementations • CVPR 2016 • Chao Xing, Xin Geng, Hui Xue
To learn this general model family, the paper employs a method called Logistic Boosting Regression (LogitBoost), which can be viewed as additive weighted function regression from a statistical standpoint.
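To make the additive-regression idea concrete, here is a minimal, self-contained sketch of boosting-style additive function regression: each stage fits a simple base learner (a regression stump, an assumption for illustration) to the current residuals and adds it to the ensemble with a learning-rate weight. This is not the paper's exact LogitBoost formulation for label distribution learning, only the general additive mechanism it builds on.

```python
def fit_stump(x, residuals):
    """Find the threshold split on x whose left/right means best fit the residuals."""
    best = None
    for t in sorted(set(x)):
        left = [r for xi, r in zip(x, residuals) if xi <= t]
        right = [r for xi, r in zip(x, residuals) if xi > t]
        if not left or not right:
            continue
        lm, rm = sum(left) / len(left), sum(right) / len(right)
        err = sum((r - lm) ** 2 for r in left) + sum((r - rm) ** 2 for r in right)
        if best is None or err < best[0]:
            best = (err, t, lm, rm)
    _, t, lm, rm = best
    return lambda xi, t=t, lm=lm, rm=rm: lm if xi <= t else rm

def boost(x, y, n_stages=50, lr=0.5):
    """Build an additive model F(x) = F0 + lr * sum of stage outputs."""
    f0 = sum(y) / len(y)
    stumps = []
    preds = [f0] * len(x)
    for _ in range(n_stages):
        residuals = [yi - p for yi, p in zip(y, preds)]
        stump = fit_stump(x, residuals)
        stumps.append(stump)
        preds = [p + lr * stump(xi) for p, xi in zip(preds, x)]
    return lambda xi: f0 + lr * sum(s(xi) for s in stumps)

# Toy 1-D data with a step-like target, purely for illustration.
x = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
y = [0.0, 0.0, 1.0, 1.0, 3.0, 3.0]
model = boost(x, y)
```

After enough stages the additive model fits the training targets closely, which is the behavior the weighted-regression view of LogitBoost relies on.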
no code implementations • 21 Apr 2016 • Qixin Wang, Tianyi Luo, Dong Wang, Chao Xing
Learning and generating Chinese poems is a charming yet challenging task.
no code implementations • 20 Oct 2015 • Lantian Li, Dong Wang, Chao Xing, Thomas Fang Zheng
Probabilistic linear discriminant analysis (PLDA) is a popular normalization approach for the i-vector model, and has delivered state-of-the-art performance in speaker recognition.
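As a rough illustration of how PLDA is used for verification scoring, the sketch below computes a two-covariance PLDA log-likelihood ratio for a pair of i-vectors, under simplifying assumptions (zero mean, diagonal between- and within-speaker covariances) that are not the paper's exact setup: the same-speaker hypothesis models each dimension of the pair with a correlated 2x2 Gaussian, the different-speaker hypothesis with independent Gaussians.

```python
import math

def plda_llr(w1, w2, sigma_b, sigma_w):
    """Two-covariance PLDA log-likelihood ratio for a trial pair.

    Assumes zero-mean i-vectors and per-dimension (diagonal)
    between-speaker variances sigma_b and within-speaker variances
    sigma_w -- a simplified, illustrative scoring rule.
    """
    score = 0.0
    for x, y, sb, sw in zip(w1, w2, sigma_b, sigma_w):
        st = sb + sw  # total variance per dimension
        # Same-speaker hypothesis: 2x2 Gaussian with off-diagonal sb.
        det_s = st * st - sb * sb
        q_s = (st * (x * x + y * y) - 2 * sb * x * y) / det_s
        ll_same = -0.5 * (math.log(det_s) + q_s)
        # Different-speaker hypothesis: the two i-vectors are independent.
        ll_diff = -0.5 * (2 * math.log(st) + (x * x + y * y) / st)
        score += ll_same - ll_diff
    return score

# Hypothetical trial: a close pair should outscore a mismatched pair.
sb, sw = [1.0, 1.0], [0.1, 0.1]
s_same = plda_llr([1.0, 0.5], [1.1, 0.4], sb, sw)
s_diff = plda_llr([1.0, 0.5], [-1.0, -0.6], sb, sw)
```

A positive score favors the same-speaker hypothesis; in practice the full PLDA model also estimates the covariances from data rather than assuming them.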
no code implementations • 20 Oct 2015 • Lantian Li, Dong Wang, Chao Xing, Kaimin Yu, Thomas Fang Zheng
The popular i-vector model represents speakers as low-dimensional continuous vectors (i-vectors), and hence provides a form of continuous speaker embedding.