Search Results for author: Weicheng Cai

Found 9 papers, 2 papers with code

A Unified Deep Speaker Embedding Framework for Mixed-Bandwidth Speech Data

no code implementations • 1 Dec 2020 • Weicheng Cai, Ming Li

This paper proposes a unified deep speaker embedding framework for modeling speech data with different sampling rates.

Bandwidth Extension • Image Classification

DIHARD II is Still Hard: Experimental Results and Discussions from the DKU-LENOVO Team

no code implementations • 23 Feb 2020 • Qingjian Lin, Weicheng Cai, Lin Yang, Jun-Jie Wang, Jun Zhang, Ming Li

Our diarization system includes multiple modules, namely voice activity detection (VAD), segmentation, speaker embedding extraction, similarity scoring, clustering, resegmentation and overlap detection.

Action Detection • Activity Detection • +1

The DKU Replay Detection System for the ASVspoof 2019 Challenge: On Data Augmentation, Feature Representation, Classification, and Fusion

no code implementations • 5 Jul 2019 • Weicheng Cai, Haiwei Wu, Danwei Cai, Ming Li

This paper describes our DKU replay detection system for the ASVspoof 2019 challenge.

Data Augmentation • General Classification • +1

Utterance-level end-to-end language identification using attention-based CNN-BLSTM

no code implementations • 20 Feb 2019 • Weicheng Cai, Danwei Cai, Shen Huang, Ming Li

In this paper, we present an end-to-end language identification framework, the attention-based Convolutional Neural Network-Bidirectional Long Short-Term Memory (CNN-BLSTM).

Language Identification

End-to-end Language Identification using NetFV and NetVLAD

1 code implementation • 9 Sep 2018 • Jinkun Chen, Weicheng Cai, Danwei Cai, Zexin Cai, Haibin Zhong, Ming Li

In this paper, we apply the NetFV and NetVLAD layers for the end-to-end language identification task.

Language Identification

Exploring the Encoding Layer and Loss Function in End-to-End Speaker and Language Recognition System

1 code implementation • 14 Apr 2018 • Weicheng Cai, Jinkun Chen, Ming Li

In the end-to-end system, the encoding layer plays a role in aggregating the variable-length input sequence into an utterance-level representation.

Speaker Verification

A Novel Learnable Dictionary Encoding Layer for End-to-End Language Identification

no code implementations • 2 Apr 2018 • Weicheng Cai, Zexin Cai, Xiang Zhang, Xiaoqi Wang, Ming Li

A novel learnable dictionary encoding layer is proposed in this paper for end-to-end language identification.

Language Identification

Insights into End-to-End Learning Scheme for Language Identification

no code implementations • 2 Apr 2018 • Weicheng Cai, Zexin Cai, Wenbo Liu, Xiaoqi Wang, Ming Li

After comparing with the state-of-the-art GMM i-vector methods, we give insights into CNN, and reveal its role and effect in the whole pipeline.

Language Identification

The SYSU System for the Interspeech 2015 Automatic Speaker Verification Spoofing and Countermeasures Challenge

no code implementations • 24 Jul 2015 • Shitao Weng, Shushan Chen, Lei Yu, Xuewei Wu, Weicheng Cai, Zhi Liu, Ming Li

To detect these spoofed speech signals as a countermeasure, we propose a score-level fusion approach with several different i-vector subsystems.

Speaker Verification • Speech Synthesis • +1
