Search Results for author: Weicheng Cai

Found 9 papers, 2 papers with code

A Unified Deep Speaker Embedding Framework for Mixed-Bandwidth Speech Data

no code implementations1 Dec 2020 Weicheng Cai, Ming Li

This paper proposes a unified deep speaker embedding framework for modeling speech data with different sampling rates.

Bandwidth Extension Image Classification

DIHARD II is Still Hard: Experimental Results and Discussions from the DKU-LENOVO Team

no code implementations23 Feb 2020 Qingjian Lin, Weicheng Cai, Lin Yang, Jun-Jie Wang, Jun Zhang, Ming Li

Our diarization system includes multiple modules, namely voice activity detection (VAD), segmentation, speaker embedding extraction, similarity scoring, clustering, resegmentation and overlap detection.

Action Detection Activity Detection +1

Utterance-level end-to-end language identification using attention-based CNN-BLSTM

no code implementations20 Feb 2019 Weicheng Cai, Danwei Cai, Shen Huang, Ming Li

In this paper, we present an end-to-end language identification framework, the attention-based Convolutional Neural Network-Bidirectional Long-short Term Memory (CNN-BLSTM).

Language Identification

End-to-end Language Identification using NetFV and NetVLAD

1 code implementation9 Sep 2018 Jinkun Chen, Weicheng Cai, Danwei Cai, Zexin Cai, Haibin Zhong, Ming Li

In this paper, we apply the NetFV and NetVLAD layers for the end-to-end language identification task.

Language Identification

Exploring the Encoding Layer and Loss Function in End-to-End Speaker and Language Recognition System

1 code implementation14 Apr 2018 Weicheng Cai, Jinkun Chen, Ming Li

In the end-to-end system, the encoding layer plays a role in aggregating the variable-length input sequence into an utterance level representation.

Speaker Verification

A Novel Learnable Dictionary Encoding Layer for End-to-End Language Identification

no code implementations2 Apr 2018 Weicheng Cai, Zexin Cai, Xiang Zhang, Xiaoqi Wang, Ming Li

A novel learnable dictionary encoding layer is proposed in this paper for end-to-end language identification.

Language Identification

Insights into End-to-End Learning Scheme for Language Identification

no code implementations2 Apr 2018 Weicheng Cai, Zexin Cai, Wenbo Liu, Xiaoqi Wang, Ming Li

After comparing with the state-of-the-art GMM i-vector methods, we give insights into CNN, and reveal its role and effect in the whole pipeline.

Language Identification

The SYSU System for the Interspeech 2015 Automatic Speaker Verification Spoofing and Countermeasures Challenge

no code implementations24 Jul 2015 Shitao Weng, Shushan Chen, Lei Yu, Xuewei Wu, Weicheng Cai, Zhi Liu, Ming Li

In order to detect these spoofed speech signals as a countermeasure, we propose a score level fusion approach with several different i-vector subsystems.

Speaker Verification Speech Synthesis +1

Cannot find the paper you are looking for? You can Submit a new open access paper.