Speaker Verification

169 papers with code • 5 benchmarks • 6 datasets

Speaker verification is the verifying the identity of a person from characteristics of the voice.

( Image credit: Contrastive-Predictive-Coding-PyTorch )

Libraries

Use these libraries to find Speaker Verification models and implementations

Most implemented papers

Generalized End-to-End Loss for Speaker Verification

CorentinJ/Real-Time-Voice-Cloning 28 Oct 2017

In this paper, we propose a new loss function called generalized end-to-end (GE2E) loss, which makes the training of speaker verification models more efficient than our previous tuple-based end-to-end (TE2E) loss function.

Speaker Recognition from Raw Waveform with SincNet

mravanelli/SincNet 29 Jul 2018

Rather than employing standard hand-crafted features, the latter CNNs learn low-level speech representations from waveforms, potentially allowing the network to better capture important narrow-band speaker characteristics such as pitch and formants.

ECAPA-TDNN: Emphasized Channel Attention, Propagation and Aggregation in TDNN Based Speaker Verification

PaddlePaddle/PaddleSpeech 10 Aug 2020

The successful x-vector architecture is a Time Delay Neural Network (TDNN) that applies statistics pooling to project variable-length utterances into fixed-length speaker characterizing embeddings.

Text-Independent Speaker Verification Using 3D Convolutional Neural Networks

astorfi/3D-convolutional-speaker-recognition 26 May 2017

In our paper, we propose an adaptive feature learning by utilizing the 3D-CNNs for direct speaker model creation in which, for both development and enrollment phases, an identical number of spoken utterances per speaker is fed to the network for representing the speakers' utterances and creation of the speaker model.

An Unsupervised Autoregressive Model for Speech Representation Learning

iamyuanchung/Autoregressive-Predictive-Coding 5 Apr 2019

This paper proposes a novel unsupervised autoregressive neural model for learning generic speech representations.

Defense for Black-box Attacks on Anti-spoofing Models by Self-Supervised Learning

andi611/Self-Supervised-Speech-Pretraining-and-Representation-Learning 5 Jun 2020

To explore this issue, we proposed to employ Mockingjay, a self-supervised learning based model, to protect anti-spoofing models against adversarial attacks in the black-box scenario.

Speaker Diarization with LSTM

wq2012/SpectralCluster 28 Oct 2017

For many years, i-vector based audio embedding techniques were the dominant approach for speaker verification and speaker diarization applications.

ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection

lixucuhk/ASV-anti-spoofing-with-Res2Net 14 Apr 2019

ASVspoof, now in its third edition, is a series of community-led challenges which promote the development of countermeasures to protect automatic speaker verification (ASV) from the threat of spoofing.

RawNet: Advanced end-to-end deep neural network using raw waveforms for text-independent speaker verification

Jungjee/RawNet 17 Apr 2019

In this study, we explore end-to-end deep neural networks that input raw waveforms to improve various aspects: front-end speaker embedding extraction including model architecture, pre-training scheme, additional objective functions, and back-end classification.