Speaker Recognition

52 papers with code • 1 benchmarks • 5 datasets

Speaker Recognition is the process of identifying or confirming the identity of a person given his speech segments.

Source: Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition

Greatest papers with code

Filterbank design for end-to-end speech separation

mpariente/asteroid 23 Oct 2019

Also, we validate the use of parameterized filterbanks and show that complex-valued representations and masks are beneficial in all conditions.

Speaker Recognition Speech Separation

Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders

andi611/Self-Supervised-Speech-Pretraining-and-Representation-Learning 25 Oct 2019

We present Mockingjay as a new speech representation learning approach, where bidirectional Transformer encoders are pre-trained on a large amount of unlabeled speech.

Classification General Classification +3

Speech and Speaker Recognition from Raw Waveform with SincNet

mravanelli/SincNet 13 Dec 2018

Deep neural networks can learn complex and abstract representations, that are progressively obtained by combining simpler ones.

Speaker Recognition Speech Recognition

Speaker Recognition from Raw Waveform with SincNet

mravanelli/SincNet 29 Jul 2018

Rather than employing standard hand-crafted features, the latter CNNs learn low-level speech representations from waveforms, potentially allowing the network to better capture important narrow-band speaker characteristics such as pitch and formants.

Speaker Identification Speaker Recognition +1

Deep Speaker: an End-to-End Neural Speaker Embedding System

philipperemy/deep-speaker 5 May 2017

We present Deep Speaker, a neural speaker embedding system that maps utterances to a hypersphere where speaker similarity is measured by cosine similarity.

Speaker Identification Speaker Recognition

Clova Baseline System for the VoxCeleb Speaker Recognition Challenge 2020

clovaai/voxceleb_trainer 29 Sep 2020

This report describes our submission to the VoxCeleb Speaker Recognition Challenge (VoxSRC) at Interspeech 2020.

Speaker Recognition

VoxCeleb2: Deep Speaker Recognition

a-nagrani/VGGVox 14 Jun 2018

The objective of this paper is speaker recognition under noisy and unconstrained conditions.

 Ranked #1 on Speaker Verification on VoxCeleb2 (using extra training data)

Speaker Recognition Speaker Verification

Utterance-level Aggregation For Speaker Recognition In The Wild

taylorlu/Speaker-Diarization 26 Feb 2019

The objective of this paper is speaker recognition "in the wild"-where utterances may be of variable length and also contain irrelevant signals.

Speaker Recognition Text-Independent Speaker Verification

AutoSpeech: Neural Architecture Search for Speaker Recognition

TAMU-VITA/AutoSpeech 7 May 2020

Speaker recognition systems based on Convolutional Neural Networks (CNNs) are often built with off-the-shelf backbones such as VGG-Net or ResNet.

Image Classification Neural Architecture Search +3