Speaker Recognition

90 papers with code • 1 benchmarks • 6 datasets

Speaker Recognition is the process of identifying or confirming the identity of a person given his speech segments.

Source: Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition

Benchmarks

Add a Result

These leaderboards are used to track progress in Speaker Recognition

Trend	Dataset	Best Model	Paper	Code	Compare
	VoxCeleb1	WavLM+ECAPA-TDNN			See all

Libraries

Use these libraries to find Speaker Recognition models and implementations

s3prl/s3prl

2 papers

2,106

andi611/Self-Supervised-Speech-Pret…

2 papers

2,106

Jungjee/RawNet

2 papers

333

Datasets

Latest papers with no code

Most implemented Social Latest No code

Who is Authentic Speaker

no code yet • 30 Apr 2024

Therefore our experiments are geared towards recognising the source speakers given the converted voices, which are generated by using FragmentVC on the randomly paired utterances from source and target speakers.

Paper
Add Code

Certification of Speaker Recognition Models to Additive Perturbations

no code yet • 29 Apr 2024

In this paper, we pioneer applying robustness certification techniques to speaker recognition, originally developed for the image domain.

Paper
Add Code

TIMIT Speaker Profiling: A Comparison of Multi-task learning and Single-task learning Approaches

no code yet • 18 Apr 2024

This study employs deep learning techniques to explore four speaker profiling tasks on the TIMIT dataset, namely gender classification, accent classification, age estimation, and speaker identification, highlighting the potential and challenges of multi-task learning versus single-task models.

Paper
Add Code

Artificial Neural Networks to Recognize Speakers Division from Continuous Bengali Speech

no code yet • 18 Apr 2024

In this paper, we presented a method that will provide a speakers geographical identity in a certain region using continuous Bengali speech.

Paper
Add Code

Voice Conversion Augmentation for Speaker Recognition on Defective Datasets

no code yet • 1 Apr 2024

Our experimental results on three created datasets demonstrated that VCA-NN effectively mitigates these dataset problems, which provides a new direction for handling the speaker recognition problems from the data aspect.

Paper
Add Code

Asymmetric and trial-dependent modeling: the contribution of LIA to SdSV Challenge Task 2

no code yet • 28 Mar 2024

The SdSv challenge Task 2 provided an opportunity to assess efficiency and robustness of modern text-independent speaker verification systems.

Paper
Add Code

Cosine Scoring with Uncertainty for Neural Speaker Embedding

no code yet • 11 Mar 2024

Uncertainty modeling in speaker representation aims to learn the variability present in speech utterances.

Paper
Add Code

Post-Training Embedding Alignment for Decoupling Enrollment and Runtime Speaker Recognition Models

no code yet • 23 Jan 2024

Automated speaker identification (SID) is a crucial step for the personalization of a wide range of speech-enabled services.

Paper
Add Code

Voxceleb-ESP: preliminary experiments detecting Spanish celebrities from their voices

no code yet • 20 Dec 2023

This paper presents VoxCeleb-ESP, a collection of pointers and timestamps to YouTube videos facilitating the creation of a novel speaker recognition dataset.

Paper
Add Code

Vulnerability of Automatic Identity Recognition to Audio-Visual Deepfakes

no code yet • 29 Nov 2023

From the publicly available speech dataset LibriTTS, we also created a separate database of only audio deepfakes LibriTTS-DF using several latest text to speech methods: YourTTS, Adaspeech, and TorToiSe.

Paper
Add Code

Speaker Recognition

Benchmarks Add a Result

Libraries

Datasets

Latest papers with no code

Content

Benchmarks

Add a Result