Speaker Identification

61 papers with code • 4 benchmarks • 4 datasets

This task has no description! Would you like to contribute one?

Most implemented papers

On Learning Associations of Faces and Voices

changil/facevoice 15 May 2018

We computationally model the overlapping information between faces and voices and show that the learned cross-modal representation contains enough information to identify matching faces and voices with performance similar to that of humans.

Latent space representation for multi-target speaker detection and identification with a sparse dataset using Triplet neural networks

KinWaiCheuk/MCE2018 1 Oct 2019

When reducing the training data to only using the train set, our method results in 309 confusions for the Multi-target speaker identification task, which is 46% better than the baseline model.

Delving into VoxCeleb: environment invariant speaker recognition

theolepage/sslsv 24 Oct 2019

Research in speaker recognition has recently seen significant progress due to the application of neural network models and the availability of new large-scale datasets.

Improving speaker discrimination of target speech extraction with time-domain SpeakerBeam

jyhan03/channel-decorrelation 23 Jan 2020

First, we propose a time-domain implementation of SpeakerBeam similar to that proposed for a time-domain audio separation network (TasNet), which has achieved state-of-the-art performance for speech separation.

Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs

seongmin-kye/meta-SR 6 Apr 2020

By combining these two learning schemes, our model outperforms existing state-of-the-art speaker verification models learned with a standard supervised learning framework on short utterance (1-2 seconds) on the VoxCeleb datasets.

Identify Speakers in Cocktail Parties with End-to-End Attention

JunzheJosephZhu/Identify-Speakers-in-Cocktail-Parties-with-E2E-Attention 22 May 2020

In scenarios where multiple speakers talk at the same time, it is important to be able to identify the talkers accurately.

audino: A Modern Annotation Tool for Audio and Speech

midas-research/audino 9 Jun 2020

The tool allows audio data and their corresponding annotations to be uploaded and assigned to a user through a key-based API.

Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings

NaoyukiKanda/LibriSpeechMix 11 Aug 2020

However, the model required prior knowledge of speaker profiles to perform speaker identification, which significantly limited the application of the model.

Sum-Product Networks for Robust Automatic Speaker Identification

anicolson/SPN-ASI 13 Aug 2020

Though current SPN toolkits and learning algorithms are in their infancy, we aim to show that SPNs have the potential to become a useful tool for robust speech processing in the future.