Browse > Speech > Speaker Diarization

Speaker Diarization

5 papers with code · Speech

State-of-the-art leaderboards

No evaluation results yet. Help compare methods by submit evaluation metrics.

Greatest papers with code

Fully Supervised Speaker Diarization

10 Oct 2018google/uis-rnn

In this paper, we propose a fully supervised speaker diarization approach, named unbounded interleaved-state recurrent neural networks (UIS-RNN).

SPEAKER DIARIZATION

AVA-ActiveSpeaker: An Audio-Visual Dataset for Active Speaker Detection

5 Jan 2019cvdfoundation/ava-dataset

The dataset contains temporally labeled face tracks in video, where each face instance is labeled as speaking or not, and whether the speech is audible.

SPEAKER DIARIZATION SPEECH ENHANCEMENT

Speaker Diarization with LSTM

28 Oct 2017wq2012/SpectralCluster

For many years, i-vector based audio embedding techniques were the dominant approach for speaker verification and speaker diarization applications.

SPEAKER DIARIZATION SPEAKER VERIFICATION

End-to-End Neural Speaker Diarization with Self-attention

13 Sep 2019hitachi-speech/EEND

Our method was even better than that of the state-of-the-art x-vector clustering-based method.

SPEAKER DIARIZATION

End-to-End Neural Speaker Diarization with Permutation-Free Objectives

12 Sep 2019hitachi-speech/EEND

To realize such a model, we formulate the speaker diarization problem as a multi-label classification problem, and introduces a permutation-free objective function to directly minimize diarization errors without being suffered from the speaker-label permutation problem.

DOMAIN ADAPTATION MULTI-LABEL CLASSIFICATION SPEAKER DIARIZATION