pyannote.audio: neural building blocks for speaker diarization

We introduce pyannote.audio, an open-source toolkit written in Python for speaker diarization. Based on the PyTorch machine learning framework, it provides a set of trainable end-to-end neural building blocks that can be combined and jointly optimized to build speaker diarization pipelines. pyannote.audio also comes with pre-trained models covering a wide range of domains for voice activity detection, speaker change detection, overlapped speech detection, and speaker embedding, reaching state-of-the-art performance for most of them.
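
As a rough illustration of how these building blocks end up being used, the minimal sketch below applies a pretrained speaker diarization pipeline to an audio file. It assumes a recent pyannote.audio release; the pipeline identifier, any authentication requirements, and the file name "audio.wav" are placeholders and may differ from the models released with the paper.

```python
# Minimal sketch: running a pretrained pyannote.audio diarization pipeline.
# Assumes a recent pyannote.audio release; the pipeline name and "audio.wav"
# are placeholders.
from pyannote.audio import Pipeline

# Load a pretrained speaker diarization pipeline (internally it chains
# building blocks such as voice activity detection, speaker embedding,
# and clustering).
pipeline = Pipeline.from_pretrained("pyannote/speaker-diarization")

# Apply the pipeline to an audio file; the result is a pyannote.core.Annotation.
diarization = pipeline("audio.wav")

# Iterate over speech turns: each track is a (segment, track, speaker) triple.
for segment, _, speaker in diarization.itertracks(yield_label=True):
    print(f"{segment.start:.1f}s -- {segment.end:.1f}s: {speaker}")
```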

Results from the Paper


| Task | Dataset | Model | DER (%) | FA (%) | Miss (%) |
|---|---|---|---|---|---|
| Speaker Diarization | AMI | pyannote (MFCC) | 6.3 (#2) | 3.5 (#1) | 2.7 (#2) |
| Speaker Diarization | AMI | pyannote (waveform) | 6.0 (#1) | 3.6 (#2) | 2.4 (#1) |
| Speaker Diarization | DIHARD | pyannote (waveform) | 9.9 (#1) | 5.7 (#1) | 4.2 (#2) |
| Speaker Diarization | DIHARD | Baseline (best result in the literature as of Oct. 2019) | 11.2 (#3) | 6.5 (#2) | 4.7 (#3) |
| Speaker Diarization | DIHARD | pyannote (MFCC) | 10.5 (#2) | 6.8 (#3) | 3.7 (#1) |
| Speaker Diarization | ETAPE | Baseline | 7.7 (#3) | 7.5 (#3) | 0.2 (#1) |
| Speaker Diarization | ETAPE | pyannote (MFCC) | 5.6 (#2) | 5.2 (#2) | 0.4 (#2) |
| Speaker Diarization | ETAPE | pyannote (waveform) | 4.9 (#1) | 4.2 (#1) | 0.7 (#3) |

Ranks in parentheses give each model's position on the corresponding benchmark for that metric.
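
Diarization error rate (DER) aggregates false alarm (FA), missed detection (Miss), and speaker confusion, expressed relative to the total duration of reference speech. The sketch below shows how such a breakdown can be computed with pyannote.metrics; the reference and hypothesis annotations are toy placeholders unrelated to the results above, and the component key names assume a recent pyannote.metrics release.

```python
# Sketch: computing DER and its components with pyannote.metrics.
# The reference and hypothesis annotations are toy placeholders.
from pyannote.core import Annotation, Segment
from pyannote.metrics.diarization import DiarizationErrorRate

# Ground-truth speech turns.
reference = Annotation()
reference[Segment(0.0, 5.0)] = "spk_A"
reference[Segment(5.0, 9.0)] = "spk_B"

# System output to be evaluated.
hypothesis = Annotation()
hypothesis[Segment(0.0, 4.5)] = "spk_1"
hypothesis[Segment(4.5, 9.0)] = "spk_2"

metric = DiarizationErrorRate()

# Overall DER, as a fraction of the reference speech duration.
der = metric(reference, hypothesis)

# Component breakdown (assumed keys: 'false alarm', 'missed detection',
# 'confusion', 'total').
components = metric(reference, hypothesis, detailed=True)
print(f"DER  = {100 * der:.1f}%")
print(f"FA   = {100 * components['false alarm'] / components['total']:.1f}%")
print(f"Miss = {100 * components['missed detection'] / components['total']:.1f}%")
```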
