1 code implementation • 4 Oct 2023 • Dominik Klement, Mireia Diez, Federico Landini, Lukáš Burget, Anna Silnova, Marc Delcroix, Naohiro Tawara
Bayesian HMM clustering of x-vector sequences (VBx) has become a widely adopted diarization baseline model in publications and challenges.
no code implementations • 23 May 2023 • Marc Delcroix, Naohiro Tawara, Mireia Diez, Federico Landini, Anna Silnova, Atsunori Ogawa, Tomohiro Nakatani, Lukas Burget, Shoko Araki
Combining end-to-end neural speaker diarization (EEND) with vector clustering (VC), known as EEND-VC, has gained interest for leveraging the strengths of both methods.
2 code implementations • 27 Oct 2022 • Anna Silnova, Niko Brümmer, Albert Swart, Lukáš Burget
It extends PSDA with the ability to model within and between-speaker variabilities in toroidal submanifolds of the hypersphere.
no code implementations • 29 Mar 2022 • Themos Stafylakis, Ladislav Mošner, Oldřich Plchot, Johan Rohdin, Anna Silnova, Lukáš Burget, Jan "Honza'' Černocký
In this paper, we demonstrate a method for training speaker embedding extractors using weak annotation.
3 code implementations • 28 Mar 2022 • Niko Brümmer, Albert Swart, Ladislav Mošner, Anna Silnova, Oldřich Plchot, Themos Stafylakis, Lukáš Burget
In speaker recognition, where speech segments are mapped to embeddings on the unit hypersphere, two scoring backends are commonly used, namely cosine scoring or PLDA.
no code implementations • 19 Mar 2022 • Anna Silnova, Themos Stafylakis, Ladislav Mosner, Oldrich Plchot, Johan Rohdin, Pavel Matejka, Lukas Burget, Ondrej Glembek, Niko Brummer
In this paper, we analyze the behavior and performance of speaker embeddings and the back-end scoring model under domain and language mismatch.
1 code implementation • 6 Apr 2020 • Anna Silnova, Niko Brümmer, Johan Rohdin, Themos Stafylakis, Lukáš Burget
We apply the proposed probabilistic embeddings as input to an agglomerative hierarchical clustering (AHC) algorithm to do diarization in the DIHARD'19 evaluation set.
no code implementations • 16 Oct 2019 • Hossein Zeinali, Shuai Wang, Anna Silnova, Pavel Matějka, Oldřich Plchot
The last two networks are one-dimensional CNN and are based on the x-vector extraction topology.
no code implementations • 13 Jul 2019 • Hossein Zeinali, Pavel Matějka, Ladislav Mošner, Oldřich Plchot, Anna Silnova, Ondřej Novotný, Ján Profant, Ondřej Glembek, Lukáš Burget
This is a description of our effort in VOiCES 2019 Speaker Recognition challenge.
1 code implementation • 24 Mar 2018 • Anna Silnova, Niko Brummer, Daniel Garcia-Romero, David Snyder, Lukas Burget
We have recently introduced a fast scoring algorithm for a discriminatively trained HT-PLDA backend.
no code implementations • 27 Feb 2018 • Niko Brummer, Anna Silnova, Lukas Burget, Themos Stafylakis
Embeddings in machine learning are low-dimensional representations of complex input patterns, with the property that simple geometric operations like Euclidean distances and dot products can be used for classification and comparison tasks.