Search Results for author: Anna Silnova

Found 11 papers, 5 papers with code

Discriminative Training of VBx Diarization

1 code implementation4 Oct 2023 Dominik Klement, Mireia Diez, Federico Landini, Lukáš Burget, Anna Silnova, Marc Delcroix, Naohiro Tawara

Bayesian HMM clustering of x-vector sequences (VBx) has become a widely adopted diarization baseline model in publications and challenges.

Bayesian Inference

Toroidal Probabilistic Spherical Discriminant Analysis

2 code implementations27 Oct 2022 Anna Silnova, Niko Brümmer, Albert Swart, Lukáš Burget

It extends PSDA with the ability to model within and between-speaker variabilities in toroidal submanifolds of the hypersphere.

Speaker Recognition

Probabilistic Spherical Discriminant Analysis: An Alternative to PLDA for length-normalized embeddings

3 code implementations28 Mar 2022 Niko Brümmer, Albert Swart, Ladislav Mošner, Anna Silnova, Oldřich Plchot, Themos Stafylakis, Lukáš Burget

In speaker recognition, where speech segments are mapped to embeddings on the unit hypersphere, two scoring backends are commonly used, namely cosine scoring or PLDA.

Speaker Recognition

Probabilistic embeddings for speaker diarization

1 code implementation6 Apr 2020 Anna Silnova, Niko Brümmer, Johan Rohdin, Themos Stafylakis, Lukáš Burget

We apply the proposed probabilistic embeddings as input to an agglomerative hierarchical clustering (AHC) algorithm to do diarization in the DIHARD'19 evaluation set.

Clustering speaker-diarization +1

Fast variational Bayes for heavy-tailed PLDA applied to i-vectors and x-vectors

1 code implementation24 Mar 2018 Anna Silnova, Niko Brummer, Daniel Garcia-Romero, David Snyder, Lukas Burget

We have recently introduced a fast scoring algorithm for a discriminatively trained HT-PLDA backend.

Gaussian meta-embeddings for efficient scoring of a heavy-tailed PLDA model

no code implementations27 Feb 2018 Niko Brummer, Anna Silnova, Lukas Burget, Themos Stafylakis

Embeddings in machine learning are low-dimensional representations of complex input patterns, with the property that simple geometric operations like Euclidean distances and dot products can be used for classification and comparison tasks.

Speaker Recognition

Cannot find the paper you are looking for? You can Submit a new open access paper.