1 code implementation • 27 Oct 2020 • Chau Luu, Peter Bell, Steve Renals
On a test set of US Supreme Court recordings, we show that by leveraging two additional forms of speaker attribute information derived respectively from the matched training data, and VoxCeleb corpus, we improve the performance of our deep speaker embeddings for both verification and diarization tasks, achieving a relative improvement of 26. 2% in DER and 6. 7% in EER compared to baselines using speaker labels only.
1 code implementation • 2 Feb 2020 • Chau Luu, Peter Bell, Steve Renals
The first proposed method, DropClass, works via periodically dropping a random subset of classes from the training data and the output layer throughout training, resulting in a feature extractor trained on many different classification tasks.
no code implementations • 25 Oct 2019 • Chau Luu, Peter Bell, Steve Renals
Previous work has encouraged domain-invariance in deep speaker embedding by adversarially classifying the dataset or labelled environment to which the generated features belong.