no code implementations • 21 Jun 2021 • Omid Ghahabi, Volker Fischer
Speech Activity Detection (SAD), the task of locating speech segments within an audio recording, is a core component of most speech technology applications.
no code implementations • 23 Oct 2020 • Omid Ghahabi, Volker Fischer
This technical report describes the EML submission to the first VoxCeleb speaker diarization challenge.
no code implementations • 8 Dec 2015 • Omid Ghahabi, Javier Hernando
Given i-vectors as inputs, the authors proposed an impostor selection algorithm and a universal model adaptation process in a hybrid system based on Deep Belief Networks (DBN) and Deep Neural Networks (DNN) to discriminatively model each target speaker.