no code implementations • 16 Oct 2023 • Tomer Wullach, Shlomo E. Chazan
The challenges facing speech recognition systems, such as variations in pronunciations, adverse audio conditions, and the scarcity of labeled data, emphasize the necessity for a post-processing step that corrects recurring errors.
no code implementations • 27 Dec 2022 • Tomer Wullach, Shlomo E. Chazan
One prominent speech recognition decoding heuristic is beam search, which seeks the transcript with the greatest likelihood computed using the predicted distribution.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 21 Mar 2022 • Tomer Wullach, Shlomo E. Chazan
Recently proposed speech recognition systems are designed to predict using representations generated by their top layers, employing greedy decoding which isolates each timestep from the rest of the sequence.
no code implementations • 6 Mar 2022 • Aviad Eisenberg, Sharon Gannot, Shlomo E. Chazan
In this paper we present a unified time-frequency method for speaker extraction in clean and noisy conditions.
no code implementations • 11 Feb 2021 • Shlomo E. Chazan, Jacob Goldberger, Sharon Gannot
The experts estimate a mask from the noisy input and the final mask is then obtained as a weighted average of the experts' estimates, with the weights determined by the gating DNN.
2 code implementations • 4 Nov 2020 • Shlomo E. Chazan, Lior Wolf, Eliya Nachmani, Yossi Adi
The proposed approach is composed of several separation heads optimized together with a speaker classification branch.
1 code implementation • 26 Aug 2020 • Hodaya Hammer, Shlomo E. Chazan, Jacob Goldberger, Sharon Gannot
In this paper, we present a deep neural network-based online multi-speaker localisation algorithm.
no code implementations • 16 Dec 2018 • Shlomo E. Chazan, Sharon Gannot, Jacob Goldberger
The optimal clustering is found by minimizing the reconstruction loss of the mixture of autoencoder network.