Search Results for author: Shlomo E. Chazan

Found 8 papers, 2 papers with code

Optimized Tokenization for Transcribed Error Correction

no code implementations16 Oct 2023 Tomer Wullach, Shlomo E. Chazan

The challenges facing speech recognition systems, such as variations in pronunciations, adverse audio conditions, and the scarcity of labeled data, emphasize the necessity for a post-processing step that corrects recurring errors.

speech-recognition Speech Recognition

Don't Be So Sure! Boosting ASR Decoding via Confidence Relaxation

no code implementations27 Dec 2022 Tomer Wullach, Shlomo E. Chazan

One prominent speech recognition decoding heuristic is beam search, which seeks the transcript with the greatest likelihood computed using the predicted distribution.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Enhancing Speech Recognition Decoding via Layer Aggregation

no code implementations21 Mar 2022 Tomer Wullach, Shlomo E. Chazan

Recently proposed speech recognition systems are designed to predict using representations generated by their top layers, employing greedy decoding which isolates each timestep from the rest of the sequence.

Language Modelling speech-recognition +1

Single microphone speaker extraction using unified time-frequency Siamese-Unet

no code implementations6 Mar 2022 Aviad Eisenberg, Sharon Gannot, Shlomo E. Chazan

In this paper we present a unified time-frequency method for speaker extraction in clean and noisy conditions.

blind source separation

Speech enhancement with mixture-of-deep-experts with clean clustering pre-training

no code implementations11 Feb 2021 Shlomo E. Chazan, Jacob Goldberger, Sharon Gannot

The experts estimate a mask from the noisy input and the final mask is then obtained as a weighted average of the experts' estimates, with the weights determined by the gating DNN.

Clustering Speech Enhancement

Single channel voice separation for unknown number of speakers under reverberant and noisy settings

2 code implementations4 Nov 2020 Shlomo E. Chazan, Lior Wolf, Eliya Nachmani, Yossi Adi

The proposed approach is composed of several separation heads optimized together with a speaker classification branch.

Classification General Classification

FCN Approach for Dynamically Locating Multiple Speakers

1 code implementation26 Aug 2020 Hodaya Hammer, Shlomo E. Chazan, Jacob Goldberger, Sharon Gannot

In this paper, we present a deep neural network-based online multi-speaker localisation algorithm.

Deep Clustering Based on a Mixture of Autoencoders

no code implementations16 Dec 2018 Shlomo E. Chazan, Sharon Gannot, Jacob Goldberger

The optimal clustering is found by minimizing the reconstruction loss of the mixture of autoencoder network.

Clustering Deep Clustering

Cannot find the paper you are looking for? You can Submit a new open access paper.