Search Results for author: Shlomo E. Chazan

Found 8 papers, 2 papers with code

Optimized Tokenization for Transcribed Error Correction

no code implementations • 16 Oct 2023 • Tomer Wullach, Shlomo E. Chazan

The challenges facing speech recognition systems, such as variations in pronunciations, adverse audio conditions, and the scarcity of labeled data, emphasize the necessity for a post-processing step that corrects recurring errors.

Tasks: speech-recognition, Speech Recognition

Don't Be So Sure! Boosting ASR Decoding via Confidence Relaxation

no code implementations • 27 Dec 2022 • Tomer Wullach, Shlomo E. Chazan

One prominent speech recognition decoding heuristic is beam search, which seeks the transcript with the greatest likelihood computed using the predicted distribution.

Tasks: Automatic Speech Recognition, Automatic Speech Recognition (ASR), +2 more
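
As an illustration of the beam search heuristic mentioned in this abstract, here is a minimal sketch in plain Python. It is not the authors' decoder: `next_log_probs`, `toy_model`, and the probabilities are hypothetical stand-ins for an acoustic model's predicted distribution, and the sketch ignores CTC blanks and language-model fusion.

```python
import math


def beam_search(next_log_probs, length, beam_width=2):
    """Keep the `beam_width` highest-scoring prefixes at every step.

    next_log_probs(prefix) is a hypothetical callback returning a dict
    that maps each candidate token to its log-probability given `prefix`.
    """
    beams = [((), 0.0)]  # (token tuple, cumulative log-likelihood)
    for _ in range(length):
        candidates = []
        for prefix, score in beams:
            for token, logp in next_log_probs(prefix).items():
                candidates.append((prefix + (token,), score + logp))
        # Prune to the most likely prefixes before extending again.
        candidates.sort(key=lambda c: c[1], reverse=True)
        beams = candidates[:beam_width]
    return beams[0]


def toy_model(prefix):
    # Hypothetical distribution, chosen so that the greedy first choice
    # ("a") leads to a less likely full sequence than starting with "b".
    if not prefix:
        return {"a": math.log(0.6), "b": math.log(0.4)}
    if prefix[-1] == "a":
        return {"a": math.log(0.5), "b": math.log(0.5)}
    return {"a": math.log(0.1), "b": math.log(0.9)}


best_tokens, best_score = beam_search(toy_model, length=2, beam_width=2)
```

With a beam width of 1 the same function reduces to greedy decoding, which in this toy setup commits to "a" at the first step and cannot recover the more likely sequence.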

Enhancing Speech Recognition Decoding via Layer Aggregation

no code implementations • 21 Mar 2022 • Tomer Wullach, Shlomo E. Chazan

Recently proposed speech recognition systems are designed to predict using representations generated by their top layers, employing greedy decoding which isolates each timestep from the rest of the sequence.

Tasks: Language Modelling, speech-recognition, +1 more

Single microphone speaker extraction using unified time-frequency Siamese-Unet

no code implementations • 6 Mar 2022 • Aviad Eisenberg, Sharon Gannot, Shlomo E. Chazan

In this paper we present a unified time-frequency method for speaker extraction in clean and noisy conditions.

Tasks: blind source separation

Speech enhancement with mixture-of-deep-experts with clean clustering pre-training

no code implementations • 11 Feb 2021 • Shlomo E. Chazan, Jacob Goldberger, Sharon Gannot

The experts estimate a mask from the noisy input and the final mask is then obtained as a weighted average of the experts' estimates, with the weights determined by the gating DNN.

Tasks: Clustering, Speech Enhancement
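
The mask combination described in this abstract can be sketched as a gate-weighted average. This is a minimal illustration in plain Python, not the paper's implementation: the function name `combine_masks`, the list-based mask layout, and the example values are assumptions, and the gate weights are simply given here rather than produced by a gating DNN.

```python
def combine_masks(expert_masks, gate_weights):
    """Return the weighted average of per-expert masks.

    expert_masks: list of K masks, each a list of F values in [0, 1]
                  (one value per frequency bin; hypothetical layout).
    gate_weights: list of K non-negative weights, standing in for the
                  output of the gating network.
    """
    if len(expert_masks) != len(gate_weights):
        raise ValueError("one gate weight per expert is required")
    total = sum(gate_weights)
    if total <= 0:
        raise ValueError("gate weights must sum to a positive value")
    # Normalise so the weights form a convex combination.
    weights = [w / total for w in gate_weights]
    n_bins = len(expert_masks[0])
    return [
        sum(w * mask[f] for w, mask in zip(weights, expert_masks))
        for f in range(n_bins)
    ]


# Two hypothetical experts, three frequency bins.
masks = [[1.0, 0.0, 0.5], [0.0, 1.0, 0.5]]
final_mask = combine_masks(masks, [0.75, 0.25])
```

Because the weights are normalised to sum to one, the combined mask stays within the range spanned by the individual experts' estimates in every bin.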

Single channel voice separation for unknown number of speakers under reverberant and noisy settings

2 code implementations • 4 Nov 2020 • Shlomo E. Chazan, Lior Wolf, Eliya Nachmani, Yossi Adi

The proposed approach is composed of several separation heads optimized together with a speaker classification branch.

Tasks: Classification, General Classification

FCN Approach for Dynamically Locating Multiple Speakers

1 code implementation • 26 Aug 2020 • Hodaya Hammer, Shlomo E. Chazan, Jacob Goldberger, Sharon Gannot

In this paper, we present a deep neural network-based online multi-speaker localisation algorithm.

Deep Clustering Based on a Mixture of Autoencoders

no code implementations • 16 Dec 2018 • Shlomo E. Chazan, Sharon Gannot, Jacob Goldberger

The optimal clustering is found by minimizing the reconstruction loss of the mixture-of-autoencoders network.

Tasks: Clustering, Deep Clustering
