Search Results for author: Soheil Khorram

Found 9 papers, 2 papers with code

Contrastive Siamese Network for Semi-supervised Speech Recognition

no code implementations • 27 May 2022 • Soheil Khorram, Jaeyoung Kim, Anshuman Tripathi, Han Lu, Qian Zhang, Hasim Sak

This paper introduces the contrastive siamese (c-siam) network, an architecture for leveraging unlabeled acoustic data in speech recognition.

Speech Recognition

Analyzing Large Receptive Field Convolutional Networks for Distant Speech Recognition

no code implementations • 15 Oct 2019 • Salar Jafarlou, Soheil Khorram, Vinay Kothapally, John H. L. Hansen

In the present study, we address this issue by investigating variants of large receptive field CNNs (LRF-CNNs) which include deeply recursive networks, dilated convolutional neural networks, and stacked hourglass networks.

Automatic Speech Recognition (ASR) +2

Domain Expansion in DNN-based Acoustic Models for Robust Speech Recognition

no code implementations • 1 Oct 2019 • Shahram Ghorbani, Soheil Khorram, John H. L. Hansen

An obvious approach to leveraging data from a new domain (e.g., new accented speech) is to first generate a comprehensive dataset of all domains by combining all available data, and then use this dataset to retrain the acoustic models.

Robust Speech Recognition, Speech Recognition

Probabilistic Permutation Invariant Training for Speech Separation

no code implementations • 4 Aug 2019 • Midia Yousefi, Soheil Khorram, John H. L. Hansen

The recently proposed Permutation Invariant Training (PIT) addresses this problem by determining the output-label assignment that minimizes the separation error.

Speech Separation

Jointly Aligning and Predicting Continuous Emotion Annotations

no code implementations • 5 Jul 2019 • Soheil Khorram, Melvin G McInnis, Emily Mower Provost

To deal with this challenge, we introduce a new convolutional neural network (multi-delay sinc network) that is able to simultaneously align and predict labels in an end-to-end manner.

Convolutional Neural Network-based Speech Enhancement for Cochlear Implant Recipients

no code implementations • 3 Jul 2019 • Nursadul Mamun, Soheil Khorram, John H. L. Hansen

To improve speech enhancement methods for CI users, we propose to perform speech enhancement in a cochlear filter-bank feature space, a feature set specifically designed for CI users based on CI auditory stimuli.

Speech Enhancement

Trainable Time Warping: Aligning Time-Series in the Continuous-Time Domain

1 code implementation • 21 Mar 2019 • Soheil Khorram, Melvin G McInnis, Emily Mower Provost

We introduce trainable time warping (TTW), whose complexity is linear in both the number and the length of time-series.

General Classification, Time Series +1

Progressive Neural Networks for Transfer Learning in Emotion Recognition

1 code implementation • 10 Jun 2017 • John Gideon, Soheil Khorram, Zakaria Aldeneh, Dimitrios Dimitriadis, Emily Mower Provost

Many paralinguistic tasks are closely related and thus representations learned in one domain can be leveraged for another.

Emotion Recognition, Transfer Learning
