Search Results for author: Vikramjit Mitra

Found 14 papers, 0 papers with code

Investigating salient representations and label Variance in Dimensional Speech Emotion Analysis

no code implementations17 Dec 2023 Vikramjit Mitra, Jingping Nie, Erdrin Azemi

Representations derived from models such as BERT (Bidirectional Encoder Representations from Transformers) and HuBERT (Hidden units BERT), have helped to achieve state-of-the-art performance in dimensional speech emotion recognition.

Speech Emotion Recognition

Pre-trained Model Representations and their Robustness against Noise for Speech Emotion Analysis

no code implementations3 Mar 2023 Vikramjit Mitra, Vasudha Kowtha, Hsiang-Yun Sherry Chien, Erdrin Azemi, Carlos Avendano

We investigated the use of pre-trained model representations for estimating dimensional emotions, such as activation, valence, and dominance, from speech.

Emotion Recognition Knowledge Distillation +3

Analysis and Tuning of a Voice Assistant System for Dysfluent Speech

no code implementations18 Jun 2021 Vikramjit Mitra, Zifang Huang, Colin Lea, Lauren Tooley, Sarah Wu, Darren Botten, Ashwini Palekar, Shrinath Thelapurath, Panayiotis Georgiou, Sachin Kajarekar, Jefferey Bigham

Dysfluencies and variations in speech pronunciation can severely degrade speech recognition performance, and for many individuals with moderate-to-severe speech disorders, voice operated systems do not work.

Intent Recognition speech-recognition +1

SEP-28k: A Dataset for Stuttering Event Detection From Podcasts With People Who Stutter

no code implementations24 Feb 2021 Colin Lea, Vikramjit Mitra, Aparna Joshi, Sachin Kajarekar, Jeffrey P. Bigham

The ability to automatically detect stuttering events in speech could help speech pathologists track an individual's fluency over time or help improve speech recognition systems for people with atypical speech patterns.

Event Detection speech-recognition +1

Investigation and Analysis of Hyper and Hypo neuron pruning to selectively update neurons during Unsupervised Adaptation

no code implementations6 Jan 2020 Vikramjit Mitra, Horacio Franco

This work investigates if pruning approaches are successful in detecting neurons that are either high-salient (mostly active or hyper) or low-salient (barely active or hypo), and whether removal of such neurons can help to improve the model's generalization capacity.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Articulatory and bottleneck features for speaker-independent ASR of dysarthric speech

no code implementations16 May 2019 Emre Yilmaz, Vikramjit Mitra, Ganesh Sivaraman, Horacio Franco

The rapid population aging has stimulated the development of assistive devices that provide personalized medical support to the needies suffering from various etiologies.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Articulatory Features for ASR of Pathological Speech

no code implementations28 Jul 2018 Emre Yilmaz, Vikramjit Mitra, Chris Bartels, Horacio Franco

In this work, we investigate the joint use of articulatory and acoustic features for automatic speech recognition (ASR) of pathological speech.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Articulatory information and Multiview Features for Large Vocabulary Continuous Speech Recognition

no code implementations16 Feb 2018 Vikramjit Mitra, Wen Wang, Chris Bartels, Horacio Franco, Dimitra Vergyri

This paper explores the use of multi-view features and their discriminative transforms in a convolutional deep neural network (CNN) architecture for a continuous large vocabulary speech recognition task.

speech-recognition Speech Recognition

Interpreting DNN output layer activations: A strategy to cope with unseen data in speech recognition

no code implementations16 Feb 2018 Vikramjit Mitra, Horacio Franco

This work proposes a strategy to assess a model's performance by analyzing the output layer activations by using a distance measure between the most likely target and the next most likely target, which is used for data selection for performing unsupervised adaptation.

speech-recognition Speech Recognition

Leveraging Deep Neural Network Activation Entropy to cope with Unseen Data in Speech Recognition

no code implementations31 Aug 2017 Vikramjit Mitra, Horacio Franco

This work aims to estimate the propagation of such distortion in the form of network activation entropy, which is measured over a short- time running window on the activation from each neuron of a given hidden layer, and these measurements are then used to compute summary entropy.

BIG-bench Machine Learning speech-recognition +1

Cannot find the paper you are looking for? You can Submit a new open access paper.