Search Results for author: Vikramjit Mitra

Found 14 papers, 0 papers with code

Investigating salient representations and label Variance in Dimensional Speech Emotion Analysis

no code implementations • 17 Dec 2023 • Vikramjit Mitra, Jingping Nie, Erdrin Azemi

Representations derived from models such as BERT (Bidirectional Encoder Representations from Transformers) and HuBERT (Hidden units BERT), have helped to achieve state-of-the-art performance in dimensional speech emotion recognition.

Speech Emotion Recognition

Paper
Add Code

Pre-trained Model Representations and their Robustness against Noise for Speech Emotion Analysis

no code implementations • 3 Mar 2023 • Vikramjit Mitra, Vasudha Kowtha, Hsiang-Yun Sherry Chien, Erdrin Azemi, Carlos Avendano

We investigated the use of pre-trained model representations for estimating dimensional emotions, such as activation, valence, and dominance, from speech.

Emotion Recognition Knowledge Distillation +3

Paper
Add Code

Speech Emotion: Investigating Model Representations, Multi-Task Learning and Knowledge Distillation

no code implementations • 2 Jul 2022 • Vikramjit Mitra, Hsiang-Yun Sherry Chien, Vasudha Kowtha, Joseph Yitan Cheng, Erdrin Azemi

We investigate the use of pre-trained model representations to improve valence estimation from acoustic speech signal.

Knowledge Distillation Multi-Task Learning +1

Paper
Add Code

Estimating Respiratory Rate From Breath Audio Obtained Through Wearable Microphones

no code implementations • 28 Jul 2021 • Agni Kumar, Vikramjit Mitra, Carolyn Oliver, Adeeti Ullal, Matt Biddulph, Irida Mance

Respiratory rate (RR) is a clinical metric used to assess overall health and physical fitness.

Paper
Add Code

Analysis and Tuning of a Voice Assistant System for Dysfluent Speech

no code implementations • 18 Jun 2021 • Vikramjit Mitra, Zifang Huang, Colin Lea, Lauren Tooley, Sarah Wu, Darren Botten, Ashwini Palekar, Shrinath Thelapurath, Panayiotis Georgiou, Sachin Kajarekar, Jefferey Bigham

Dysfluencies and variations in speech pronunciation can severely degrade speech recognition performance, and for many individuals with moderate-to-severe speech disorders, voice operated systems do not work.

Intent Recognition speech-recognition +1

Paper
Add Code

SEP-28k: A Dataset for Stuttering Event Detection From Podcasts With People Who Stutter

no code implementations • 24 Feb 2021 • Colin Lea, Vikramjit Mitra, Aparna Joshi, Sachin Kajarekar, Jeffrey P. Bigham

The ability to automatically detect stuttering events in speech could help speech pathologists track an individual's fluency over time or help improve speech recognition systems for people with atypical speech patterns.

Event Detection speech-recognition +1

Paper
Add Code

Detecting Emotion Primitives from Speech and their use in discerning Categorical Emotions

no code implementations • 31 Jan 2020 • Vasudha Kowtha, Vikramjit Mitra, Chris Bartels, Erik Marchi, Sue Booker, William Caruso, Sachin Kajarekar, Devang Naik

Emotion plays an essential role in human-to-human communication, enabling us to convey feelings such as happiness, frustration, and sincerity.

Natural Language Understanding speech-recognition +1

Paper
Add Code

Investigation and Analysis of Hyper and Hypo neuron pruning to selectively update neurons during Unsupervised Adaptation

no code implementations • 6 Jan 2020 • Vikramjit Mitra, Horacio Franco

This work investigates if pruning approaches are successful in detecting neurons that are either high-salient (mostly active or hyper) or low-salient (barely active or hypo), and whether removal of such neurons can help to improve the model's generalization capacity.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Leveraging Acoustic Cues and Paralinguistic Embeddings to Detect Expression from Voice

no code implementations • 28 Jun 2019 • Vikramjit Mitra, Sue Booker, Erik Marchi, David Scott Farrar, Ute Dorothea Peitz, Bridget Cheng, Ermine Teves, Anuj Mehta, Devang Naik

The expectation is that such assistants should understand the intent of the users query.

Paper
Add Code

Articulatory and bottleneck features for speaker-independent ASR of dysarthric speech

no code implementations • 16 May 2019 • Emre Yilmaz, Vikramjit Mitra, Ganesh Sivaraman, Horacio Franco

The rapid population aging has stimulated the development of assistive devices that provide personalized medical support to the needies suffering from various etiologies.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Articulatory Features for ASR of Pathological Speech

no code implementations • 28 Jul 2018 • Emre Yilmaz, Vikramjit Mitra, Chris Bartels, Horacio Franco

In this work, we investigate the joint use of articulatory and acoustic features for automatic speech recognition (ASR) of pathological speech.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Articulatory information and Multiview Features for Large Vocabulary Continuous Speech Recognition

no code implementations • 16 Feb 2018 • Vikramjit Mitra, Wen Wang, Chris Bartels, Horacio Franco, Dimitra Vergyri

This paper explores the use of multi-view features and their discriminative transforms in a convolutional deep neural network (CNN) architecture for a continuous large vocabulary speech recognition task.

speech-recognition Speech Recognition

Paper
Add Code

Interpreting DNN output layer activations: A strategy to cope with unseen data in speech recognition

no code implementations • 16 Feb 2018 • Vikramjit Mitra, Horacio Franco

This work proposes a strategy to assess a model's performance by analyzing the output layer activations by using a distance measure between the most likely target and the next most likely target, which is used for data selection for performing unsupervised adaptation.

speech-recognition Speech Recognition

Paper
Add Code

Leveraging Deep Neural Network Activation Entropy to cope with Unseen Data in Speech Recognition

no code implementations • 31 Aug 2017 • Vikramjit Mitra, Horacio Franco

This work aims to estimate the propagation of such distortion in the form of network activation entropy, which is measured over a short- time running window on the activation from each neuron of a given hidden layer, and these measurements are then used to compute summary entropy.

BIG-bench Machine Learning speech-recognition +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.