no code implementations • 17 Dec 2023 • Vikramjit Mitra, Jingping Nie, Erdrin Azemi
Representations derived from models such as BERT (Bidirectional Encoder Representations from Transformers) and HuBERT (Hidden units BERT), have helped to achieve state-of-the-art performance in dimensional speech emotion recognition.
no code implementations • 3 Mar 2023 • Vikramjit Mitra, Vasudha Kowtha, Hsiang-Yun Sherry Chien, Erdrin Azemi, Carlos Avendano
We investigated the use of pre-trained model representations for estimating dimensional emotions, such as activation, valence, and dominance, from speech.
no code implementations • 2 Jul 2022 • Vikramjit Mitra, Hsiang-Yun Sherry Chien, Vasudha Kowtha, Joseph Yitan Cheng, Erdrin Azemi
We investigate the use of pre-trained model representations to improve valence estimation from acoustic speech signal.
no code implementations • 28 Jul 2021 • Agni Kumar, Vikramjit Mitra, Carolyn Oliver, Adeeti Ullal, Matt Biddulph, Irida Mance
Respiratory rate (RR) is a clinical metric used to assess overall health and physical fitness.
no code implementations • 18 Jun 2021 • Vikramjit Mitra, Zifang Huang, Colin Lea, Lauren Tooley, Sarah Wu, Darren Botten, Ashwini Palekar, Shrinath Thelapurath, Panayiotis Georgiou, Sachin Kajarekar, Jefferey Bigham
Dysfluencies and variations in speech pronunciation can severely degrade speech recognition performance, and for many individuals with moderate-to-severe speech disorders, voice operated systems do not work.
no code implementations • 24 Feb 2021 • Colin Lea, Vikramjit Mitra, Aparna Joshi, Sachin Kajarekar, Jeffrey P. Bigham
The ability to automatically detect stuttering events in speech could help speech pathologists track an individual's fluency over time or help improve speech recognition systems for people with atypical speech patterns.
no code implementations • 31 Jan 2020 • Vasudha Kowtha, Vikramjit Mitra, Chris Bartels, Erik Marchi, Sue Booker, William Caruso, Sachin Kajarekar, Devang Naik
Emotion plays an essential role in human-to-human communication, enabling us to convey feelings such as happiness, frustration, and sincerity.
no code implementations • 6 Jan 2020 • Vikramjit Mitra, Horacio Franco
This work investigates if pruning approaches are successful in detecting neurons that are either high-salient (mostly active or hyper) or low-salient (barely active or hypo), and whether removal of such neurons can help to improve the model's generalization capacity.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • 28 Jun 2019 • Vikramjit Mitra, Sue Booker, Erik Marchi, David Scott Farrar, Ute Dorothea Peitz, Bridget Cheng, Ermine Teves, Anuj Mehta, Devang Naik
The expectation is that such assistants should understand the intent of the users query.
no code implementations • 16 May 2019 • Emre Yilmaz, Vikramjit Mitra, Ganesh Sivaraman, Horacio Franco
The rapid population aging has stimulated the development of assistive devices that provide personalized medical support to the needies suffering from various etiologies.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • 28 Jul 2018 • Emre Yilmaz, Vikramjit Mitra, Chris Bartels, Horacio Franco
In this work, we investigate the joint use of articulatory and acoustic features for automatic speech recognition (ASR) of pathological speech.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • 16 Feb 2018 • Vikramjit Mitra, Wen Wang, Chris Bartels, Horacio Franco, Dimitra Vergyri
This paper explores the use of multi-view features and their discriminative transforms in a convolutional deep neural network (CNN) architecture for a continuous large vocabulary speech recognition task.
no code implementations • 16 Feb 2018 • Vikramjit Mitra, Horacio Franco
This work proposes a strategy to assess a model's performance by analyzing the output layer activations by using a distance measure between the most likely target and the next most likely target, which is used for data selection for performing unsupervised adaptation.
no code implementations • 31 Aug 2017 • Vikramjit Mitra, Horacio Franco
This work aims to estimate the propagation of such distortion in the form of network activation entropy, which is measured over a short- time running window on the activation from each neuron of a given hidden layer, and these measurements are then used to compute summary entropy.