Search Results for author: Shiva Sundaram

Found 8 papers, 0 papers with code

Fully Learnable Front-End for Multi-Channel Acoustic Modeling using Semi-Supervised Learning

no code implementations1 Feb 2020 Sanna Wager, Aparna Khare, Minhua Wu, Kenichi Kumatani, Shiva Sundaram

Using a large offline teacher model trained on beamformed audio, we trained a simpler multi-channel student acoustic model used in the speech recognition system.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Robust Multi-channel Speech Recognition using Frequency Aligned Network

no code implementations6 Feb 2020 Taejin Park, Kenichi Kumatani, Minhua Wu, Shiva Sundaram

In this paper, we further develop this idea and use frequency aligned network for robust multi-channel automatic speech recognition (ASR).

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Self-Supervised learning with cross-modal transformers for emotion recognition

no code implementations20 Nov 2020 Aparna Khare, Srinivas Parthasarathy, Shiva Sundaram

Self-supervised learning has shown improvements on tasks with limited labeled datasets in domains like speech and natural language.

Emotion Recognition Language Modelling +4

Audiovisual Highlight Detection in Videos

no code implementations11 Feb 2021 Karel Mundnich, Alexandra Fenster, Aparna Khare, Shiva Sundaram

To better study the task of highlight detection, we run a pilot experiment with highlights annotations for a small subset of video clips and fine-tune our best model on it.

Highlight Detection Object Recognition +2

Cannot find the paper you are looking for? You can Submit a new open access paper.