Search Results for author: Ankit Parag Shah

Found 3 papers, 1 papers with code

Audio-visual fine-tuning of audio-only ASR models

no code implementations • 14 Dec 2023 • Avner May, Dmitriy Serdyuk, Ankit Parag Shah, Otavio Braga, Olivier Siohan

Audio-visual automatic speech recognition (AV-ASR) models are very effective at reducing word error rates on noisy speech, but require large amounts of transcribed AV training data.

Automatic Speech Recognition Self-Supervised Learning +2

Paper
Add Code

Multimodal Behavioral Markers Exploring Suicidal Intent in Social Media Videos

1 code implementation • International Conference on Multimodal Interaction 2019 • Ankit Parag Shah, Vaibhav Vaibhav, Vasu Sharma, Mahmoud Al Ismail, Jeffrey M. Girard, Louis Philippe Morency

In this work, we set out to study multimodal behavioral markers related to suicidal intent when expressed on social media videos.

Multimodal Emotion Recognition Multimodal Sentiment Analysis

Paper
Code

Tartan: A retrieval-based socialbot powered by a dynamic finite-state machine architecture

no code implementations • 4 Dec 2018 • George Larionov, Zachary Kaden, Hima Varsha Dureddy, Gabriel Bayomi T. Kalejaiye, Mihir Kale, Srividya Pranavi Potharaju, Ankit Parag Shah, Alexander I. Rudnicky

Tartan is a non-goal-oriented socialbot focused around providing users with an engaging and fluent casual conversation.

Retrieval

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.