no code implementations • 14 Dec 2023 • Avner May, Dmitriy Serdyuk, Ankit Parag Shah, Otavio Braga, Olivier Siohan
Audio-visual automatic speech recognition (AV-ASR) models are very effective at reducing word error rates on noisy speech, but require large amounts of transcribed AV training data.
1 code implementation • International Conference on Multimodal Interaction 2019 • Ankit Parag Shah, Vaibhav Vaibhav, Vasu Sharma, Mahmoud Al Ismail, Jeffrey M. Girard, Louis Philippe Morency
In this work, we set out to study multimodal behavioral markers related to suicidal intent when expressed in social media videos.
Multimodal Emotion Recognition • Multimodal Sentiment Analysis
no code implementations • 4 Dec 2018 • George Larionov, Zachary Kaden, Hima Varsha Dureddy, Gabriel Bayomi T. Kalejaiye, Mihir Kale, Srividya Pranavi Potharaju, Ankit Parag Shah, Alexander I. Rudnicky
Tartan is a non-goal-oriented socialbot focused on providing users with an engaging and fluent casual conversation.