no code implementations • 26 Jun 2021 • Nihar Bendre, Kevin Desai, Peyman Najafirad
To overcome this problem, we propose a Multimodal Variational Auto-Encoder (M-VAE) which can learn the shared latent space of image features and the semantic space.
no code implementations • 15 May 2021 • Nihar Bendre, Kevin Desai, Peyman Najafirad
This results in achieving better understanding of complex questions and also provides reasoning as to why the module predicts a particular answer.
no code implementations • 30 Jul 2020 • Nihar Bendre, Hugo Terashima Marín, Peyman Najafirad
Deep neural networks have been able to outperform humans in some cases like image recognition and image classification.
no code implementations • 29 Jan 2020 • Nihar Bendre, Nima Ebadi, John J. Prevost, Paul Rad
A great number of computer vision publications have focused on distinguishing between human action recognition and classification rather than the intensity of actions performed.