no code implementations • 10 Aug 2021 • Kranti Kumar Parida, Siddharth Srivastava, Neeraj Matiyali, Gaurav Sharma
Binaural audio gives the listener the feeling of being in the recording place and enhances the immersive experience if coupled with AR/VR.
no code implementations • 19 Oct 2019 • Kranti Kumar Parida, Neeraj Matiyali, Tanaya Guha, Gaurav Sharma
We present an audio-visual multimodal approach for the task of zeroshot learning (ZSL) for classification and retrieval of videos.
Ranked #5 on GZSL Video Classification on VGGSound-GZSL(main)
no code implementations • 17 Oct 2019 • Neeraj Matiyali, Gaurav Sharma
We show that using a learned clip similarity aggregation function allows filtering out hard clip pairs, e. g. where the person is not clearly visible, is in a challenging pose, or where the poses in the two clips are too different to be informative.
Optical Flow Estimation Video-Based Person Re-Identification
no code implementations • 20 Nov 2018 • Pravendra Singh, Manikandan. R, Neeraj Matiyali, Vinay P. Namboodiri
Additionally, we also empirically show our method's adaptability for classification based architecture VGG16 on datasets CIFAR and German Traffic Sign Recognition Benchmark (GTSRB) achieving a compression rate of 125X and 200X with the reduction in flops by 90. 50% and 96. 6% respectively with no loss of accuracy.