no code implementations • ECCV 2020 • Jinwoo Choi, Gaurav Sharma, Samuel Schulter, Jia-Bin Huang
As the first novelty, we propose an attention mechanism which focuses on more discriminative clips and directly optimizes for video-level (cf.
Ranked #3 on Unsupervised Domain Adaptation on UCF-HMDB
no code implementations • 7 Nov 2023 • Siddharth Srivastava, Gaurav Sharma
We demonstrate empirically that, using a joint network to train across modalities leads to meaningful information sharing and this allows us to achieve state-of-the-art results on most of the benchmarks.
Ranked #1 on Image Classification on ImageNet
1 code implementation • 29 Oct 2023 • Hao Zhang, Yang Liu, Xiaoyan Liu, Tianming Liang, Gaurav Sharma, Liang Xue, Maozu Guo
We introduce a novel graph-based framework for alleviating key challenges in distantly-supervised relation extraction and demonstrate its effectiveness in the challenging and important domain of biomedical data.
no code implementations • 20 Dec 2022 • Jue Lin, Gaurav Sharma, Thrasyvoulos N. Pappas
We investigate data-driven texture modeling via analysis and synthesis with generative adversarial networks.
1 code implementation • 24 May 2022 • Vasu Goel, Dhruv Sahnan, Venktesh V, Gaurav Sharma, Deep Dwivedi, Mukesh Mohania
However, there has not been a model specifically adapted for the education domain (particularly K-12) across subjects to the best of our knowledge.
1 code implementation • 8 Mar 2022 • Jue Lin, Gaurav Sharma, Thrasyvoulos N. Pappas
We present a new approach for universal texture synthesis by incorporating a multi-scale texton broadcasting module in the StyleGAN-2 framework.
no code implementations • 15 Nov 2021 • Kranti Kumar Parida, Siddharth Srivastava, Gaurav Sharma
In this work, we argue that depth map of the scene can act as a proxy for inducing distance information of different objects in the scene, for the task of audio binauralization.
no code implementations • 10 Aug 2021 • Kranti Kumar Parida, Siddharth Srivastava, Neeraj Matiyali, Gaurav Sharma
Binaural audio gives the listener the feeling of being in the recording place and enhances the immersive experience if coupled with AR/VR.
1 code implementation • 24 May 2021 • Tianming Liang, Yang Liu, Xiaoyan Liu, Hao Zhang, Gaurav Sharma, Maozu Guo
On top of that, we further propose a novel constraint graph-based relation extraction framework(CGRE) to handle the two challenges simultaneously.
no code implementations • 28 Mar 2021 • Siddharth Srivastava, Gaurav Sharma
As a second contribution, we propose to improve the graph construction for GNNs for 3D point clouds.
Ranked #1 on 3D Part Segmentation on ShapeNet-Part
no code implementations • 25 Mar 2021 • Kranti Kumar Parida, Gaurav Sharma
Cross-modal retrieval is generally performed by projecting and aligning the data from two different modalities onto a shared representation space.
1 code implementation • CVPR 2021 • Kranti Kumar Parida, Siddharth Srivastava, Gaurav Sharma
We propose a novel multi modal fusion technique, which incorporates the material properties explicitly while combining audio (echoes) and visual modalities to predict the scene depth.
no code implementations • ECCV 2020 • Xiangyun Zhao, Samuel Schulter, Gaurav Sharma, Yi-Hsuan Tsai, Manmohan Chandraker, Ying Wu
To address this challenge, we design a framework which works with such partial annotations, and we exploit a pseudo labeling approach that we adapt for our specific case.
no code implementations • 19 Oct 2019 • Kranti Kumar Parida, Neeraj Matiyali, Tanaya Guha, Gaurav Sharma
We present an audio-visual multimodal approach for the task of zeroshot learning (ZSL) for classification and retrieval of videos.
Ranked #5 on GZSL Video Classification on VGGSound-GZSL(main)
no code implementations • 17 Oct 2019 • Neeraj Matiyali, Gaurav Sharma
We show that using a learned clip similarity aggregation function allows filtering out hard clip pairs, e. g. where the person is not clearly visible, is in a challenging pose, or where the poses in the two clips are too different to be informative.
Optical Flow Estimation Video-Based Person Re-Identification
no code implementations • 5 Jul 2019 • Li Ding, Mohammad H. Bawany, Ajay E. Kuriyan, Rajeev S. Ramchandran, Charles C. Wykoff, Gaurav Sharma
We propose a novel pipeline to detect retinal vessels in FA images using deep neural networks that reduces the effort required for generating labeled ground truth data by combining two key components: cross-modality transfer and human-in-the-loop learning.
no code implementations • 27 Mar 2019 • Siddharth Srivastava, Frederic Jurie, Gaurav Sharma
We address the problem of 3D object detection from 2D monocular images in autonomous driving scenarios.
3D Object Detection 3D Object Detection From Monocular Images +4
no code implementations • ECCV 2018 • Ankan Bansal, Karan Sikka, Gaurav Sharma, Rama Chellappa, Ajay Divakaran
We introduce and tackle the problem of zero-shot object detection (ZSD), which aims to detect object classes which are not observed during training.
1 code implementation • 3 Mar 2018 • Samyak Datta, Gaurav Sharma, C. V. Jawahar
Although faces extracted from videos have a lower spatial resolution than those which are available as part of standard supervised face datasets such as LFW and CASIA-WebFace, the former represent a much more realistic setting, e. g. in surveillance scenarios where most of the faces detected are very small.
no code implementations • 11 Feb 2018 • Shubham Gupta, Gaurav Sharma, Ambedkar Dukkipati
Networks observed in real world like social networks, collaboration networks etc., exhibit temporal dynamics, i. e. nodes and edges appear and/or disappear over time.
no code implementations • 18 Sep 2017 • Ahmed Elliethy, Gaurav Sharma
The stochastic dis-association at each iteration maintains each estimated association according to an estimated probability for confidence, obtained via a probabilistic model.
no code implementations • CVPR 2017 • Santhosh K. Ramakrishnan, Ambar Pal, Gaurav Sharma, Anurag Mittal
We study the problem of answering questions about images in the harder setting, where the test questions and corresponding images contain novel objects, which were not queried about in the training data.
no code implementations • 22 Jan 2017 • Siddharth Srivastava, Gaurav Sharma, Brejesh lall
We test on unknown objects, which were not seen during training, and perform clustering in the learned embedding space of supervoxels to effectively perform novel object discovery.
no code implementations • CVPR 2017 • Amlan Kar, Nishant Rai, Karan Sikka, Gaurav Sharma
We propose a novel method for temporally pooling frames in a video for the task of human action recognition.
no code implementations • 1 Nov 2016 • Binod Bhattarai, Gaurav Sharma, Frederic Jurie
The challenge addressed in this paper is to design a common universal representation such that a single merged signature is transmitted to the server, whatever be the type and number of features computed by the client, ensuring nonetheless an optimal performance.
no code implementations • 8 Aug 2016 • Karan Sikka, Gaurav Sharma
We study the problem of video classification for facial analysis and human action recognition.
no code implementations • CVPR 2016 • Binod Bhattarai, Gaurav Sharma, Frederic Jurie
The experiments clearly demonstrate the scalability and improved performance of the proposed method on the tasks of identity and age based face image retrieval compared to competitive existing methods, on the standard datasets and with the presence of a million distractor face images.
no code implementations • CVPR 2016 • Karan Sikka, Gaurav Sharma, Marian Bartlett
We study the problem of facial analysis in videos.
no code implementations • CVPR 2016 • Yongqin Xian, Zeynep Akata, Gaurav Sharma, Quynh Nguyen, Matthias Hein, Bernt Schiele
We train the model with a ranking based objective function which penalizes incorrect rankings of the true class for a given image.
no code implementations • 2 Oct 2015 • Gaurav Sharma, Frederic Jurie
We propose a new image representation for texture categorization and facial analysis, relying on the use of higher-order local differential statistics as features.
no code implementations • ICCV 2015 • Gaurav Sharma, Bernt Schiele
We propose a novel algorithm for the task of supervised discriminative distance learning by nonlinearly embedding vectors into a low dimensional Euclidean space.
no code implementations • 14 Sep 2015 • Gaurav Sharma, Frederic Jurie, Cordelia Schmid
We validate our method on three recent challenging datasets of human attributes and actions.
no code implementations • CVPR 2013 • Gaurav Sharma, Frederic Jurie, Cordelia Schmid
We propose a new model for recognizing human attributes (e. g. wearing a suit, sitting, short hair) and actions (e. g. running, riding a horse) in still images.