Search Results for author: Gaurav Sharma

Found 33 papers, 6 papers with code

Shuffle and Attend: Video Domain Adaptation

no code implementations ECCV 2020 Jinwoo Choi, Gaurav Sharma, Samuel Schulter, Jia-Bin Huang

As the first novelty, we propose an attention mechanism which focuses on more discriminative clips and directly optimizes for video-level (cf.

Action Recognition Temporal Action Localization +1

OmniVec: Learning robust representations with cross modal sharing

no code implementations 7 Nov 2023 Siddharth Srivastava, Gaurav Sharma

We demonstrate empirically that using a joint network to train across modalities leads to meaningful information sharing, which allows us to achieve state-of-the-art results on most of the benchmarks.

3D Point Cloud Classification Action Classification +5

Sentence Bag Graph Formulation for Biomedical Distant Supervision Relation Extraction

1 code implementation 29 Oct 2023 Hao Zhang, Yang Liu, Xiaoyan Liu, Tianming Liang, Gaurav Sharma, Liang Xue, Maozu Guo

We introduce a novel graph-based framework for alleviating key challenges in distantly-supervised relation extraction and demonstrate its effectiveness in the challenging and important domain of biomedical data.

Relation Relation Extraction +1

Texture Representation via Analysis and Synthesis with Generative Adversarial Networks

no code implementations 20 Dec 2022 Jue Lin, Gaurav Sharma, Thrasyvoulos N. Pappas

We investigate data-driven texture modeling via analysis and synthesis with generative adversarial networks.

Texture Classification

K-12BERT: BERT for K-12 education

1 code implementation 24 May 2022 Vasu Goel, Dhruv Sahnan, Venktesh V, Gaurav Sharma, Deep Dwivedi, Mukesh Mohania

However, to the best of our knowledge, there has not been a model specifically adapted for the education domain (particularly K-12) across subjects.

Language Modelling

Towards Universal Texture Synthesis by Combining Texton Broadcasting with Noise Injection in StyleGAN-2

1 code implementation 8 Mar 2022 Jue Lin, Gaurav Sharma, Thrasyvoulos N. Pappas

We present a new approach for universal texture synthesis by incorporating a multi-scale texton broadcasting module in the StyleGAN-2 framework.

Inductive Bias Texture Synthesis

Beyond Mono to Binaural: Generating Binaural Audio from Mono Audio with Depth and Cross Modal Attention

no code implementations 15 Nov 2021 Kranti Kumar Parida, Siddharth Srivastava, Gaurav Sharma

In this work, we argue that a depth map of the scene can act as a proxy for inducing distance information of different objects in the scene, for the task of audio binauralization.

Depth Infused Binaural Audio Generation using Hierarchical Cross-Modal Attention

no code implementations 10 Aug 2021 Kranti Kumar Parida, Siddharth Srivastava, Neeraj Matiyali, Gaurav Sharma

Binaural audio gives the listener the feeling of being in the recording place and enhances the immersive experience if coupled with AR/VR.

Audio Generation

Distantly-Supervised Long-Tailed Relation Extraction Using Constraint Graphs

1 code implementation 24 May 2021 Tianming Liang, Yang Liu, Xiaoyan Liu, Hao Zhang, Gaurav Sharma, Maozu Guo

On top of that, we further propose a novel constraint graph-based relation extraction framework (CGRE) to handle the two challenges simultaneously.

Denoising Relation +2

Discriminative Semantic Transitive Consistency for Cross-Modal Learning

no code implementations 25 Mar 2021 Kranti Kumar Parida, Gaurav Sharma

Cross-modal retrieval is generally performed by projecting and aligning the data from two different modalities onto a shared representation space.

Cross-Modal Retrieval Retrieval

Beyond Image to Depth: Improving Depth Prediction using Echoes

1 code implementation CVPR 2021 Kranti Kumar Parida, Siddharth Srivastava, Gaurav Sharma

We propose a novel multi-modal fusion technique, which incorporates the material properties explicitly while combining audio (echoes) and visual modalities to predict the scene depth.

Depth Estimation Depth Prediction

Object Detection with a Unified Label Space from Multiple Datasets

no code implementations ECCV 2020 Xiangyun Zhao, Samuel Schulter, Gaurav Sharma, Yi-Hsuan Tsai, Manmohan Chandraker, Ying Wu

To address this challenge, we design a framework which works with such partial annotations, and we exploit a pseudo labeling approach that we adapt for our specific case.

Object object-detection +1

Video Person Re-Identification using Learned Clip Similarity Aggregation

no code implementations 17 Oct 2019 Neeraj Matiyali, Gaurav Sharma

We show that using a learned clip similarity aggregation function allows filtering out hard clip pairs, e.g., where the person is not clearly visible, is in a challenging pose, or where the poses in the two clips are too different to be informative.

Optical Flow Estimation Video-Based Person Re-Identification

A Novel Deep Learning Pipeline for Retinal Vessel Detection in Fluorescein Angiography

no code implementations 5 Jul 2019 Li Ding, Mohammad H. Bawany, Ajay E. Kuriyan, Rajeev S. Ramchandran, Charles C. Wykoff, Gaurav Sharma

We propose a novel pipeline to detect retinal vessels in FA images using deep neural networks that reduces the effort required for generating labeled ground truth data by combining two key components: cross-modality transfer and human-in-the-loop learning.

Vessel Detection

Zero-Shot Object Detection

no code implementations ECCV 2018 Ankan Bansal, Karan Sikka, Gaurav Sharma, Rama Chellappa, Ajay Divakaran

We introduce and tackle the problem of zero-shot object detection (ZSD), which aims to detect object classes which are not observed during training.

Object object-detection +2

Unsupervised Learning of Face Representations

1 code implementation 3 Mar 2018 Samyak Datta, Gaurav Sharma, C. V. Jawahar

Although faces extracted from videos have a lower spatial resolution than those available in standard supervised face datasets such as LFW and CASIA-WebFace, the former represent a much more realistic setting, e.g., in surveillance scenarios where most of the faces detected are very small.

A Generative Model for Dynamic Networks with Applications

no code implementations 11 Feb 2018 Shubham Gupta, Gaurav Sharma, Ambedkar Dukkipati

Networks observed in the real world, such as social networks and collaboration networks, exhibit temporal dynamics, i.e., nodes and edges appear and/or disappear over time.

Community Detection Link Prediction

Vehicle Tracking in Wide Area Motion Imagery via Stochastic Progressive Association Across Multiple Frames (SPAAM)

no code implementations 18 Sep 2017 Ahmed Elliethy, Gaurav Sharma

The stochastic dis-association at each iteration maintains each estimated association according to an estimated probability for confidence, obtained via a probabilistic model.

An Empirical Evaluation of Visual Question Answering for Novel Objects

no code implementations CVPR 2017 Santhosh K. Ramakrishnan, Ambar Pal, Gaurav Sharma, Anurag Mittal

We study the problem of answering questions about images in the harder setting, where the test questions and corresponding images contain novel objects, which were not queried about in the training data.

Question Answering Visual Question Answering

Large Scale Novel Object Discovery in 3D

no code implementations 22 Jan 2017 Siddharth Srivastava, Gaurav Sharma, Brejesh Lall

We test on unknown objects, which were not seen during training, and perform clustering in the learned embedding space of supervoxels to effectively perform novel object discovery.

Clustering Object +1

Deep fusion of visual signatures for client-server facial analysis

no code implementations 1 Nov 2016 Binod Bhattarai, Gaurav Sharma, Frederic Jurie

The challenge addressed in this paper is to design a common universal representation such that a single merged signature is transmitted to the server, regardless of the type and number of features computed by the client, while still ensuring optimal performance.

Discriminatively Trained Latent Ordinal Model for Video Classification

no code implementations 8 Aug 2016 Karan Sikka, Gaurav Sharma

We study the problem of video classification for facial analysis and human action recognition.

Action Recognition Classification +5

CP-mtML: Coupled Projection multi-task Metric Learning for Large Scale Face Retrieval

no code implementations CVPR 2016 Binod Bhattarai, Gaurav Sharma, Frederic Jurie

The experiments clearly demonstrate the scalability and improved performance of the proposed method on the tasks of identity and age based face image retrieval compared to competitive existing methods, on the standard datasets and with the presence of a million distractor face images.

Face Image Retrieval Metric Learning +2

Latent Embeddings for Zero-shot Classification

no code implementations CVPR 2016 Yongqin Xian, Zeynep Akata, Gaurav Sharma, Quynh Nguyen, Matthias Hein, Bernt Schiele

We train the model with a ranking based objective function which penalizes incorrect rankings of the true class for a given image.

Classification General Classification +1

Local Higher-Order Statistics (LHS) describing images with statistics of local non-binarized pixel patterns

no code implementations 2 Oct 2015 Gaurav Sharma, Frederic Jurie

We propose a new image representation for texture categorization and facial analysis, relying on the use of higher-order local differential statistics as features.

Quantization

Scalable Nonlinear Embeddings for Semantic Category-based Image Retrieval

no code implementations ICCV 2015 Gaurav Sharma, Bernt Schiele

We propose a novel algorithm for the task of supervised discriminative distance learning by nonlinearly embedding vectors into a low dimensional Euclidean space.

Image Retrieval Metric Learning +1

Expanded Parts Model for Semantic Description of Humans in Still Images

no code implementations 14 Sep 2015 Gaurav Sharma, Frederic Jurie, Cordelia Schmid

We validate our method on three recent challenging datasets of human attributes and actions.

Expanded Parts Model for Human Attribute and Action Recognition in Still Images

no code implementations CVPR 2013 Gaurav Sharma, Frederic Jurie, Cordelia Schmid

We propose a new model for recognizing human attributes (e.g., wearing a suit, sitting, short hair) and actions (e.g., running, riding a horse) in still images.

Action Recognition In Still Images Attribute
