Search Results for author: Gaurav Sharma

Found 33 papers, 6 papers with code

Shuffle and Attend: Video Domain Adaptation

no code implementations • ECCV 2020 • Jinwoo Choi, Gaurav Sharma, Samuel Schulter, Jia-Bin Huang

As the first novelty, we propose an attention mechanism which focuses on more discriminative clips and directly optimizes for video-level (cf.

Ranked #3 on Unsupervised Domain Adaptation on UCF-HMDB

Action Recognition Temporal Action Localization +1

Paper
Add Code

OmniVec: Learning robust representations with cross modal sharing

no code implementations • 7 Nov 2023 • Siddharth Srivastava, Gaurav Sharma

We demonstrate empirically that, using a joint network to train across modalities leads to meaningful information sharing and this allows us to achieve state-of-the-art results on most of the benchmarks.

Ranked #1 on Image Classification on ImageNet

3D Point Cloud Classification Action Classification +5

Paper
Add Code

Sentence Bag Graph Formulation for Biomedical Distant Supervision Relation Extraction

1 code implementation • 29 Oct 2023 • Hao Zhang, Yang Liu, Xiaoyan Liu, Tianming Liang, Gaurav Sharma, Liang Xue, Maozu Guo

We introduce a novel graph-based framework for alleviating key challenges in distantly-supervised relation extraction and demonstrate its effectiveness in the challenging and important domain of biomedical data.

Relation Relation Extraction +1

Paper
Code

Texture Representation via Analysis and Synthesis with Generative Adversarial Networks

no code implementations • 20 Dec 2022 • Jue Lin, Gaurav Sharma, Thrasyvoulos N. Pappas

We investigate data-driven texture modeling via analysis and synthesis with generative adversarial networks.

Texture Classification

Paper
Add Code

K-12BERT: BERT for K-12 education

1 code implementation • 24 May 2022 • Vasu Goel, Dhruv Sahnan, Venktesh V, Gaurav Sharma, Deep Dwivedi, Mukesh Mohania

However, there has not been a model specifically adapted for the education domain (particularly K-12) across subjects to the best of our knowledge.

Language Modelling

Paper
Code

Towards Universal Texture Synthesis by Combining Texton Broadcasting with Noise Injection in StyleGAN-2

1 code implementation • 8 Mar 2022 • Jue Lin, Gaurav Sharma, Thrasyvoulos N. Pappas

We present a new approach for universal texture synthesis by incorporating a multi-scale texton broadcasting module in the StyleGAN-2 framework.

Inductive Bias Texture Synthesis

Paper
Code

Beyond Mono to Binaural: Generating Binaural Audio from Mono Audio with Depth and Cross Modal Attention

no code implementations • 15 Nov 2021 • Kranti Kumar Parida, Siddharth Srivastava, Gaurav Sharma

In this work, we argue that depth map of the scene can act as a proxy for inducing distance information of different objects in the scene, for the task of audio binauralization.

Paper
Add Code

Depth Infused Binaural Audio Generation using Hierarchical Cross-Modal Attention

no code implementations • 10 Aug 2021 • Kranti Kumar Parida, Siddharth Srivastava, Neeraj Matiyali, Gaurav Sharma

Binaural audio gives the listener the feeling of being in the recording place and enhances the immersive experience if coupled with AR/VR.

Audio Generation

Paper
Add Code

Distantly-Supervised Long-Tailed Relation Extraction Using Constraint Graphs

1 code implementation • 24 May 2021 • Tianming Liang, Yang Liu, Xiaoyan Liu, Hao Zhang, Gaurav Sharma, Maozu Guo

On top of that, we further propose a novel constraint graph-based relation extraction framework(CGRE) to handle the two challenges simultaneously.

Ranked #3 on Relationship Extraction (Distant Supervised) on New York Times Corpus

Denoising Relation +2

Paper
Code

Exploiting Local Geometry for Feature and Graph Construction for Better 3D Point Cloud Processing with Graph Neural Networks

no code implementations • 28 Mar 2021 • Siddharth Srivastava, Gaurav Sharma

As a second contribution, we propose to improve the graph construction for GNNs for 3D point clouds.

Ranked #1 on 3D Part Segmentation on ShapeNet-Part

3D Classification 3D Instance Segmentation +4

Paper
Add Code

Discriminative Semantic Transitive Consistency for Cross-Modal Learning

no code implementations • 25 Mar 2021 • Kranti Kumar Parida, Gaurav Sharma

Cross-modal retrieval is generally performed by projecting and aligning the data from two different modalities onto a shared representation space.

Cross-Modal Retrieval Retrieval

Paper
Add Code

Beyond Image to Depth: Improving Depth Prediction using Echoes

1 code implementation • CVPR 2021 • Kranti Kumar Parida, Siddharth Srivastava, Gaurav Sharma

We propose a novel multi modal fusion technique, which incorporates the material properties explicitly while combining audio (echoes) and visual modalities to predict the scene depth.

Depth Estimation Depth Prediction

Paper
Code

Object Detection with a Unified Label Space from Multiple Datasets

no code implementations • ECCV 2020 • Xiangyun Zhao, Samuel Schulter, Gaurav Sharma, Yi-Hsuan Tsai, Manmohan Chandraker, Ying Wu

To address this challenge, we design a framework which works with such partial annotations, and we exploit a pseudo labeling approach that we adapt for our specific case.

Object object-detection +1

Paper
Add Code

Coordinated Joint Multimodal Embeddings for Generalized Audio-Visual Zeroshot Classification and Retrieval of Videos

no code implementations • 19 Oct 2019 • Kranti Kumar Parida, Neeraj Matiyali, Tanaya Guha, Gaurav Sharma

We present an audio-visual multimodal approach for the task of zeroshot learning (ZSL) for classification and retrieval of videos.

Ranked #5 on GZSL Video Classification on VGGSound-GZSL(main)

General Classification GZSL Video Classification +1

Paper
Add Code

Video Person Re-Identification using Learned Clip Similarity Aggregation

no code implementations • 17 Oct 2019 • Neeraj Matiyali, Gaurav Sharma

We show that using a learned clip similarity aggregation function allows filtering out hard clip pairs, e. g. where the person is not clearly visible, is in a challenging pose, or where the poses in the two clips are too different to be informative.

Optical Flow Estimation Video-Based Person Re-Identification

Paper
Add Code

A Novel Deep Learning Pipeline for Retinal Vessel Detection in Fluorescein Angiography

no code implementations • 5 Jul 2019 • Li Ding, Mohammad H. Bawany, Ajay E. Kuriyan, Rajeev S. Ramchandran, Charles C. Wykoff, Gaurav Sharma

We propose a novel pipeline to detect retinal vessels in FA images using deep neural networks that reduces the effort required for generating labeled ground truth data by combining two key components: cross-modality transfer and human-in-the-loop learning.

Vessel Detection

Paper
Add Code

Learning 2D to 3D Lifting for Object Detection in 3D for Autonomous Vehicles

no code implementations • 27 Mar 2019 • Siddharth Srivastava, Frederic Jurie, Gaurav Sharma

We address the problem of 3D object detection from 2D monocular images in autonomous driving scenarios.

3D Object Detection 3D Object Detection From Monocular Images +4

Paper
Add Code

Zero-Shot Object Detection

no code implementations • ECCV 2018 • Ankan Bansal, Karan Sikka, Gaurav Sharma, Rama Chellappa, Ajay Divakaran

We introduce and tackle the problem of zero-shot object detection (ZSD), which aims to detect object classes which are not observed during training.

Object object-detection +2

Paper
Add Code

Unsupervised Learning of Face Representations

1 code implementation • 3 Mar 2018 • Samyak Datta, Gaurav Sharma, C. V. Jawahar

Although faces extracted from videos have a lower spatial resolution than those which are available as part of standard supervised face datasets such as LFW and CASIA-WebFace, the former represent a much more realistic setting, e. g. in surveillance scenarios where most of the faces detected are very small.

Paper
Code

A Generative Model for Dynamic Networks with Applications

no code implementations • 11 Feb 2018 • Shubham Gupta, Gaurav Sharma, Ambedkar Dukkipati

Networks observed in real world like social networks, collaboration networks etc., exhibit temporal dynamics, i. e. nodes and edges appear and/or disappear over time.

Community Detection Link Prediction

Paper
Add Code

Vehicle Tracking in Wide Area Motion Imagery via Stochastic Progressive Association Across Multiple Frames (SPAAM)

no code implementations • 18 Sep 2017 • Ahmed Elliethy, Gaurav Sharma

The stochastic dis-association at each iteration maintains each estimated association according to an estimated probability for confidence, obtained via a probabilistic model.

Paper
Add Code

An Empirical Evaluation of Visual Question Answering for Novel Objects

no code implementations • CVPR 2017 • Santhosh K. Ramakrishnan, Ambar Pal, Gaurav Sharma, Anurag Mittal

We study the problem of answering questions about images in the harder setting, where the test questions and corresponding images contain novel objects, which were not queried about in the training data.

Question Answering Visual Question Answering

Paper
Add Code

Large Scale Novel Object Discovery in 3D

no code implementations • 22 Jan 2017 • Siddharth Srivastava, Gaurav Sharma, Brejesh lall

We test on unknown objects, which were not seen during training, and perform clustering in the learned embedding space of supervoxels to effectively perform novel object discovery.

Clustering Object +1

Paper
Add Code

AdaScan: Adaptive Scan Pooling in Deep Convolutional Neural Networks for Human Action Recognition in Videos

no code implementations • CVPR 2017 • Amlan Kar, Nishant Rai, Karan Sikka, Gaurav Sharma

We propose a novel method for temporally pooling frames in a video for the task of human action recognition.

Action Recognition In Videos Optical Flow Estimation +1

Paper
Add Code

Deep fusion of visual signatures for client-server facial analysis

no code implementations • 1 Nov 2016 • Binod Bhattarai, Gaurav Sharma, Frederic Jurie

The challenge addressed in this paper is to design a common universal representation such that a single merged signature is transmitted to the server, whatever be the type and number of features computed by the client, ensuring nonetheless an optimal performance.

Paper
Add Code

Discriminatively Trained Latent Ordinal Model for Video Classification

no code implementations • 8 Aug 2016 • Karan Sikka, Gaurav Sharma

We study the problem of video classification for facial analysis and human action recognition.

Action Recognition Classification +5

Paper
Add Code

CP-mtML: Coupled Projection multi-task Metric Learning for Large Scale Face Retrieval

no code implementations • CVPR 2016 • Binod Bhattarai, Gaurav Sharma, Frederic Jurie

The experiments clearly demonstrate the scalability and improved performance of the proposed method on the tasks of identity and age based face image retrieval compared to competitive existing methods, on the standard datasets and with the presence of a million distractor face images.

Face Image Retrieval Metric Learning +2

Paper
Add Code

LOMo: Latent Ordinal Model for Facial Analysis in Videos

no code implementations • CVPR 2016 • Karan Sikka, Gaurav Sharma, Marian Bartlett

We study the problem of facial analysis in videos.

Multiple Instance Learning Weakly-supervised Learning

Paper
Add Code

Latent Embeddings for Zero-shot Classification

no code implementations • CVPR 2016 • Yongqin Xian, Zeynep Akata, Gaurav Sharma, Quynh Nguyen, Matthias Hein, Bernt Schiele

We train the model with a ranking based objective function which penalizes incorrect rankings of the true class for a given image.

Classification General Classification +1

Paper
Add Code

Local Higher-Order Statistics (LHS) describing images with statistics of local non-binarized pixel patterns

no code implementations • 2 Oct 2015 • Gaurav Sharma, Frederic Jurie

We propose a new image representation for texture categorization and facial analysis, relying on the use of higher-order local differential statistics as features.

Quantization

Paper
Add Code

Scalable Nonlinear Embeddings for Semantic Category-based Image Retrieval

no code implementations • ICCV 2015 • Gaurav Sharma, Bernt Schiele

We propose a novel algorithm for the task of supervised discriminative distance learning by nonlinearly embedding vectors into a low dimensional Euclidean space.

Image Retrieval Metric Learning +1

Paper
Add Code

Expanded Parts Model for Semantic Description of Humans in Still Images

no code implementations • 14 Sep 2015 • Gaurav Sharma, Frederic Jurie, Cordelia Schmid

We validate our method on three recent challenging datasets of human attributes and actions.

Paper
Add Code

Expanded Parts Model for Human Attribute and Action Recognition in Still Images

no code implementations • CVPR 2013 • Gaurav Sharma, Frederic Jurie, Cordelia Schmid

We propose a new model for recognizing human attributes (e. g. wearing a suit, sitting, short hair) and actions (e. g. running, riding a horse) in still images.

Action Recognition In Still Images Attribute

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.