no code implementations • 28 Mar 2023 • Kamal Gupta, Varun Jampani, Carlos Esteves, Abhinav Shrivastava, Ameesh Makadia, Noah Snavely, Abhishek Kar
We present a self-supervised technique that directly optimizes on a sparse collection of images of a particular object/object category to obtain consistent dense correspondences across the collection.
no code implementations • 25 Mar 2023 • Vinoj Jayasundara, Amit Agrawal, Nicolas Heron, Abhinav Shrivastava, Larry S. Davis
We present FlexNeRF, a method for photorealistic freeviewpoint rendering of humans in motion from monocular videos.
no code implementations • 24 Mar 2023 • Bo He, Xitong Yang, Hanyu Wang, Zuxuan Wu, Hao Chen, Shuaiyi Huang, Yixuan Ren, Ser-Nam Lim, Abhinav Shrivastava
Implicit neural representations (INR) have gained increasing attention in representing 3D scenes and images, and have been recently applied to encode videos (e. g., NeRV, E-NeRV).
1 code implementation • 13 Mar 2023 • Bo He, Jun Wang, JieLin Qiu, Trung Bui, Abhinav Shrivastava, Zhaowen Wang
The goal of multimodal summarization is to extract the most important information from different modalities to form summaries.
Ranked #3 on
Supervised Video Summarization
on SumMe
Extractive Text Summarization
Supervised Video Summarization
1 code implementation • 15 Feb 2023 • Fuxiao Liu, Yaser Yacoob, Abhinav Shrivastava
We introduce a new benchmark, COVID-VTS, for fact-checking multi-modal information involving short-duration videos with COVID19- focused information from both the real world and machine generation.
no code implementations • 30 Dec 2022 • Shishira R Maiya, Sharath Girish, Max Ehrlich, Hanyu Wang, Kwot Sin Lee, Patrick Poirson, Pengxiang Wu, Chen Wang, Abhinav Shrivastava
This design shares computation within each group, in the spatial and temporal dimensions, resulting in reduced encoding time of the video.
1 code implementation • 7 Dec 2022 • Matthew Walmer, Saksham Suri, Kamal Gupta, Abhinav Shrivastava
We compare ViTs trained through different methods of supervision, and show that they learn a diverse range of behaviors in terms of their attention, representations, and downstream performance.
no code implementations • 18 Nov 2022 • Hao Chen, Matt Gwilliam, Bo He, Ser-Nam Lim, Abhinav Shrivastava
We match the performance of NeRV, a state-of-the-art implicit neural representation, on the reconstruction task for frames seen during training while far surpassing for frames that are skipped during training (unseen images).
1 code implementation • 8 Nov 2022 • Yifei Zhou, Zilu Li, Abhinav Shrivastava, Hengshuang Zhao, Antonio Torralba, Taipeng Tian, Ser-Nam Lim
In this way, the new representation can be directly compared with the old representation, in principle avoiding the need for any backfilling.
no code implementations • 15 Aug 2022 • Shuaiyi Huang, Luyu Yang, Bo He, Songyang Zhang, Xuming He, Abhinav Shrivastava
In this paper, we aim to address the challenge of label sparsity in semantic correspondence by enriching supervision signals from sparse keypoint annotations.
1 code implementation • CVPR 2022 • Matthew Gwilliam, Abhinav Shrivastava
In this paper, we compare methods using performance-based benchmarks such as linear evaluation, nearest neighbor classification, and clustering for several different datasets, demonstrating the lack of a clear front-runner within the current state-of-the-art.
1 code implementation • CVPR 2022 • Nirat Saini, Khoi Pham, Abhinav Shrivastava
We use visual decomposed features to hallucinate embeddings that are representative for the seen and novel compositions to better regularize the learning of our model.
no code implementations • 18 Apr 2022 • Hanyu Wang, Kamal Gupta, Larry Davis, Abhinav Shrivastava
We present Neural Space-filling Curves (SFCs), a data-driven approach to infer a context-based scan order for a set of images.
1 code implementation • 6 Apr 2022 • Sharath Girish, Kamal Gupta, Saurabh Singh, Abhinav Shrivastava
We introduce LilNetX, an end-to-end trainable technique for neural networks that enables learning models with specified accuracy-rate-computation trade-off.
1 code implementation • CVPR 2022 • Bo He, Xitong Yang, Le Kang, Zhiyu Cheng, Xin Zhou, Abhinav Shrivastava
Without the boundary information of action segments, existing methods mostly rely on multiple instance learning (MIL), where the predictions of unlabeled instances (i. e., video snippets) are supervised by classifying labeled bags (i. e., untrimmed videos).
no code implementations • CVPR 2022 • Junke Wang, Zuxuan Wu, Jingjing Chen, Xintong Han, Abhinav Shrivastava, Ser-Nam Lim, Yu-Gang Jiang
Recent advances in image editing techniques have posed serious challenges to the trustworthiness of multimedia data, which drives the research of image tampering detection.
no code implementations • 15 Mar 2022 • Sharath Girish, Debadeepta Dey, Neel Joshi, Vibhav Vineet, Shital Shah, Caio Cesar Teodoro Mendes, Abhinav Shrivastava, Yale Song
We conduct a large-scale study with over 100 variants of ResNet and MobileNet architectures and evaluate them across 11 downstream scenarios in the SSL setting.
no code implementations • 31 Jan 2022 • Max Ehrlich, Jon Barker, Namitha Padmanabhan, Larry Davis, Andrew Tao, Bryan Catanzaro, Abhinav Shrivastava
Video compression is a central feature of the modern internet powering technologies from social media to video conferencing.
no code implementations • 12 Jan 2022 • Sai Saketh Rambhatla, Saksham Suri, Rama Chellappa, Abhinav Shrivastava
Our algorithm then processes the labeled and un-labeled foreground regions differently, a common practice in semi-supervised methods.
1 code implementation • CVPR 2022 • Matthew Walmer, Karan Sikka, Indranil Sur, Abhinav Shrivastava, Susmit Jha
This is challenging for the attacker as the detector can distort or ignore the visual trigger entirely, which leads to models where backdoors are over-reliant on the language trigger.
no code implementations • 8 Dec 2021 • Luyu Yang, Mingfei Gao, Zeyuan Chen, ran Xu, Abhinav Shrivastava, Chetan Ramaiah
In the context of online privacy, many methods propose complex privacy and security preserving measures to protect sensitive data.
1 code implementation • NeurIPS 2021 • Kamal Gupta, Gowthami Somepalli, Anubhav Gupta, Vinoj Jayasundara, Matthias Zwicker, Abhinav Shrivastava
We study a referential game (a type of signaling game) where two agents communicate with each other via a discrete bottleneck to achieve a common goal.
no code implementations • 26 Oct 2021 • Hao Chen, Abhinav Shrivastava
Incorporating relational reasoning in neural networks for object recognition remains an open problem.
1 code implementation • NeurIPS 2021 • Hao Chen, Bo He, Hanyu Wang, Yixuan Ren, Ser-Nam Lim, Abhinav Shrivastava
In contrast, with NeRV, we can use any neural network compression method as a proxy for video compression, and achieve comparable performance to traditional frame-based video compression approaches (H. 264, HEVC \etc).
no code implementations • 26 Oct 2021 • Shishira R Maiya, Max Ehrlich, Vatsal Agarwal, Ser-Nam Lim, Tom Goldstein, Abhinav Shrivastava
Our analysis shows that adversarial examples are neither in high-frequency nor in low-frequency components, but are simply dataset dependent.
1 code implementation • ICLR 2021 • Gaurav Shrivastava, Abhinav Shrivastava
Our approach, Diverse Video Generator, uses a Gaussian Process (GP) to learn priors on future states given the past and maintains a probability distribution over possible futures given a particular sample.
Ranked #1 on
Video Generation
on KTH
no code implementations • CVPR 2021 • Navaneeth Bodla, Gaurav Shrivastava, Rama Chellappa, Abhinav Shrivastava
Our work builds on hierarchical video prediction models, which disentangle the video generation process into two stages: predicting a high-level representation, such as pose sequence, and then learning a pose-to-pixels translation model for pixel generation.
no code implementations • CVPR 2021 • Pallabi Ghosh, Nirat Saini, Larry S. Davis, Abhinav Shrivastava
The standard paradigm is to utilize relationships in the input graph to transfer information using GCNs from training to testing nodes in the graph; for example, the semi-supervised, zero-shot, and few-shot learning setups.
no code implementations • CVPR 2021 • Khoi Pham, Kushal Kafle, Zhe Lin, Zhihong Ding, Scott Cohen, Quan Tran, Abhinav Shrivastava
In this paper, we introduce a large-scale in-the-wild visual attribute prediction dataset consisting of over 927K attribute annotations for over 260K object instances.
no code implementations • 1 Jun 2021 • Hengduo Li, Zuxuan Wu, Abhinav Shrivastava, Larry S. Davis
In this paper, we introduce certainty-aware pseudo labels tailored for object detection, which can effectively estimate the classification and localization quality of derived pseudo labels.
Ranked #5 on
Semi-Supervised Object Detection
on COCO 100% labeled data
(using extra training data)
no code implementations • ICCV 2021 • Sharath Girish, Saksham Suri, Saketh Rambhatla, Abhinav Shrivastava
Through extensive experiments, we show that our algorithm discovers unseen GANs with high accuracy and also generalizes to GANs trained on unseen real datasets.
no code implementations • ICCV 2021 • Sai Saketh Rambhatla, Rama Chellappa, Abhinav Shrivastava
We tackle object category discovery, which is the problem of discovering and localizing novel objects in a large unlabeled dataset.
no code implementations • ICCV 2021 • Moustafa Meshry, Saksham Suri, Larry S. Davis, Abhinav Shrivastava
In contrast, we propose to factorize the representation of a subject into its spatial and style components.
no code implementations • CVPR 2021 • Moustafa Meshry, Yixuan Ren, Larry S Davis, Abhinav Shrivastava
Specifically, we pre-train a generic style encoder using a novel proxy task to learn an embedding of images, from arbitrary domains, into a low-dimensional style latent space.
1 code implementation • CVPR 2021 • Ahmed Taha, Abhinav Shrivastava, Larry Davis
We evaluate KE using relatively small datasets (e. g., CUB-200) and randomly initialized deep networks.
1 code implementation • 4 Mar 2021 • Ahmed Taha, Alex Hanson, Abhinav Shrivastava, Larry Davis
The SVMax regularizer supports both supervised and unsupervised learning.
no code implementations • 26 Jan 2021 • Peng Zhou, Ning Yu, Zuxuan Wu, Larry S. Davis, Abhinav Shrivastava, Ser-Nam Lim
This paper studies video inpainting detection, which localizes an inpainted region in a video both spatially and temporally.
no code implementations • 1 Jan 2021 • Gaurav Shrivastava, Harsh Shrivastava, Abhinav Shrivastava
But, what if for an input point '$\bar{\mathbf{x}}$', we want to constrain the GP to avoid a target regression value '$\bar{y}(\bar{\mathbf{x}})$' (a negative datapair)?
no code implementations • 1 Jan 2021 • Kamal Gupta, Vijay Mahadevan, Alessandro Achille, Justin Lazarow, Larry S. Davis, Abhinav Shrivastava
We address the problem of scene layout generation for diverse domains such as images, mobile applications, documents and 3D objects.
no code implementations • CVPR 2021 • Hengduo Li, Zuxuan Wu, Abhinav Shrivastava, Larry S. Davis
Then, only frames and convolutions that are selected by the selection network are used in the 3D model to generate predictions.
Ranked #10 on
Action Recognition
on ActivityNet
no code implementations • 15 Dec 2020 • Bo He, Xitong Yang, Zuxuan Wu, Hao Chen, Ser-Nam Lim, Abhinav Shrivastava
To this end, we introduce Global Temporal Attention (GTA), which performs global temporal attention on top of spatial attention in a decoupled manner.
1 code implementation • CVPR 2021 • Sharath Girish, Shishira R. Maiya, Kamal Gupta, Hao Chen, Larry Davis, Abhinav Shrivastava
The recently proposed Lottery Ticket Hypothesis (LTH) states that deep neural networks trained on large datasets contain smaller subnetworks that achieve on par performance as the dense networks.
no code implementations • 17 Nov 2020 • Max Ehrlich, Larry Davis, Ser-Nam Lim, Abhinav Shrivastava
We show that there is a significant penalty on common performance metrics for high compression.
1 code implementation • 3 Nov 2020 • Shlok Mishra, Anshul Shah, Ankan Bansal, Janit Anjaria, Jonghyun Choi, Abhinav Shrivastava, Abhishek Sharma, David Jacobs
Recent literature has shown that features obtained from supervised training of CNNs may over-emphasize texture rather than encoding high-level information.
Ranked #18 on
Object Detection
on PASCAL VOC 2007
1 code implementation • 16 Oct 2020 • Anshul Shah, Shlok Mishra, Ankan Bansal, Jun-Cheng Chen, Rama Chellappa, Abhinav Shrivastava
Unlike other modalities, constellation of joints and their motion generate models with succinct human motion information for activity recognition.
Ranked #1 on
Action Recognition
on Mimetics
1 code implementation • 7 Sep 2020 • Kamal Gupta, Susmija Jabbireddy, Ketul Shah, Abhinav Shrivastava, Matthias Zwicker
Our simple encoder-decoder framework, comprised of a novel identity encoder and class-conditional viewpoint generator, generates 3D consistent depth maps.
no code implementations • 28 Aug 2020 • Pallabi Ghosh, Nirat Saini, Larry S. Davis, Abhinav Shrivastava
Current action recognition systems require large amounts of training data for recognizing an action.
Ranked #12 on
Zero-Shot Action Recognition
on Kinetics
1 code implementation • ICCV 2021 • Luyu Yang, Yan Wang, Mingfei Gao, Abhinav Shrivastava, Kilian Q. Weinberger, Wei-Lun Chao, Ser-Nam Lim
To integrate the strengths of the two classifiers, we apply the well-established co-training framework, in which the two classifiers exchange their high confident predictions to iteratively "teach each other" so that both classifiers can excel in the target domain.
1 code implementation • 23 Jul 2020 • Saurabh Singh, Sami Abu-El-Haija, Nick Johnston, Johannes Ballé, Abhinav Shrivastava, George Toderici
We propose a learned method that jointly optimizes for compressibility along with the task objective for learning the features.
2 code implementations • ECCV 2020 • Ahmed Taha, Xitong Yang, Abhinav Shrivastava, Larry Davis
Compared to classification networks, attention visualization for retrieval networks is hardly studied.
no code implementations • ECCV 2020 • Luyu Yang, Yogesh Balaji, Ser-Nam Lim, Abhinav Shrivastava
In this paper, we proposed an adversarial agent that learns a dynamic curriculum for source samples, called Curriculum Manager for Source Selection (CMSS).
Multi-Source Unsupervised Domain Adaptation
Unsupervised Domain Adaptation
1 code implementation • 1 Jul 2020 • Hao Chen, Abhinav Shrivastava
Owing to group convolution and the shared-base, GENet can fully leverage the advantage of explicit ensemble learning while retaining the same computation as a single ConvNet.
2 code implementations • ICCV 2021 • Kamal Gupta, Justin Lazarow, Alessandro Achille, Larry Davis, Vijay Mahadevan, Abhinav Shrivastava
Generating a new layout or extending an existing layout requires understanding the relationships between these primitives.
1 code implementation • ECCV 2020 • Max Ehrlich, Larry Davis, Ser-Nam Lim, Abhinav Shrivastava
The JPEG image compression algorithm is the most popular method of image compression because of its ability for large compression ratios.
no code implementations • 9 Apr 2020 • Ankan Bansal, Sai Saketh Rambhatla, Abhinav Shrivastava, Rama Chellappa
The proposed method consists of a layout module which primes a visual module to predict the type of interaction between a human and an object.
1 code implementation • CVPR 2020 • Kamal Gupta, Saurabh Singh, Abhinav Shrivastava
Unsupervised representation learning holds the promise of exploiting large amounts of unlabeled data to learn general representations.
no code implementations • 28 Feb 2020 • Kyungjun Lee, Abhinav Shrivastava, Hernisa Kacorri
Egocentric vision holds great promises for increasing access to visual information and improving the quality of life for people with visual impairments, with object recognition being one of the daily challenges for this population.
no code implementations • 21 Jan 2020 • Pallabi Ghosh, Vibhav Vineet, Larry S. Davis, Abhinav Shrivastava, Sudipta Sinha, Neel Joshi
Given color images and noisy and incomplete target depth maps, we optimize a randomly-initialized CNN model to reconstruct a depth map restored by virtue of using the CNN network structure as a prior combined with a view-constrained photo-consistency loss.
no code implementations • 25 Sep 2019 • Moustafa Meshry, Yixuan Ren, Ricardo Martin-Brualla, Larry Davis, Abhinav Shrivastava
Then we train a generator to transform an input image along with a style-code to the output domain.
no code implementations • ICLR 2020 • Deniz Oktay, Johannes Ballé, Saurabh Singh, Abhinav Shrivastava
We describe a simple and general neural network weight compression approach, in which the network parameters (weights and biases) are represented in a "latent" space, amounting to a reparameterization.
no code implementations • 17 Apr 2019 • Tao Hu, Zhizhong Han, Abhinav Shrivastava, Matthias Zwicker
Different from image-to-image translation network that completes each view separately, our novel network, multi-view completion net (MVCN), leverages information from all views of a 3D shape to help the completion of each single view.
no code implementations • ICCV 2019 • Saurabh Singh, Abhinav Shrivastava
Batch normalization (BN) has been very effective for deep learning and is widely used.
no code implementations • CVPR 2019 • Chen Sun, Abhinav Shrivastava, Carl Vondrick, Rahul Sukthankar, Kevin Murphy, Cordelia Schmid
This paper focuses on multi-person action forecasting in videos.
no code implementations • WS 2019 • Peratham Wiriyathammabhum, Abhinav Shrivastava, Vlad I. Morariu, Larry S. Davis
This paper presents a new task, the grounding of spatio-temporal identifying descriptions in videos.
no code implementations • 5 Apr 2019 • Ankan Bansal, Sai Saketh Rambhatla, Abhinav Shrivastava, Rama Chellappa
We present an approach for detecting human-object interactions (HOIs) in images, based on the idea that humans interact with functionally similar objects in a similar manner.
no code implementations • 7 Feb 2019 • Ahmed Taha, Yi-Ting Chen, Teruhisa Misu, Abhinav Shrivastava, Larry Davis
We introduce an unsupervised formulation to estimate heteroscedastic uncertainty in retrieval systems.
1 code implementation • 24 Jan 2019 • Ahmed Taha, Yi-Ting Chen, Teruhisa Misu, Abhinav Shrivastava, Larry Davis
We employ triplet loss as a feature embedding regularizer to boost classification performance.
1 code implementation • 24 Nov 2018 • Peng Zhou, Bor-Chun Chen, Xintong Han, Mahyar Najibi, Abhinav Shrivastava, Ser Nam Lim, Larry S. Davis
The advent of image sharing platforms and the easy availability of advanced photo editing software have resulted in a large quantities of manipulated images being shared on the internet.
1 code implementation • ECCV 2018 • Chen Sun, Abhinav Shrivastava, Carl Vondrick, Kevin Murphy, Rahul Sukthankar, Cordelia Schmid
A visualization of the learned relation features confirms that our approach is able to attend to the relevant relations for each action.
Ranked #14 on
Action Recognition
on AVA v2.1
1 code implementation • ECCV 2018 • Carl Vondrick, Abhinav Shrivastava, Alireza Fathi, Sergio Guadarrama, Kevin Murphy
We use large amounts of unlabeled video to learn models for visual tracking without manual human supervision.
2 code implementations • ICCV 2017 • Chen Sun, Abhinav Shrivastava, Saurabh Singh, Abhinav Gupta
What will happen if we increase the dataset size by 10x or 100x?
Ranked #2 on
Semantic Segmentation
on PASCAL VOC 2007
4 code implementations • CVPR 2017 • Xiaolong Wang, Abhinav Shrivastava, Abhinav Gupta
We propose to learn an adversarial network that generates examples with occlusions and deformations.
Ranked #19 on
Object Detection
on PASCAL VOC 2007
1 code implementation • 20 Dec 2016 • Abhinav Shrivastava, Rahul Sukthankar, Jitendra Malik, Abhinav Gupta
But most of these fine details are lost in the early convolutional layers.
Ranked #215 on
Object Detection
on COCO test-dev
1 code implementation • CVPR 2016 • Ishan Misra, Abhinav Shrivastava, Abhinav Gupta, Martial Hebert
In this paper, we propose a principled approach to learn shared representations in ConvNets using multi-task learning.
Ranked #68 on
Semantic Segmentation
on NYU Depth v2
5 code implementations • CVPR 2016 • Abhinav Shrivastava, Abhinav Gupta, Ross Girshick
Our motivation is the same as it has always been -- detection datasets contain an overwhelming number of easy examples and a small number of hard examples.
Ranked #6 on
Face Verification
on Trillion Pairs Dataset
no code implementations • CVPR 2015 • Ishan Misra, Abhinav Shrivastava, Martial Hebert
We present a semi-supervised approach that localizes multiple unknown object instances in long videos.
no code implementations • 21 May 2015 • Ishan Misra, Abhinav Shrivastava, Martial Hebert
We present a semi-supervised approach that localizes multiple unknown object instances in long videos.
no code implementations • 27 Apr 2015 • Aayush Bansal, Abhinav Shrivastava, Carl Doersch, Abhinav Gupta
Building on the success of recent discriminative mid-level elements, we propose a surprisingly simple approach for object detection which performs comparable to the current state-of-the-art approaches on PASCAL VOC comp-3 detection challenge (no external data).
no code implementations • CVPR 2014 • Xinlei Chen, Abhinav Shrivastava, Abhinav Gupta
In this paper, we propose to enrich these knowledge bases by automatically discovering objects and their segmentations from noisy Internet images.