Search Results for author: Larry Davis

Found 42 papers, 21 papers with code

Neural Space-filling Curves

no code implementations • 18 Apr 2022 • Hanyu Wang, Kamal Gupta, Larry Davis, Abhinav Shrivastava

We present Neural Space-filling Curves (SFCs), a data-driven approach to infer a context-based scan order for a set of images.

Image Compression

Leveraging Bitstream Metadata for Fast, Accurate, Generalized Compressed Video Quality Enhancement

no code implementations • 31 Jan 2022 • Max Ehrlich, Jon Barker, Namitha Padmanabhan, Larry Davis, Andrew Tao, Bryan Catanzaro, Abhinav Shrivastava

Video compression is a central feature of the modern internet powering technologies from social media to video conferencing.

Quantization Video Compression

Learning Realistic Human Reposing using Cyclic Self-Supervision with 3D Shape, Pose, and Appearance Consistency

no code implementations • ICCV 2021 • Soubhik Sanyal, Alex Vorobiov, Timo Bolkart, Matthew Loper, Betty Mohler, Larry Davis, Javier Romero, Michael J. Black

Synthesizing images of a person in novel poses from a single image is a highly ambiguous task.

More Than Just Attention: Improving Cross-Modal Attentions with Contrastive Constraints for Image-Text Matching

no code implementations • 20 May 2021 • Yuxiao Chen, Jianbo Yuan, Long Zhao, Tianlang Chen, Rui Luo, Larry Davis, Dimitris N. Metaxas

Cross-modal attention mechanisms have been widely applied to the image-text matching task and have achieved remarkable improvements thanks to their capability of learning fine-grained relevance across different modalities.

Contrastive Learning Image Captioning +4

Unsupervised Super-Resolution of Satellite Imagery for High Fidelity Material Label Transfer

no code implementations • 16 May 2021 • Arthita Ghosh, Max Ehrlich, Larry Davis, Rama Chellappa

Urban material recognition in remote sensing imagery is a highly relevant, yet extremely challenging problem due to the difficulty of obtaining human annotations, especially on low resolution satellite images.

Material Recognition Super-Resolution +1

VideoLT: Large-scale Long-tailed Video Recognition

1 code implementation • ICCV 2021 • Xing Zhang, Zuxuan Wu, Zejia Weng, Huazhu Fu, Jingjing Chen, Yu-Gang Jiang, Larry Davis

In this paper, we introduce VideoLT, a large-scale long-tailed video recognition dataset, as a step toward real-world video recognition.

Image Classification Video Recognition

M3DeTR: Multi-representation, Multi-scale, Mutual-relation 3D Object Detection with Transformers

1 code implementation • 24 Apr 2021 • Tianrui Guan, Jun Wang, Shiyi Lan, Rohan Chandra, Zuxuan Wu, Larry Davis, Dinesh Manocha

We present a novel architecture for 3D object detection, M3DeTR, which combines different point cloud representations (raw, voxels, bird-eye view) with different feature scales based on multi-scale feature pyramids.

Ranked #1 on 3D Object Detection on KITTI Cars Hard val

3D Object Detection object-detection +1

Beyond Short Clips: End-to-End Video-Level Learning with Collaborative Memories

no code implementations • CVPR 2021 • Xitong Yang, Haoqi Fan, Lorenzo Torresani, Larry Davis, Heng Wang

The standard way of training video models entails sampling at each iteration a single clip from a video and optimizing the clip prediction with respect to the video-level label.

Action Detection Action Recognition +1

Dual Contrastive Loss and Attention for GANs

1 code implementation • ICCV 2021 • Ning Yu, Guilin Liu, Aysegul Dundar, Andrew Tao, Bryan Catanzaro, Larry Davis, Mario Fritz

Lastly, we study different attention architectures in the discriminator, and propose a reference attention mechanism.

Image Generation Unconditional Image Generation

Knowledge Evolution in Neural Networks

1 code implementation • CVPR 2021 • Ahmed Taha, Abhinav Shrivastava, Larry Davis

We evaluate KE using relatively small datasets (e.g., CUB-200) and randomly initialized deep networks.

Metric Learning

SVMax: A Feature Embedding Regularizer

1 code implementation • 4 Mar 2021 • Ahmed Taha, Alex Hanson, Abhinav Shrivastava, Larry Davis

The SVMax regularizer supports both supervised and unsupervised learning.

Retrieval

Responsible Disclosure of Generative Models Using Scalable Fingerprinting

1 code implementation • ICLR 2022 • Ning Yu, Vladislav Skripniuk, Dingfan Chen, Larry Davis, Mario Fritz

Over the past years, deep generative models have achieved a new level of performance.

Misinformation

The Lottery Ticket Hypothesis for Object Recognition

1 code implementation • CVPR 2021 • Sharath Girish, Shishira R. Maiya, Kamal Gupta, Hao Chen, Larry Davis, Abhinav Shrivastava

The recently proposed Lottery Ticket Hypothesis (LTH) states that deep neural networks trained on large datasets contain smaller subnetworks that achieve performance on par with the dense networks.

Instance Segmentation Keypoint Estimation +5

Analyzing and Mitigating JPEG Compression Defects in Deep Learning

no code implementations • 17 Nov 2020 • Max Ehrlich, Larry Davis, Ser-Nam Lim, Abhinav Shrivastava

We show that there is a significant penalty on common performance metrics for high compression.

Hierarchical Contrastive Motion Learning for Video Action Recognition

no code implementations • 20 Jul 2020 • Xitong Yang, Xiaodong Yang, Sifei Liu, Deqing Sun, Larry Davis, Jan Kautz

Thus, the motion features at higher levels are trained to gradually capture semantic dynamics and become more discriminative for action recognition.

Action Recognition Contrastive Learning +2

A Generic Visualization Approach for Convolutional Neural Networks

2 code implementations • ECCV 2020 • Ahmed Taha, Xitong Yang, Abhinav Shrivastava, Larry Davis

Compared to classification networks, attention visualization for retrieval networks is hardly studied.

Classification General Classification +2

ASAP-NMS: Accelerating Non-Maximum Suppression Using Spatially Aware Priors

1 code implementation • 19 Jul 2020 • Rohun Tripathi, Vasu Singla, Mahyar Najibi, Bharat Singh, Abhishek Sharma, Larry Davis

The widely adopted sequential variant of Non Maximum Suppression (or Greedy-NMS) is a crucial module for object-detection pipelines.

object-detection Object Detection +1

LayoutTransformer: Layout Generation and Completion with Self-attention

2 code implementations • ICCV 2021 • Kamal Gupta, Justin Lazarow, Alessandro Achille, Larry Davis, Vijay Mahadevan, Abhinav Shrivastava

Generating a new layout or extending an existing layout requires understanding the relationships between these primitives.

Quantization Guided JPEG Artifact Correction

1 code implementation • ECCV 2020 • Max Ehrlich, Larry Davis, Ser-Nam Lim, Abhinav Shrivastava

The JPEG image compression algorithm is the most popular method of image compression because of its ability to achieve large compression ratios.

Ranked #1 on JPEG Artifact Correction on ICB (Quality 20 Grayscale)

JPEG Artifact Correction Quantization

Inclusive GAN: Improving Data and Minority Coverage in Generative Models

1 code implementation • ECCV 2020 • Ning Yu, Ke Li, Peng Zhou, Jitendra Malik, Larry Davis, Mario Fritz

Generative Adversarial Networks (GANs) have brought about rapid progress towards generating photorealistic images.

WSLLN: Weakly Supervised Natural Language Localization Networks

no code implementations • IJCNLP 2019 • Mingfei Gao, Larry Davis, Richard Socher, Caiming Xiong

We propose weakly supervised language localization networks (WSLLN) to detect events in long, untrimmed videos given language queries.

Sentence

Making an Invisibility Cloak: Real World Adversarial Attacks on Object Detectors

2 code implementations • ECCV 2020 • Zuxuan Wu, Ser-Nam Lim, Larry Davis, Tom Goldstein

We present a systematic study of adversarial attacks on state-of-the-art object detection frameworks.

Object object-detection +1

A weakly supervised adaptive triplet loss for deep metric learning

no code implementations • 27 Sep 2019 • Xiaonan Zhao, Huan Qi, Rui Luo, Larry Davis

We address the problem of distance metric learning in visual similarity search, defined as learning an image embedding model which projects images into Euclidean space where semantically and visually similar images are closer and dissimilar images are further from one another.

Metric Learning Retrieval +2

Style-based Encoder Pre-training for Multi-modal Image Synthesis

no code implementations • 25 Sep 2019 • Moustafa Meshry, Yixuan Ren, Ricardo Martin-Brualla, Larry Davis, Abhinav Shrivastava

Then we train a generator to transform an input image along with a style-code to the output domain.

Image Generation Translation

STEP: Spatio-Temporal Progressive Learning for Video Action Detection

1 code implementation • CVPR 2019 • Xitong Yang, Xiaodong Yang, Ming-Yu Liu, Fanyi Xiao, Larry Davis, Jan Kautz

In this paper, we propose the Spatio-TEmporal Progressive (STEP) action detector, a progressive learning framework for spatio-temporal action detection in videos.

Ranked #7 on Action Detection on UCF101-24

Action Detection Action Recognition

Unsupervised Data Uncertainty Learning in Visual Retrieval Systems

no code implementations • 7 Feb 2019 • Ahmed Taha, Yi-Ting Chen, Teruhisa Misu, Abhinav Shrivastava, Larry Davis

We introduce an unsupervised formulation to estimate heteroscedastic uncertainty in retrieval systems.

Retrieval Video Retrieval

Boosting Standard Classification Architectures Through a Ranking Regularizer

1 code implementation • 24 Jan 2019 • Ahmed Taha, Yi-Ting Chen, Teruhisa Misu, Abhinav Shrivastava, Larry Davis

We employ triplet loss as a feature embedding regularizer to boost classification performance.

Classification General Classification

Exploring Uncertainty in Conditional Multi-Modal Retrieval Systems

no code implementations • 23 Jan 2019 • Ahmed Taha, Yi-Ting Chen, Xitong Yang, Teruhisa Misu, Larry Davis

We cast visual retrieval as a regression problem by posing triplet loss as a regression loss.

Action Understanding Person Re-Identification +2

Deep Residual Learning in the JPEG Transform Domain

1 code implementation • ICCV 2019 • Max Ehrlich, Larry Davis

We introduce a general method of performing Residual Network inference and learning in the JPEG transform domain that allows the network to consume compressed images as input.

General Classification Image Classification

Attributing Fake Images to GANs: Learning and Analyzing GAN Fingerprints

2 code implementations • ICCV 2019 • Ning Yu, Larry Davis, Mario Fritz

Our experiments show that (1) GANs carry distinct model fingerprints and leave stable fingerprints in their generated images, which support image attribution; (2) even minor differences in GAN training can result in different fingerprints, which enables fine-grained model authentication; (3) fingerprints persist across different image frequencies and patches and are not biased by GAN artifacts; (4) fingerprint finetuning is effective in immunizing against five types of adversarial image perturbations; and (5) comparisons also show our learned fingerprints consistently outperform several baselines in a variety of setups.

Image Generation

Two Stream Self-Supervised Learning for Action Recognition

no code implementations • 16 Jun 2018 • Ahmed Taha, Moustafa Meshry, Xitong Yang, Yi-Ting Chen, Larry Davis

The effectiveness of the self-supervised pre-trained weights is validated on the action recognition task.

Action Recognition Representation Learning +3

Fused Deep Neural Networks for Efficient Pedestrian Detection

no code implementations • 2 May 2018 • Xianzhi Du, Mostafa El-Khamy, Vlad I. Morariu, Jungwon Lee, Larry Davis

The classification system further classifies the generated candidates based on opinions of multiple deep verification networks and a fusion network which utilizes a novel soft-rejection fusion method to adjust the confidence in the detection results.

Ensemble Learning General Classification +2

Learning to Color from Language

1 code implementation • NAACL 2018 • Varun Manjunatha, Mohit Iyyer, Jordan Boyd-Graber, Larry Davis

Automatic colorization is the process of adding color to greyscale images.

Colorization Descriptive

Deep Motion Boundary Detection

no code implementations • 13 Apr 2018 • Xiaoqing Yin, Xiyang Dai, Xinchao Wang, Maojun Zhang, DaCheng Tao, Larry Davis

In this paper, we propose the first dedicated end-to-end deep learning approach for motion boundary detection, which we term MoBoNet.

Boundary Detection Optical Flow Estimation

Class Subset Selection for Transfer Learning using Submodularity

no code implementations • 30 Mar 2018 • Varun Manjunatha, Srikumar Ramalingam, Tim K. Marks, Larry Davis

To accomplish this, we use a submodular set function to model the accuracy achievable on a new task when the features have been learned on a given subset of classes of the source dataset.

Image Classification Transfer Learning

Face-MagNet: Magnifying Feature Maps to Detect Small Faces

1 code implementation • 14 Mar 2018 • Pouya Samangouei, Mahyar Najibi, Larry Davis, Rama Chellappa

In this paper, we introduce the Face Magnifier Network (Face-MagNet), a face detector based on the Faster-RCNN framework which enables the flow of discriminative information of small-scale faces to the classifier without any skip or residual connections.

Face Detection Region Proposal

Boundary-sensitive Network for Portrait Segmentation

no code implementations • 22 Dec 2017 • Xianzhi Du, Xiaolong Wang, Dawei Li, Jingwen Zhu, Serafettin Tasci, Cameron Upright, Stephen Walsh, Larry Davis

Compared to the general semantic segmentation problem, portrait segmentation has a higher precision requirement in the boundary area.

Attribute Image Segmentation +3

SSH: Single Stage Headless Face Detector

6 code implementations • ICCV 2017 • Mahyar Najibi, Pouya Samangouei, Rama Chellappa, Larry Davis

Surprisingly, with a headless VGG-16, SSH beats the ResNet-101-based state-of-the-art on the WIDER dataset.

General Classification

The Amazing Mysteries of the Gutter: Drawing Inferences Between Panels in Comic Book Narratives

2 code implementations • CVPR 2017 • Mohit Iyyer, Varun Manjunatha, Anupam Guha, Yogarshi Vyas, Jordan Boyd-Graber, Hal Daumé III, Larry Davis

While computers can now describe what is explicitly depicted in natural images, in this paper we examine whether they can understand the closure-driven narratives conveyed by stylized artwork and dialogue in comic book panels.

Weakly Supervised Learning of Heterogeneous Concepts in Videos

no code implementations • 12 Jul 2016 • Sohil Shah, Kuldeep Kulkarni, Arijit Biswas, Ankit Gandhi, Om Deshmukh, Larry Davis

Typical textual descriptions that accompany online videos are 'weak': i.e., they mention the main concepts in the video but not their corresponding spatio-temporal locations.

General Classification Weakly-supervised Learning

Learning Discriminative Features via Label Consistent Neural Network

no code implementations • 3 Feb 2016 • Zhuolin Jiang, Yaming Wang, Larry Davis, Walt Andrews, Viktor Rozgic

Deep Convolutional Neural Networks (CNNs) enforce supervised information only at the output layer, and hidden layers are trained by backpropagating the prediction error from the output layer without explicit supervision.

General Classification

Learning Structured Ordinal Measures for Video based Face Recognition

no code implementations • 9 Jul 2015 • Ran He, Tieniu Tan, Larry Davis, Zhenan Sun

This paper presents a structured ordinal measure method for video-based face recognition that simultaneously learns ordinal filters and structured ordinal features.

Face Recognition
