CVPR 2018

AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions

CVPR 2018 tensorflow/models

The AVA dataset densely annotates 80 atomic visual actions in 430 15-minute video clips, where actions are localized in space and time, resulting in 1.58M action labels with multiple labels per person occurring frequently.

ACTION LOCALIZATION, ACTION RECOGNITION, VIDEO UNDERSTANDING

Revisiting Oxford and Paris: Large-Scale Image Retrieval Benchmarking

CVPR 2018 tensorflow/models

In particular, annotation errors, the size of the dataset, and the level of challenge are addressed: new annotation for both datasets is created with extra attention to the reliability of the ground truth.

IMAGE RETRIEVAL

MorphNet: Fast & Simple Resource-Constrained Structure Learning of Deep Networks

CVPR 2018 tensorflow/models

We present MorphNet, an approach to automate the design of neural network structures.

Learning to Segment Every Thing

CVPR 2018 facebookresearch/detectron

Most methods for object instance segmentation require all training examples to be labeled with segmentation masks.

INSTANCE SEGMENTATION, SEMANTIC SEGMENTATION

DensePose: Dense Human Pose Estimation In The Wild

CVPR 2018 facebookresearch/DensePose

In this work, we establish dense correspondences between an RGB image and a surface-based representation of the human body, a task we refer to as dense human pose estimation.

POSE ESTIMATION

StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation

CVPR 2018 yunjey/StarGAN

To address this limitation, we propose StarGAN, a novel and scalable approach that can perform image-to-image translations for multiple domains using only a single model.

IMAGE-TO-IMAGE TRANSLATION

High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs

CVPR 2018 NVIDIA/pix2pixHD

We present a new method for synthesizing high-resolution photo-realistic images from semantic label maps using conditional generative adversarial networks (conditional GANs).

CONDITIONAL IMAGE GENERATION, IMAGE-TO-IMAGE TRANSLATION, INSTANCE SEGMENTATION, SEMANTIC SEGMENTATION

MegDet: A Large Mini-Batch Object Detector

CVPR 2018 CSAILVision/semantic-segmentation-pytorch

The improvements in recent CNN-based object detection works, from R-CNN [11], Fast/Faster R-CNN [10, 31] to recent Mask R-CNN [14] and RetinaNet [24], mainly come from new network architectures, new frameworks, or novel loss designs.

OBJECT DETECTION

Camera Style Adaptation for Person Re-identification

CVPR 2018 layumi/Person_reID_baseline_pytorch

With LSR, we demonstrate consistent improvement in all systems regardless of the extent of over-fitting.

PERSON RE-IDENTIFICATION

Context Encoding for Semantic Segmentation

CVPR 2018 zhanghang1989/PyTorch-Encoding

In this paper, we explore the impact of global contextual information in semantic segmentation by introducing the Context Encoding Module, which captures the semantic context of scenes and selectively highlights class-dependent feature maps.

IMAGE CLASSIFICATION, SEMANTIC SEGMENTATION