Search Results for author: Jiaya Jia

Found 120 papers, 61 papers with code

CN: Channel Normalization For Point Cloud Recognition

no code implementations ECCV 2020 Zetong Yang, Yanan sun, Shu Liu, Xiaojuan Qi, Jiaya Jia

In 3D recognition, to fuse multi-scale structure information, existing methods apply hierarchical frameworks stacked by multiple fusion layers for integrating current relative locations with structure information from the previous level.

Memory Selection Network for Video Propagation

no code implementations ECCV 2020 Ruizheng Wu, Huaijia Lin, Xiaojuan Qi, Jiaya Jia

Video propagation is a fundamental problem in video processing where guidance frame predictions are propagated to guide predictions of the target frame.

Colorization Frame +4

Particularity beyond Commonality: Unpaired Identity Transfer with Multiple References

no code implementations ECCV 2020 Ruizheng Wu, Xin Tao, Ying-Cong Chen, Xiaoyong Shen, Jiaya Jia

Unpaired image-to-image translation aims to translate images from the source class to target one by providing sufficient data for these classes.

Image-to-Image Translation Translation

Video Frame Interpolation with Transformer

1 code implementation15 May 2022 Liying Lu, Ruizheng Wu, Huaijia Lin, Jiangbo Lu, Jiaya Jia

Video frame interpolation (VFI), which aims to synthesize intermediate frames of a video, has made remarkable progress with development of deep convolutional networks over past years.

Frame Video Frame Interpolation

Focal Sparse Convolutional Networks for 3D Object Detection

2 code implementations26 Apr 2022 Yukang Chen, Yanwei Li, Xiangyu Zhang, Jian Sun, Jiaya Jia

In this paper, we introduce two new modules to enhance the capability of Sparse CNNs, both are based on making feature sparsity learnable with position-wise importance prediction.

3D Object Detection

DSGN++: Exploiting Visual-Spatial Relation for Stereo-based 3D Detectors

1 code implementation6 Apr 2022 Yilun Chen, Shijia Huang, Shu Liu, Bei Yu, Jiaya Jia

Camera-based 3D object detectors are welcome due to their wider deployment and lower price than LiDAR sensors.

Region Rebalance for Long-Tailed Semantic Segmentation

2 code implementations5 Apr 2022 Jiequan Cui, Yuhui Yuan, Zhisheng Zhong, Zhuotao Tian, Han Hu, Stephen Lin, Jiaya Jia

In this paper, we study the problem of class imbalance in semantic segmentation.

Ranked #6 on Semantic Segmentation on ADE20K (using extra training data)

Semantic Segmentation

Multi-View Transformer for 3D Visual Grounding

1 code implementation5 Apr 2022 Shijia Huang, Yilun Chen, Jiaya Jia, LiWei Wang

The multi-view space enables the network to learn a more robust multi-modal representation for 3D visual grounding and eliminates the dependence on specific views.

Visual Grounding

MAT: Mask-Aware Transformer for Large Hole Image Inpainting

1 code implementation29 Mar 2022 Wenbo Li, Zhe Lin, Kun Zhou, Lu Qi, Yi Wang, Jiaya Jia

Recent studies have shown the importance of modeling long-range interactions in the inpainting problem.

Image Inpainting

Stratified Transformer for 3D Point Cloud Segmentation

2 code implementations28 Mar 2022 Xin Lai, Jianhui Liu, Li Jiang, LiWei Wang, Hengshuang Zhao, Shu Liu, Xiaojuan Qi, Jiaya Jia

In this paper, we propose Stratified Transformer that is able to capture long-range contexts and demonstrates strong generalization ability and high performance.

Point Cloud Segmentation Semantic Segmentation

SEA: Bridging the Gap Between One- and Two-stage Detector Distillation via SEmantic-aware Alignment

no code implementations2 Mar 2022 Yixin Chen, Zhuotao Tian, Pengguang Chen, Shu Liu, Jiaya Jia

We revisit the one- and two-stage detector distillation tasks and present a simple and efficient semantic-aware framework to fill the gap between them.

Instance Segmentation Object Detection +1

A Unified Query-based Paradigm for Point Cloud Understanding

1 code implementation2 Mar 2022 Zetong Yang, Li Jiang, Yanan sun, Bernt Schiele, Jiaya Jia

This is achieved by introducing an intermediate representation, i. e., Q-representation, in the querying stage to serve as a bridge between the embedding stage and task heads.

Autonomous Driving Object Detection +1

On Efficient Transformer-Based Image Pre-training for Low-Level Vision

1 code implementation19 Dec 2021 Wenbo Li, Xin Lu, Shengju Qian, Jiangbo Lu, Xiangyu Zhang, Jiaya Jia

Pre-training has marked numerous state of the arts in high-level computer vision, while few attempts have ever been made to investigate how pre-training acts in image processing systems.

Denoising Super-Resolution

CaSP: Class-agnostic Semi-Supervised Pretraining for Detection and Segmentation

1 code implementation9 Dec 2021 Lu Qi, Jason Kuen, Zhe Lin, Jiuxiang Gu, Fengyun Rao, Dian Li, Weidong Guo, Zhen Wen, Jiaya Jia

To this end, we propose a novel Class-agnostic Semi-supervised Pretraining (CaSP) framework to achieve a more favorable task-specificity balance in extracting training signals from unlabeled data.

Object Detection

High Quality Segmentation for Ultra High-resolution Images

1 code implementation29 Nov 2021 Tiancheng Shen, Yuechen Zhang, Lu Qi, Jason Kuen, Xingyu Xie, Jianlong Wu, Zhe Lin, Jiaya Jia

To segment 4K or 6K ultra high-resolution images needs extra computation consideration in image segmentation.

Semantic Segmentation

Blending Anti-Aliasing into Vision Transformer

no code implementations NeurIPS 2021 Shengju Qian, Hao Shao, Yi Zhu, Mu Li, Jiaya Jia

In this work, we analyze the uncharted problem of aliasing in vision transformer and explore to incorporate anti-aliasing properties.

Guided Point Contrastive Learning for Semi-supervised Point Cloud Semantic Segmentation

no code implementations ICCV 2021 Li Jiang, Shaoshuai Shi, Zhuotao Tian, Xin Lai, Shu Liu, Chi-Wing Fu, Jiaya Jia

To address the high cost and challenges of 3D point-level labeling, we present a method for semi-supervised point cloud semantic segmentation to adopt unlabeled point clouds in training to boost the model performance.

3D Semantic Segmentation Contrastive Learning

Deep Structured Instance Graph for Distilling Object Detectors

1 code implementation ICCV 2021 Yixin Chen, Pengguang Chen, Shu Liu, LiWei Wang, Jiaya Jia

Effectively structuring deep knowledge plays a pivotal role in transfer from teacher to student, especially in semantic vision tasks.

Instance Segmentation Knowledge Distillation +2

Image Synthesis via Semantic Composition

no code implementations ICCV 2021 Yi Wang, Lu Qi, Ying-Cong Chen, Xiangyu Zhang, Jiaya Jia

In this paper, we present a novel approach to synthesize realistic images based on their semantic layouts.

Image Generation Semantic Composition

Exploring and Improving Mobile Level Vision Transformers

no code implementations30 Aug 2021 Pengguang Chen, Yixin Chen, Shu Liu, MingChang Yang, Jiaya Jia

We analyze the reason behind this phenomenon, and propose a novel irregular patch embedding module and adaptive patch fusion module to improve the performance.

Fully Convolutional Networks for Panoptic Segmentation with Point-based Supervision

1 code implementation17 Aug 2021 Yanwei Li, Hengshuang Zhao, Xiaojuan Qi, Yukang Chen, Lu Qi, LiWei Wang, Zeming Li, Jian Sun, Jiaya Jia

In particular, Panoptic FCN encodes each object instance or stuff category with the proposed kernel generator and produces the prediction by convolving the high-resolution feature directly.

Panoptic Segmentation Weakly-supervised panoptic segmentation

Conditional Temporal Variational AutoEncoder for Action Video Prediction

no code implementations12 Aug 2021 Xiaogang Xu, Yi Wang, LiWei Wang, Bei Yu, Jiaya Jia

To synthesize a realistic action sequence based on a single human image, it is crucial to model both motion patterns and diversity in the action video.

motion prediction Video Prediction

Open-World Entity Segmentation

2 code implementations29 Jul 2021 Lu Qi, Jason Kuen, Yi Wang, Jiuxiang Gu, Hengshuang Zhao, Zhe Lin, Philip Torr, Jiaya Jia

By removing the need of class label prediction, the models trained for such task can focus more on improving segmentation quality.

Image Manipulation Panoptic Segmentation

Self-Supervised 3D Mesh Reconstruction From Single Images

no code implementations CVPR 2021 Tao Hu, LiWei Wang, Xiaogang Xu, Shu Liu, Jiaya Jia

Recent single-view 3D reconstruction methods reconstruct object's shape and texture from a single image with only 2D image-level annotation.

3D Reconstruction Image Generation +1

LAPAR: Linearly-Assembled Pixel-Adaptive Regression Network for Single Image Super-Resolution and Beyond

2 code implementations NeurIPS 2020 Wenbo Li, Kun Zhou, Lu Qi, Nianjuan Jiang, Jiangbo Lu, Jiaya Jia

Single image super-resolution (SISR) deals with a fundamental problem of upsampling a low-resolution (LR) image to its high-resolution (HR) version.

Image Deblocking Image Denoising +1

Distilling Knowledge via Knowledge Review

4 code implementations CVPR 2021 Pengguang Chen, Shu Liu, Hengshuang Zhao, Jiaya Jia

Knowledge distillation transfers knowledge from the teacher network to the student one, with the goal of greatly improving the performance of the student network.

Instance Segmentation Knowledge Distillation +2

Improving Calibration for Long-Tailed Recognition

2 code implementations CVPR 2021 Zhisheng Zhong, Jiequan Cui, Shu Liu, Jiaya Jia

Motivated by the fact that predicted probability distributions of classes are highly related to the numbers of class instances, we propose label-aware smoothing to deal with different degrees of over-confidence for classes and improve classifier learning.

Long-tail Learning Representation Learning

Best-Buddy GANs for Highly Detailed Image Super-Resolution

2 code implementations29 Mar 2021 Wenbo Li, Kun Zhou, Lu Qi, Liying Lu, Nianjuan Jiang, Jiangbo Lu, Jiaya Jia

We consider the single image super-resolution (SISR) problem, where a high-resolution (HR) image is generated based on a low-resolution (LR) input.

Image Super-Resolution

Bidirectional Projection Network for Cross Dimension Scene Understanding

1 code implementation CVPR 2021 WenBo Hu, Hengshuang Zhao, Li Jiang, Jiaya Jia, Tien-Tsin Wong

Via the \emph{BPM}, complementary 2D and 3D information can interact with each other in multiple architectural levels, such that advantages in these two visual domains can be combined for better scene recognition.

2D Semantic Segmentation 3D Semantic Segmentation +3

Video Instance Segmentation with a Propose-Reduce Paradigm

1 code implementation ICCV 2021 Huaijia Lin, Ruizheng Wu, Shu Liu, Jiangbo Lu, Jiaya Jia

Video instance segmentation (VIS) aims to segment and associate all instances of predefined classes for each frame in videos.

 Ranked #1 on Unsupervised Video Object Segmentation on DAVIS 2017 (val) (using extra training data)

Frame Instance Segmentation +3

ResLT: Residual Learning for Long-tailed Recognition

3 code implementations26 Jan 2021 Jiequan Cui, Shu Liu, Zhuotao Tian, Zhisheng Zhong, Jiaya Jia

From this perspective, the trivial solution utilizes different branches for the head, medium, and tail classes respectively, and then sums their outputs as the final results is not feasible.

Long-tail Learning

General Adversarial Defense via Pixel Level and Feature Level Distribution Alignment

no code implementations1 Jan 2021 Xiaogang Xu, Hengshuang Zhao, Philip Torr, Jiaya Jia

Specifically, compared with previous methods, we propose a more efficient pixel-level training constraint to weaken the hardness of aligning adversarial samples to clean samples, which can thus obviously enhance the robustness on adversarial samples.

Adversarial Defense Image Classification +2

Point Transformer

7 code implementations ICCV 2021 Hengshuang Zhao, Li Jiang, Jiaya Jia, Philip Torr, Vladlen Koltun

For example, on the challenging S3DIS dataset for large-scale semantic scene segmentation, the Point Transformer attains an mIoU of 70. 4% on Area 5, outperforming the strongest prior model by 3. 3 absolute percentage points and crossing the 70% mIoU threshold for the first time.

3D Part Segmentation 3D Point Cloud Classification +5

GeoNet++: Iterative Geometric Neural Network with Edge-Aware Refinement for Joint Depth and Surface Normal Estimation

2 code implementations13 Dec 2020 Xiaojuan Qi, Zhengzhe Liu, Renjie Liao, Philip H. S. Torr, Raquel Urtasun, Jiaya Jia

Note that GeoNet++ is generic and can be used in other depth/normal prediction frameworks to improve the quality of 3D reconstruction and pixel-wise accuracy of depth and surface normals.

3D Reconstruction Depth Estimation

Fully Convolutional Networks for Panoptic Segmentation

5 code implementations CVPR 2021 Yanwei Li, Hengshuang Zhao, Xiaojuan Qi, LiWei Wang, Zeming Li, Jian Sun, Jiaya Jia

In this paper, we present a conceptually simple, strong, and efficient framework for panoptic segmentation, called Panoptic FCN.

 Ranked #1 on Panoptic Segmentation on Cityscapes val (PQst metric)

Panoptic Segmentation

Learnable Boundary Guided Adversarial Training

3 code implementations ICCV 2021 Jiequan Cui, Shu Liu, LiWei Wang, Jiaya Jia

Previous adversarial training raises model robustness under the compromise of accuracy on natural data.

Adversarial Defense

Generalized Few-shot Semantic Segmentation

no code implementations11 Oct 2020 Zhuotao Tian, Xin Lai, Li Jiang, Michelle Shu, Hengshuang Zhao, Jiaya Jia

Then, since context is essential for semantic segmentation, we propose the Context-Aware Prototype Learning (CAPL) that significantly improves performance by 1) leveraging the co-occurrence prior knowledge from support samples, and 2) dynamically enriching contextual information to the classifier, conditioned on the content of each query image.

Few-Shot Semantic Segmentation Semantic Segmentation

Prior Guided Feature Enrichment Network for Few-Shot Segmentation

3 code implementations4 Aug 2020 Zhuotao Tian, Hengshuang Zhao, Michelle Shu, Zhicheng Yang, Ruiyu Li, Jiaya Jia

It consists of novel designs of (1) a training-free prior mask generation method that not only retains generalization power but also improves model performance and (2) Feature Enrichment Module (FEM) that overcomes spatial inconsistency by adaptively enriching query features with support features and prior masks.

Few-Shot Semantic Segmentation Semantic Segmentation

MuCAN: Multi-Correspondence Aggregation Network for Video Super-Resolution

1 code implementation ECCV 2020 Wenbo Li, Xin Tao, Taian Guo, Lu Qi, Jiangbo Lu, Jiaya Jia

Motivated by these findings, we propose a temporal multi-correspondence aggregation strategy to leverage similar patches across frames, and a cross-scale nonlocal-correspondence aggregation scheme to explore self-similarity of images across scales.

Frame Optical Flow Estimation +1

Exploring Self-attention for Image Recognition

1 code implementation CVPR 2020 Hengshuang Zhao, Jiaya Jia, Vladlen Koltun

Recent work has shown that self-attention can serve as a basic building block for image recognition models.

Dynamic Scale Training for Object Detection

4 code implementations26 Apr 2020 Yukang Chen, Peizhen Zhang, Zeming Li, Yanwei Li, Xiangyu Zhang, Lu Qi, Jian Sun, Jiaya Jia

We propose a Dynamic Scale Training paradigm (abbreviated as DST) to mitigate scale variation challenge in object detection.

Instance Segmentation Object Detection +1

Attentive Normalization for Conditional Image Generation

1 code implementation CVPR 2020 Yi Wang, Ying-Cong Chen, Xiangyu Zhang, Jian Sun, Jiaya Jia

Traditional convolution-based generative adversarial networks synthesize images based on hierarchical local operations, where long-range dependency relation is implicitly modeled with a Markov chain.

Conditional Image Generation Semantic correspondence +2

VCNet: A Robust Approach to Blind Image Inpainting

2 code implementations ECCV 2020 Yi Wang, Ying-Cong Chen, Xin Tao, Jiaya Jia

Blind inpainting is a task to automatically complete visual contents without specifying masks for missing areas in an image.

Image Inpainting

Dynamic Divide-and-Conquer Adversarial Training for Robust Semantic Segmentation

1 code implementation ICCV 2021 Xiaogang Xu, Hengshuang Zhao, Jiaya Jia

Adversarial training is promising for improving robustness of deep neural networks towards adversarial perturbations, especially on the classification task.

Semantic Segmentation

PointINS: Point-based Instance Segmentation

no code implementations13 Mar 2020 Lu Qi, Yi Wang, Yukang Chen, Yingcong Chen, Xiangyu Zhang, Jian Sun, Jiaya Jia

In this paper, we explore the mask representation in instance segmentation with Point-of-Interest (PoI) features.

Instance Segmentation Object Detection +2

3DSSD: Point-based 3D Single Stage Object Detector

2 code implementations CVPR 2020 Zetong Yang, Yanan sun, Shu Liu, Jiaya Jia

Our method outperforms all state-of-the-art voxel-based single stage methods by a large margin, and has comparable performance to two stage point-based methods as well, with inference speed more than 25 FPS, 2x faster than former state-of-the-art point-based methods.

GridMask Data Augmentation

7 code implementations13 Jan 2020 Pengguang Chen, Shu Liu, Hengshuang Zhao, Jiaya Jia

Then we show limitation of existing information dropping algorithms and propose our structured method, which is simple and yet very effective.

Data Augmentation Object Detection +2

DSGN: Deep Stereo Geometry Network for 3D Object Detection

1 code implementation CVPR 2020 Yilun Chen, Shu Liu, Xiaoyong Shen, Jiaya Jia

Most state-of-the-art 3D object detectors heavily rely on LiDAR sensors because there is a large performance gap between image-based and LiDAR-based methods.

3D Object Detection Vehicle Pose Estimation

Hierarchical Point-Edge Interaction Network for Point Cloud Semantic Segmentation

no code implementations ICCV 2019 Li Jiang, Hengshuang Zhao, Shu Liu, Xiaoyong Shen, Chi-Wing Fu, Jiaya Jia

To incorporate point features in the edge branch, we establish a hierarchical graph framework, where the graph is initialized from a coarse layer and gradually enriched along the point decoding process.

Scene Labeling Semantic Segmentation

Fast Point R-CNN

no code implementations ICCV 2019 Yilun Chen, Shu Liu, Xiaoyong Shen, Jiaya Jia

We present a unified, efficient and effective framework for point-cloud based 3D object detection.

3D Object Detection

Landmark Assisted CycleGAN for Cartoon Face Generation

no code implementations2 Jul 2019 Ruizheng Wu, Xiaodong Gu, Xin Tao, Xiaoyong Shen, Yu-Wing Tai, Jiaya Jia

In this paper, we are interested in generating an cartoon face of a person by using unpaired training data between real faces and cartoon ones.

Face Generation

Attribute-Driven Spontaneous Motion in Unpaired Image Translation

1 code implementation ICCV 2019 Ruizheng Wu, Xin Tao, Xiaodong Gu, Xiaoyong Shen, Jiaya Jia

Current image translation methods, albeit effective to produce high-quality results in various applications, still do not consider much geometric transform.

Motion Estimation Translation

Associatively Segmenting Instances and Semantics in Point Clouds

3 code implementations CVPR 2019 Xinlong Wang, Shu Liu, Xiaoyong Shen, Chunhua Shen, Jiaya Jia

A 3D point cloud describes the real scene precisely and intuitively. To date how to segment diversified elements in such an informative 3D scene is rarely discussed.

Ranked #11 on 3D Instance Segmentation on S3DIS (mRec metric)

3D Instance Segmentation 3D Semantic Segmentation

Human Pose Estimation with Spatial Contextual Information

no code implementations7 Jan 2019 Hong Zhang, Hao Ouyang, Shu Liu, Xiaojuan Qi, Xiaoyong Shen, Ruigang Yang, Jiaya Jia

With this principle, we present two conceptually simple and yet computational efficient modules, namely Cascade Prediction Fusion (CPF) and Pose Graph Neural Network (PGNN), to exploit underlying contextual information.

Pose Estimation

Sequential Context Encoding for Duplicate Removal

no code implementations NeurIPS 2018 Lu Qi, Shu Liu, Jianping Shi, Jiaya Jia

Duplicate removal is a critical step to accomplish a reasonable amount of predictions in prevalent proposal-based object detection frameworks.

Object Detection

PSANet: Point-wise Spatial Attention Network for Scene Parsing

3 code implementations ECCV 2018 Hengshuang Zhao, Yi Zhang, Shu Liu, Jianping Shi, Chen Change Loy, Dahua Lin, Jiaya Jia

We notice information flow in convolutional neural networks is restricted inside local neighborhood regions due to the physical design of convolutional filters, which limits the overall understanding of complex scenes.

Scene Parsing Semantic Segmentation

Compositing-aware Image Search

no code implementations ECCV 2018 Hengshuang Zhao, Xiaohui Shen, Zhe Lin, Kalyan Sunkavalli, Brian Price, Jiaya Jia

We present a new image search technique that, given a background image, returns compatible foreground objects for image compositing tasks.

Image Retrieval

GeoNet: Geometric Neural Network for Joint Depth and Surface Normal Estimation

1 code implementation CVPR 2018 Xiaojuan Qi, Renjie Liao, Zhengzhe Liu, Raquel Urtasun, Jiaya Jia

In this paper, we propose Geometric Neural Network (GeoNet) to jointly predict depth and surface normal maps from a single image.

Depth Estimation

Facelet-Bank for Fast Portrait Manipulation

no code implementations CVPR 2018 Ying-Cong Chen, Huaijia Lin, Michelle Shu, Ruiyu Li, Xin Tao, Yangang Ye, Xiaoyong Shen, Jiaya Jia

Digital face manipulation has become a popular and fascinating way to touch images with the prevalence of smartphones and social networks.

Facial Editing

Scale-recurrent Network for Deep Image Deblurring

4 code implementations CVPR 2018 Xin Tao, Hongyun Gao, Yi Wang, Xiaoyong Shen, Jue Wang, Jiaya Jia

In single image deblurring, the "coarse-to-fine" scheme, i. e. gradually restoring the sharp image on different resolutions in a pyramid, is very successful in both traditional optimization-based methods and recent neural-network-based approaches.

Deblurring Image Deblurring +1

3D Graph Neural Networks for RGBD Semantic Segmentation

2 code implementations ICCV 2017 Xiaojuan Qi, Renjie Liao, Jiaya Jia, Sanja Fidler, Raquel Urtasun

Each node in the graph corresponds to a set of points and is associated with a hidden representation vector initialized with an appearance feature extracted by a unary CNN from 2D images.

Semantic Segmentation

Unsupervised Learning of Stereo Matching

no code implementations ICCV 2017 Chao Zhou, Hong Zhang, Xiaoyong Shen, Jiaya Jia

However, due to the limitations of these datasets and the difficulty of collecting new stereo data, current methods fail in real-life cases.

Stereo Matching Stereo Matching Hand

SGN: Sequential Grouping Networks for Instance Segmentation

no code implementations ICCV 2017 Shu Liu, Jiaya Jia, Sanja Fidler, Raquel Urtasun

By exploiting two-directional information, the second network groups horizontal and vertical lines into connected components.

Instance Segmentation Semantic Segmentation

Makeup-Go: Blind Reversion of Portrait Edit

no code implementations ICCV 2017 Ying-Cong Chen, Xiaoyong Shen, Jiaya Jia

In this paper, we propose the task of restoring a portrait image from this process.

Automatic Real-time Background Cut for Portrait Videos

no code implementations28 Apr 2017 Xiaoyong Shen, RuiXing Wang, Hengshuang Zhao, Jiaya Jia

A spatial-temporal refinement network is developed to further refine the segmentation errors in each frame and ensure temporal coherence in the segmentation map.

Frame Semantic Segmentation +2

Zero-order Reverse Filtering

1 code implementation ICCV 2017 Xin Tao, Chao Zhou, Xiaoyong Shen, Jue Wang, Jiaya Jia

In this paper, we study an unconventional but practically meaningful reversibility problem of commonly used image filters.

Detail-revealing Deep Video Super-resolution

1 code implementation ICCV 2017 Xin Tao, Hongyun Gao, Renjie Liao, Jue Wang, Jiaya Jia

In this paper, we show that proper frame alignment and motion compensation is crucial for achieving high quality results.

Frame Image Super-Resolution +2

High-Quality Correspondence and Segmentation Estimation for Dual-Lens Smart-Phone Portraits

no code implementations ICCV 2017 Xiaoyong Shen, Hongyun Gao, Xin Tao, Chao Zhou, Jiaya Jia

Estimating correspondence between two images and extracting the foreground object are two challenges in computer vision.

Convolutional Neural Pyramid for Image Processing

no code implementations7 Apr 2017 Xiaoyong Shen, Ying-Cong Chen, Xin Tao, Jiaya Jia

We propose a principled convolutional neural pyramid (CNP) framework for general low-level vision and image processing tasks.

Colorization Image Enhancement +1

Multi-Scale Patch Aggregation (MPA) for Simultaneous Detection and Segmentation

no code implementations CVPR 2016 Shu Liu, Xiaojuan Qi, Jianping Shi, Hong Zhang, Jiaya Jia

Aiming at simultaneous detection and segmentation (SDS), we propose a proposal-free framework, which detect and segment object instances via mid-level patches.

Object Proposal Generation

ScribbleSup: Scribble-Supervised Convolutional Networks for Semantic Segmentation

no code implementations CVPR 2016 Di Lin, Jifeng Dai, Jiaya Jia, Kaiming He, Jian Sun

Large-scale data is of crucial importance for learning semantic segmentation models, but annotating per-pixel masks is a tedious and inefficient procedure.

Semantic Segmentation

A Closed-Form Solution to Tensor Voting: Theory and Applications

no code implementations19 Jan 2016 Tai-Pang Wu, Sai-Kit Yeung, Jiaya Jia, Chi-Keung Tang, Gerard Medioni

We prove a closed-form solution to tensor voting (CFTV): given a point set in any dimensions, our closed-form solution provides an exact, continuous and efficient algorithm for computing a structure-aware tensor that simultaneously achieves salient structure detection and outlier attenuation.

Stereo Matching Stereo Matching Hand

Box Aggregation for Proposal Decimation: Last Mile of Object Detection

no code implementations ICCV 2015 Shu Liu, Cewu Lu, Jiaya Jia

Regions-with-convolutional-neural-network (RCNN) is now a commonly employed object detection pipeline.

Object Detection

Mutual-Structure for Joint Filtering

no code implementations ICCV 2015 Xiaoyong Shen, Chao Zhou, Li Xu, Jiaya Jia

Previous joint/guided filters directly transfer the structural information in the reference image to the target one.

Depth Completion Image Enhancement +3

Video Super-Resolution via Deep Draft-Ensemble Learning

no code implementations ICCV 2015 Renjie Liao, Xin Tao, Ruiyu Li, Ziyang Ma, Jiaya Jia

We propose a new direction for fast video super-resolution (VideoSR) via a SR draft ensemble, which is defined as the set of high-resolution patch candidates before final image deconvolution.

Ensemble Learning Image Deconvolution +1

ENFT: Efficient Non-Consecutive Feature Tracking for Robust Structure-from-Motion

3 code implementations27 Oct 2015 Guofeng Zhang, Hao-Min Liu, Zilong Dong, Jiaya Jia, Tien-Tsin Wong, Hujun Bao

Our framework consists of steps of solving the feature `dropout' problem when indistinctive structures, noise or large image distortion exists, and of rapidly recognizing and joining common features located in different subsequences.

Deep LAC: Deep Localization, Alignment and Classification for Fine-Grained Recognition

no code implementations CVPR 2015 Di Lin, Xiaoyong Shen, Cewu Lu, Jiaya Jia

Our major contribution is to propose a valve linkage function(VLF) for back-propagation chaining and form our deep localization, alignment and classification (LAC) system.

Classification General Classification

Just Noticeable Defocus Blur Detection and Estimation

no code implementations CVPR 2015 Jianping Shi, Li Xu, Jiaya Jia

We tackle a fundamental problem to detect and estimate just noticeable blur (JNB) caused by defocus that spans a small number of pixels in images.

Bounded-Distortion Metric Learning

no code implementations10 May 2015 Renjie Liao, Jianping Shi, Ziyang Ma, Jun Zhu, Jiaya Jia

Metric learning aims to embed one metric space into another to benefit tasks like classification and clustering.

General Classification Metric Learning

Understanding and Diagnosing Visual Tracking Systems

no code implementations ICCV 2015 Naiyan Wang, Jianping Shi, Dit-yan Yeung, Jiaya Jia

Surprisingly, our findings are discrepant with some common beliefs in the visual tracking research community.

Visual Tracking

Discriminative Blur Detection Features

no code implementations CVPR 2014 Jianping Shi, Li Xu, Jiaya Jia

Ubiquitous image blur brings out a practically important question – what are effective features to differentiate between blurred and unblurred image regions.


L0 Regularized Stationary Time Estimation for Crowd Group Analysis

no code implementations CVPR 2014 Shuai Yi, Xiaogang Wang, Cewu Lu, Jiaya Jia

We tackle stationary crowd analysis in this paper, which is similarly important as modeling mobile groups in crowd scenes and finds many applications in surveillance.


100+ Times Faster Weighted Median Filter (WMF)

no code implementations CVPR 2014 Qi Zhang, Li Xu, Jiaya Jia

Weighted median, in the form of either solver or filter, has been employed in a wide range of computer vision solutions for its beneficial properties in sparsity representation.

Optical Flow Estimation Stereo Matching +1

Two-Class Weather Classification

no code implementations CVPR 2014 Cewu Lu, Di Lin, Jiaya Jia, Chi-Keung Tang

Given a single outdoor image, this paper proposes a collaborative learning approach for labeling it as either sunny or cloudy.

Classification General Classification

Learning Important Spatial Pooling Regions for Scene Classification

no code implementations CVPR 2014 Di Lin, Cewu Lu, Renjie Liao, Jiaya Jia

We address the false response influence problem when learning and applying discriminative parts to construct the mid-level representation in scene classification.

Classification General Classification +1

ESSP: An Efficient Approach to Minimizing Dense and Nonsubmodular Energy Functions

no code implementations19 May 2014 Wei Feng, Jiaya Jia, Zhi-Qiang Liu

From our study, we make some reasonable recommendations of combining existing methods that perform the best in different situations for this challenging problem.

Dense Scattering Layer Removal

no code implementations13 Oct 2013 Qiong Yan, Li Xu, Jiaya Jia

We propose a new model, together with advanced optimization, to separate a thick scattering media layer from a single natural image.

Unnatural L0 Sparse Representation for Natural Image Deblurring

no code implementations CVPR 2013 Li Xu, Shicheng Zheng, Jiaya Jia

We show in this paper that the success of previous maximum a posterior (MAP) based blur removal methods partly stems from their respective intermediate steps, which implicitly or explicitly create an unnatural representation containing salient image structures.

Ranked #9 on Deblurring on RealBlur-R (trained on GoPro) (SSIM (sRGB) metric)

Deblurring Image Deblurring

Hierarchical Saliency Detection

no code implementations CVPR 2013 Qiong Yan, Li Xu, Jianping Shi, Jiaya Jia

When dealing with objects with complex structures, saliency detection confronts a critical problem namely that detection accuracy could be adversely affected if salient foreground or background in an image contains small-scale high-contrast patterns.

Saliency Detection

Online Robust Dictionary Learning

no code implementations CVPR 2013 Cewu Lu, Jiaping Shi, Jiaya Jia

Online dictionary learning is particularly useful for processing large-scale and dynamic data in computer vision.

Dictionary Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.