no code implementations • ECCV 2020 • Ruizheng Wu, Xin Tao, Ying-Cong Chen, Xiaoyong Shen, Jiaya Jia
Unpaired image-to-image translation aims to translate images from the source class to target one by providing sufficient data for these classes.
no code implementations • ECCV 2020 • Wanli Chen, Xinge Zhu, Ruoqi Sun, Junjun He, Ruiyu Li, Xiaoyong Shen, Bei Yu
Then we use these rank-1 tensors to recover the high-rank context features through our proposed tensor reconstruction module (TRM).
1 code implementation • CVPR 2020 • Yilun Chen, Shu Liu, Xiaoyong Shen, Jiaya Jia
Most state-of-the-art 3D object detectors heavily rely on LiDAR sensors because there is a large performance gap between image-based and LiDAR-based methods.
no code implementations • 18 Dec 2019 • Jihan Yang, Ruijia Xu, Ruiyu Li, Xiaojuan Qi, Xiaoyong Shen, Guanbin Li, Liang Lin
In contrast to adversarial alignment, we propose to explicitly train a domain-invariant classifier by generating and defensing against pointwise feature space adversarial perturbations.
no code implementations • ICCV 2019 • Li Jiang, Hengshuang Zhao, Shu Liu, Xiaoyong Shen, Chi-Wing Fu, Jiaya Jia
To incorporate point features in the edge branch, we establish a hierarchical graph framework, where the graph is initialized from a coarse layer and gradually enriched along the point decoding process.
Ranked #46 on
Semantic Segmentation
on S3DIS Area5
no code implementations • ICCV 2019 • Lei Ke, Wenjie Pei, Ruiyu Li, Xiaoyong Shen, Yu-Wing Tai
State-of-the-art image captioning methods mostly focus on improving visual features, less attention has been paid to utilizing the inherent properties of language to boost captioning performance.
Ranked #5 on
Image Captioning
on MS COCO
no code implementations • ICCV 2019 • Canmiao Fu, Wenjie Pei, Qiong Cao, Chaopeng Zhang, Yong Zhao, Xiaoyong Shen, Yu-Wing Tai
Typical methods for supervised sequence modeling are built upon the recurrent neural networks to capture temporal dependencies.
no code implementations • ICCV 2019 • Jinkun Cao, Hongyang Tang, Hao-Shu Fang, Xiaoyong Shen, Cewu Lu, Yu-Wing Tai
Therefore, the easily available human pose dataset, which is of a much larger scale than our labeled animal dataset, provides important prior knowledge to boost up the performance on animal pose estimation.
no code implementations • ICCV 2019 • Yilun Chen, Shu Liu, Xiaoyong Shen, Jiaya Jia
We present a unified, efficient and effective framework for point-cloud based 3D object detection.
no code implementations • 4 Aug 2019 • Zhaoyang Yang, Zhenmei Shi, Xiaoyong Shen, Yu-Wing Tai
The proposed SF-Net extracts features in a structured manner and gradually encodes information at the frame level, the gloss level and the sentence level into the feature representation.
no code implementations • ICCV 2019 • Zetong Yang, Yanan sun, Shu Liu, Xiaoyong Shen, Jiaya Jia
We present a new two-stage 3D object detection framework, named sparse-to-dense 3D Object Detector (STD).
1 code implementation • ICCV 2019 • Ruizheng Wu, Xin Tao, Xiaodong Gu, Xiaoyong Shen, Jiaya Jia
Current image translation methods, albeit effective to produce high-quality results in various applications, still do not consider much geometric transform.
no code implementations • 2 Jul 2019 • Ruizheng Wu, Xiaodong Gu, Xin Tao, Xiaoyong Shen, Yu-Wing Tai, Jiaya Jia
In this paper, we are interested in generating an cartoon face of a person by using unpaired training data between real faces and cartoon ones.
no code implementations • 27 Jun 2019 • Zhuotao Tian, Hengshuang Zhao, Michelle Shu, Jiaze Wang, Ruiyu Li, Xiaoyong Shen, Jiaya Jia
Albeit intensively studied, false prediction and unclear boundaries are still major issues of salient object detection.
no code implementations • 13 Jun 2019 • Pengyuan Lyu, Zhicheng Yang, Xinhang Leng, Xiao-Jun Wu, Ruiyu Li, Xiaoyong Shen
Irregular scene text, which has complex layout in 2D space, is challenging to most previous scene text recognizers.
1 code implementation • CVPR 2019 • Wenjie Pei, Jiyuan Zhang, Xiangrong Wang, Lei Ke, Xiaoyong Shen, Yu-Wing Tai
Typical techniques for video captioning follow the encoder-decoder framework, which can only focus on one source video being processed.
3 code implementations • CVPR 2019 • Xinlong Wang, Shu Liu, Xiaoyong Shen, Chunhua Shen, Jiaya Jia
A 3D point cloud describes the real scene precisely and intuitively. To date how to segment diversified elements in such an informative 3D scene is rarely discussed.
Ranked #16 on
3D Instance Segmentation
on S3DIS
(mRec metric)
no code implementations • 7 Jan 2019 • Hong Zhang, Hao Ouyang, Shu Liu, Xiaojuan Qi, Xiaoyong Shen, Ruigang Yang, Jiaya Jia
With this principle, we present two conceptually simple and yet computational efficient modules, namely Cascade Prediction Fusion (CPF) and Pose Graph Neural Network (PGNN), to exploit underlying contextual information.
Ranked #10 on
Pose Estimation
on MPII Human Pose
no code implementations • 13 Dec 2018 • Zetong Yang, Yanan sun, Shu Liu, Xiaoyong Shen, Jiaya Jia
We present a novel 3D object detection framework, named IPOD, based on raw point cloud.
Ranked #1 on
3D Object Detection
on KITTI Pedestrians Easy
2 code implementations • NeurIPS 2018 • Yi Wang, Xin Tao, Xiaojuan Qi, Xiaoyong Shen, Jiaya Jia
In this paper, we propose a generative multi-column network for image inpainting.
1 code implementation • CVPR 2018 • Ruiyu Li, Kaican Li, Yi-Chun Kuo, Michelle Shu, Xiaojuan Qi, Xiaoyong Shen, Jiaya Jia
We address the problem of image segmentation from natural language descriptions.
no code implementations • CVPR 2018 • Ying-Cong Chen, Huaijia Lin, Michelle Shu, Ruiyu Li, Xin Tao, Yangang Ye, Xiaoyong Shen, Jiaya Jia
Digital face manipulation has become a popular and fascinating way to touch images with the prevalence of smartphones and social networks.
4 code implementations • CVPR 2018 • Xin Tao, Hongyun Gao, Yi Wang, Xiaoyong Shen, Jue Wang, Jiaya Jia
In single image deblurring, the "coarse-to-fine" scheme, i. e. gradually restoring the sharp image on different resolutions in a pyramid, is very successful in both traditional optimization-based methods and recent neural-network-based approaches.
Ranked #2 on
Image Deblurring
on GoPro
(Params (M) metric, using extra
training data)
no code implementations • ICCV 2017 • Chao Zhou, Hong Zhang, Xiaoyong Shen, Jiaya Jia
However, due to the limitations of these datasets and the difficulty of collecting new stereo data, current methods fail in real-life cases.
no code implementations • ICCV 2017 • Ying-Cong Chen, Xiaoyong Shen, Jiaya Jia
In this paper, we propose the task of restoring a portrait image from this process.
no code implementations • 28 Apr 2017 • Xiaoyong Shen, RuiXing Wang, Hengshuang Zhao, Jiaya Jia
A spatial-temporal refinement network is developed to further refine the segmentation errors in each frame and ensure temporal coherence in the segmentation map.
18 code implementations • ECCV 2018 • Hengshuang Zhao, Xiaojuan Qi, Xiaoyong Shen, Jianping Shi, Jiaya Jia
We focus on the challenging task of real-time semantic segmentation in this paper.
Ranked #12 on
Semantic Segmentation
on BDD100K val
Dichotomous Image Segmentation
Real-Time Semantic Segmentation
+3
1 code implementation • ICCV 2017 • Xin Tao, Chao Zhou, Xiaoyong Shen, Jue Wang, Jiaya Jia
In this paper, we study an unconventional but practically meaningful reversibility problem of commonly used image filters.
no code implementations • 7 Apr 2017 • Xiaoyong Shen, Ying-Cong Chen, Xin Tao, Jiaya Jia
We propose a principled convolutional neural pyramid (CNP) framework for general low-level vision and image processing tasks.
no code implementations • ICCV 2017 • Xiaoyong Shen, Hongyun Gao, Xin Tao, Chao Zhou, Jiaya Jia
Estimating correspondence between two images and extracting the foreground object are two challenges in computer vision.
no code implementations • ICCV 2015 • Xiaoyong Shen, Chao Zhou, Li Xu, Jiaya Jia
Previous joint/guided filters directly transfer the structural information in the reference image to the target one.
no code implementations • CVPR 2015 • Di Lin, Xiaoyong Shen, Cewu Lu, Jiaya Jia
Our major contribution is to propose a valve linkage function(VLF) for back-propagation chaining and form our deep localization, alignment and classification (LAC) system.