Search Results for author: Xiaoguang Han

Found 53 papers, 16 papers with code

Multi-level Consistency Learning for Semi-supervised Domain Adaptation

no code implementations • 9 May 2022 • Zizheng Yan, Yushuang Wu, Guanbin Li, Yipeng Qin, Xiaoguang Han, Shuguang Cui

Semi-supervised domain adaptation (SSDA) aims to apply knowledge learned from a fully labeled source domain to a scarcely labeled target domain.

Domain Adaptation

DArch: Dental Arch Prior-assisted 3D Tooth Instance Segmentation

no code implementations • 25 Apr 2022 • Liangdong Qiu, Chongjie Ye, Pei Chen, Yunbi Liu, Xiaoguang Han, Shuguang Cui

Experimental results on $4,773$ dental models have shown our DArch can accurately segment each tooth of a dental model, and its performance is superior to the state-of-the-art methods.

Instance Segmentation Semantic Segmentation

Registering Explicit to Implicit: Towards High-Fidelity Garment mesh Reconstruction from Single Images

no code implementations • 28 Mar 2022 • Heming Zhu, Lingteng Qiu, Yuda Qiu, Xiaoguang Han

Fueled by the power of deep learning techniques and implicit shape learning, recent advances in single-image human digitalization have reached unprecedented accuracy and could recover fine-grained surface details such as garment wrinkles.

SharpContour: A Contour-based Boundary Refinement Approach for Efficient and Accurate Instance Segmentation

no code implementations • 24 Mar 2022 • Chenming Zhu, Xuanye Zhang, Yanran Li, Liangdong Qiu, Kai Han, Xiaoguang Han

Contour-based models are efficient and generic to be incorporated with any existing segmentation methods, but they often generate over-smoothed contour and tend to fail on corner areas.

Instance Segmentation Semantic Segmentation

Compound Domain Generalization via Meta-Knowledge Encoding

no code implementations • 24 Mar 2022 • Chaoqi Chen, Jiongcheng Li, Xiaoguang Han, Xiaoqing Liu, Yizhou Yu

Such holistic semantic structure, referred to as meta-knowledge here, is crucial for learning generalizable representations.

Domain Generalization Out-of-Distribution Generalization

Exploring Motion Ambiguity and Alignment for High-Quality Video Frame Interpolation

no code implementations • 19 Mar 2022 • Kun Zhou, Wenbo Li, Xiaoguang Han, Jiangbo Lu

Without the bells and whistles, our plug-and-play TCL is capable of improving the performance of existing VFI frameworks.

Frame Optical Flow Estimation +1

TO-Scene: A Large-scale Dataset for Understanding 3D Tabletop Scenes

no code implementations • 17 Mar 2022 • Mutian Xu, Pei Chen, Haolin Liu, Xiaoguang Han

Many basic indoor activities such as eating or writing are always conducted upon different tabletops (e.g., coffee tables, writing desks).

3D Semantic Segmentation Object Detection +1

Blind Image Super Resolution with Semantic-Aware Quantized Texture Prior

1 code implementation • 26 Feb 2022 • Chaofeng Chen, Xinyu Shi, Yipeng Qin, Xiaoming Li, Xiaoguang Han, Tao Yang, Shihui Guo

Since features in the codebook have shown the ability to generate natural textures in the pretrain stage, QuanTexSR can generate rich and realistic textures with the pretrained codebook as texture priors.

Image Super-Resolution Texture Synthesis

PointMatch: A Consistency Training Framework for Weakly Supervised Semantic Segmentation of 3D Point Clouds

no code implementations • 22 Feb 2022 • Yushuang Wu, Zizheng Yan, Shengcai Cai, Guanbin Li, Yizhou Yu, Xiaoguang Han, Shuguang Cui

Semantic segmentation of point cloud usually relies on dense annotation that is exhausting and costly, so it attracts wide attention to investigate solutions for the weakly supervised scheme with only sparse points annotated.

Representation Learning Semantic Segmentation

Pose2Room: Understanding 3D Scenes from Human Activities

no code implementations • 1 Dec 2021 • Yinyu Nie, Angela Dai, Xiaoguang Han, Matthias Nießner

To this end, we propose P2R-Net to learn a probabilistic 3D model of the objects in a scene characterized by their class categories and oriented 3D bounding boxes, based on an input observed human trajectory in the environment.

SketchHairSalon: Deep Sketch-based Hair Image Synthesis

no code implementations • 16 Sep 2021 • Chufeng Xiao, Deng Yu, Xiaoguang Han, Youyi Zheng, Hongbo Fu

At the second stage, another network is trained to synthesize the structure and appearance of hair images from the input sketch and the generated matte.

Image Generation

Preservational Learning Improves Self-supervised Medical Image Models by Reconstructing Diverse Contexts

1 code implementation • ICCV 2021 • Hong-Yu Zhou, Chixiang Lu, Sibei Yang, Xiaoguang Han, Yizhou Yu

From this perspective, we introduce Preservational Learning to reconstruct diverse image contexts in order to preserve more information in learned representations.

Contrastive Learning Representation Learning +1

ME-PCN: Point Completion Conditioned on Mask Emptiness

1 code implementation • ICCV 2021 • Bingchen Gong, Yinyu Nie, Yiqun Lin, Xiaoguang Han, Yizhou Yu

Main-stream methods predict the missing shapes by decoding a global feature learned from the input point cloud, which often leads to deficient results in preserving topology consistency and surface details.

SimpModeling: Sketching Implicit Field to Guide Mesh Modeling for 3D Animalmorphic Head Design

1 code implementation • 5 Aug 2021 • Zhongjin Luo, Jie Zhou, Heming Zhu, Dong Du, Xiaoguang Han, Hongbo Fu

In this work, we propose SimpModeling, a novel sketch-based system for helping users, especially amateur users, easily model 3D animalmorphic heads - a prevalent kind of heads in character design.

From Single to Multiple: Leveraging Multi-level Prediction Spaces for Video Forecasting

no code implementations • 21 Jul 2021 • Mengcheng Lan, Shuliang Ning, Yanran Li, Qian Chen, Xunlai Chen, Xiaoguang Han, Shuguang Cui

Although video forecasting has been a widely explored topic in recent years, most existing work still limits its models to a single prediction space and neglects ways to leverage multiple prediction spaces.

Video Prediction

Transformer with Peak Suppression and Knowledge Guidance for Fine-grained Image Recognition

no code implementations • 14 Jul 2021 • Xinda Liu, Lili Wang, Xiaoguang Han

In this paper, we analyze the difficulties of fine-grained image recognition from a new perspective and propose a transformer architecture with the peak suppression module and knowledge guidance module, which respects the diversification of discriminative features in a single image and the aggregation of discriminative clues among multiple images.

Fine-Grained Image Classification Fine-Grained Image Recognition

Task-Aware Sampling Layer for Point-Wise Analysis

no code implementations • 9 Jul 2021 • Yiqun Lin, Lichang Chen, Haibin Huang, Chongyang Ma, Xiaoguang Han, Shuguang Cui

Sampling, grouping, and aggregation are three important components in the multi-scale analysis of point clouds.

Keypoint Detection Point Cloud Completion

LapsCore: Language-Guided Person Search via Color Reasoning

no code implementations • ICCV 2021 • Yushuang Wu, Zizheng Yan, Xiaoguang Han, Guanbin Li, Changqing Zou, Shuguang Cui

The key point of language-guided person search is to construct the cross-modal association between visual and textual input.

Colorization Person Search +1

RfD-Net: Point Scene Understanding by Semantic Instance Reconstruction

1 code implementation • CVPR 2021 • Yinyu Nie, Ji Hou, Xiaoguang Han, Matthias Nießner

In this work, we introduce RfD-Net that jointly detects and reconstructs dense object surfaces directly from raw point clouds.

Object Detection Object Localization +2

A deep learning based interactive sketching system for fashion images design

no code implementations • 9 Oct 2020 • Yao Li, Xianggang Yu, Xiaoguang Han, Nianjuan Jiang, Kui Jia, Jiangbo Lu

In this work, we propose an interactive system to design diverse high-quality garment images from fashion sketches and the texture information.

Intrinsic Image Decomposition Texture Synthesis

Reinforcement Learning for Weakly Supervised Temporal Grounding of Natural Language in Untrimmed Videos

no code implementations • 18 Sep 2020 • Jie Wu, Guanbin Li, Xiaoguang Han, Liang Lin

Temporal grounding of natural language in untrimmed videos is a fundamental yet challenging multimedia task facilitating cross-media visual content retrieval.

reinforcement-learning Temporal Localization

Ultrasound Liver Fibrosis Diagnosis using Multi-indicator guided Deep Neural Networks

no code implementations • 10 Sep 2020 • Jiali Liu, Wenxuan Wang, Tianyao Guan, Ningbo Zhao, Xiaoguang Han, Zhen Li

An indicator-guided learning mechanism is further proposed to ease the training of the proposed model.

SkeletonNet: A Topology-Preserving Solution for Learning Mesh Reconstruction of Object Surfaces from RGB Images

1 code implementation • 13 Aug 2020 • Jiapeng Tang, Xiaoguang Han, Mingkui Tan, Xin Tong, Kui Jia

However, they all have their own drawbacks, and cannot properly reconstruct the surface shapes of complex topologies, arguably due to a lack of constraints on the topological structures in their learning frameworks.

Surface Reconstruction

Deep Fashion3D: A Dataset and Benchmark for 3D Garment Reconstruction from Single Images

2 code implementations • ECCV 2020 • Heming Zhu, Yu Cao, Hang Jin, Weikai Chen, Dong Du, Zhangye Wang, Shuguang Cui, Xiaoguang Han

High-fidelity clothing reconstruction is the key to achieving photorealism in a wide range of applications including human digitization, virtual try-on, etc.

Virtual Try-on

Learning Inverse Rendering of Faces from Real-world Videos

1 code implementation • 26 Mar 2020 • Yuda Qiu, Zhangyang Xiong, Kai Han, Zhongyuan Wang, Zixiang Xiong, Xiaoguang Han

To alleviate this problem, we propose a weakly supervised training approach to train our model on real face videos, based on the assumption of consistency of albedo and normal across different frames, thus bridging the gap between real and synthetic face images.

Peeking into occluded joints: A novel framework for crowd pose estimation

1 code implementation • ECCV 2020 • Lingteng Qiu, Xuanye Zhang, Yan-ran Li, Guanbin Li, Xiao-Jun Wu, Zixiang Xiong, Xiaoguang Han, Shuguang Cui

Although occlusion widely exists in nature and remains a fundamental challenge for pose estimation, existing heatmap-based approaches suffer serious degradation on occlusions.

Pose Estimation

Shallow2Deep: Indoor Scene Modeling by Single Image Understanding

no code implementations • 22 Feb 2020 • Yinyu Nie, Shihui Guo, Jian Chang, Xiaoguang Han, Jiahui Huang, Shi-Min Hu, Jian Jun Zhang

Particularly, we design a shallow-to-deep architecture on the basis of convolutional networks for semantic scene understanding and modeling.

Scene Understanding

Self-Enhanced Convolutional Network for Facial Video Hallucination

no code implementations • 23 Nov 2019 • Chaowei Fang, Guanbin Li, Xiaoguang Han, Yizhou Yu

It further recurrently exploits the reconstructed results and intermediate features of a sequence of preceding frames to improve the initial super-resolution of the current frame by modelling the coherence of structural facial features across frames.

Frame Video Super-Resolution

Deep Mesh Reconstruction from Single RGB Images via Topology Modification Networks

no code implementations • ICCV 2019 • Junyi Pan, Xiaoguang Han, Weikai Chen, Jiapeng Tang, Kui Jia

The key to our approach is a novel progressive shaping framework that alternates between mesh deformation and topology modification.

Deep Reinforcement Learning of Volume-guided Progressive View Inpainting for 3D Point Scene Completion from a Single Depth Image

no code implementations • CVPR 2019 • Xiaoguang Han, Zhaoxuan Zhang, Dong Du, Mingdai Yang, Jingming Yu, Pan Pan, Xin Yang, Ligang Liu, Zixiang Xiong, Shuguang Cui

Given a single depth image, our method first goes through the 3D volume branch to obtain a volumetric scene reconstruction as a guide to the next view inpainting step, which attempts to make up the missing information; the third step involves projecting the volume under the same view of the input, concatenating them to complete the current view depth, and integrating all depth into the point cloud.

Two-phase Hair Image Synthesis by Self-Enhancing Generative Model

no code implementations • 28 Feb 2019 • Haonan Qiu, Chuan Wang, Hang Zhu, Xiangyu Zhu, Jinjin Gu, Xiaoguang Han

Generating plausible hair images given limited guidance, such as sparse sketches or low-resolution images, has been made possible with the rise of Generative Adversarial Networks (GANs).

Image-to-Image Translation Super-Resolution +1

Learning Mutually Local-global U-nets For High-resolution Retinal Lesion Segmentation in Fundus Images

no code implementations • 18 Jan 2019 • Zizheng Yan, Xiaoguang Han, Changmiao Wang, Yuda Qiu, Zixiang Xiong, Shuguang Cui

Due to high-resolution and small-size lesion regions, applying existing methods, such as U-Nets, to perform segmentation on fundus photography is very challenging.

Lesion Segmentation

Deep RBFNet: Point Cloud Feature Learning using Radial Basis Functions

no code implementations • 11 Dec 2018 • Weikai Chen, Xiaoguang Han, Guanbin Li, Chao Chen, Jun Xing, Yajie Zhao, Hao Li

Three-dimensional object recognition has recently achieved great progress thanks to the development of effective point cloud-based learning frameworks, such as PointNet and its extensions.

3D Object Recognition

Adversarial 3D Human Pose Estimation via Multimodal Depth Supervision

no code implementations • 21 Sep 2018 • Kun Zhou, Jinmiao Cai, Yao Li, Yulong Shi, Xiaoguang Han, Nianjuan Jiang, Kui Jia, Jiangbo Lu

In this paper, a novel deep-learning based framework is proposed to infer 3D human poses from a single image.

3D Human Pose Estimation

CaricatureShop: Personalized and Photorealistic Caricature Sketching

no code implementations • 24 Jul 2018 • Xiaoguang Han, Kangcheng Hou, Dong Du, Yuda Qiu, Yizhou Yu, Kun Zhou, Shuguang Cui

To construct the mapping between 2D sketches and a vertex-wise scaling field, a novel deep learning architecture is developed.

Caricature Face Model

FBI-Pose: Towards Bridging the Gap between 2D Images and 3D Human Poses using Forward-or-Backward Information

no code implementations • 25 Jun 2018 • Yulong Shi, Xiaoguang Han, Nianjuan Jiang, Kun Zhou, Kui Jia, Jiangbo Lu

Although significant advances have been made in the area of human pose estimation from images using deep Convolutional Neural Networks (ConvNets), it remains a big challenge to perform 3D pose inference in the wild.

Video Inpainting by Jointly Learning Temporal Structure and Spatial Details

no code implementations • 22 Jun 2018 • Chuan Wang, Haibin Huang, Xiaoguang Han, Jue Wang

We present a new data-driven video inpainting method for recovering missing regions of video frames.

Frame Video Inpainting

High-Resolution Shape Completion Using Deep Neural Networks for Global Structure and Local Geometry Inference

no code implementations • ICCV 2017 • Xiaoguang Han, Zhen Li, Haibin Huang, Evangelos Kalogerakis, Yizhou Yu

Our method is based on a new deep learning architecture consisting of two sub-networks: a global structure inference network and a local geometry refinement network.

DeepSketch2Face: A Deep Learning Based Sketching System for 3D Face and Caricature Modeling

no code implementations • 7 Jun 2017 • Xiaoguang Han, Chang Gao, Yizhou Yu

This system has a labor-efficient sketching interface that allows the user to draw freehand, imprecise yet expressive 2D lines representing the contours of facial features.

