Search Results for author: Xiaoguang Han

Found 105 papers, 31 papers with code

Unveiling Advanced Frequency Disentanglement Paradigm for Low-Light Image Enhancement

1 code implementation • 3 Sep 2024 • Kun Zhou, Xinyu Lin, Wenbo Li, Xiaogang Xu, Yuanhao Cai, Zhonghang Liu, Xiaoguang Han, Jiangbo Lu

Previous low-light image enhancement (LLIE) approaches, while employing frequency decomposition techniques to address the intertwined challenges of low frequency (e.g., illumination recovery) and high frequency (e.g., noise reduction), primarily focused on the development of dedicated and complex networks to achieve improved performance.

Disentanglement Low-Light Image Enhancement

Towards Realistic Example-based Modeling via 3D Gaussian Stitching

no code implementations • 28 Aug 2024 • Xinyu Gao, ZiYi Yang, Bingchen Gong, Xiaoguang Han, Sipeng Yang, Xiaogang Jin

To this end, we present an example-based modeling method that combines multiple Gaussian fields in a point-based representation using sample-guided synthesis.

CT4D: Consistent Text-to-4D Generation with Animatable Meshes

no code implementations • 15 Aug 2024 • Ce Chen, Shaoli Huang, Xuelin Chen, Guangyi Chen, Xiaoguang Han, Kun Zhang, Mingming Gong

The primary challenges of our mesh-based framework involve stably generating a mesh with details that align with the text prompt while directly driving it and maintaining surface continuity.

ViMo: Generating Motions from Casual Videos

no code implementations • 13 Aug 2024 • Liangdong Qiu, Chengxing Yu, Yanran Li, Zhao Wang, Haibin Huang, Chongyang Ma, Di Zhang, Pengfei Wan, Xiaoguang Han

Although humans have the innate ability to imagine multiple possible actions from videos, it remains an extraordinary challenge for computers due to the intricate camera movements and montages.

ToDER: Towards Colonoscopy Depth Estimation and Reconstruction with Geometry Constraint Adaptation

no code implementations • 23 Jul 2024 • Zhenhua Wu, Yanlin Jin, Liangdong Qiu, Xiaoguang Han, Xiang Wan, Guanbin Li

Furthermore, we carefully design a TNet module in our adaptation architecture to yield geometry constraints and obtain better depth quality.

Depth Estimation Domain Adaptation

GaussReg: Fast 3D Registration with Gaussian Splatting

no code implementations • 7 Jul 2024 • Jiahao Chang, Yinglin Xu, Yihao Li, Yuantao Chen, Xiaoguang Han

Existing methods usually convert the implicit representation to an explicit representation for further registration.

Point Cloud Registration

StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal

no code implementations • 24 Jun 2024 • Chongjie Ye, Lingteng Qiu, Xiaodong Gu, Qi Zuo, Yushuang Wu, Zilong Dong, Liefeng Bo, Yuliang Xiu, Xiaoguang Han

The effectiveness of StableNormal is demonstrated through competitive performance on standard datasets such as DIODE-indoor, iBims, ScannetV2 and NYUv2, and also in various downstream tasks, such as surface reconstruction and normal enhancement.

Surface Normal Estimation Surface Reconstruction

Sketch2Human: Deep Human Generation with Disentangled Geometry and Appearance Control

no code implementations • 24 Apr 2024 • Linzi Qu, Jiaxiang Shang, Hui Ye, Xiaoguang Han, Hongbo Fu

This work presents Sketch2Human, the first system for controllable full-body human image generation guided by a semantic sketch (for geometry control) and a reference image (for appearance control).

Face Generation

SphereHead: Stable 3D Full-head Synthesis with Spherical Tri-plane Representation

no code implementations • 8 Apr 2024 • Heyuan Li, Ce Chen, Tianhao Shi, Yuda Qiu, Sizhe An, GuanYing Chen, Xiaoguang Han

We further introduce a view-image consistency loss for the discriminator to emphasize the correspondence of the camera parameters and the images.

Face Generation

GauStudio: A Modular Framework for 3D Gaussian Splatting and Beyond

1 code implementation • 28 Mar 2024 • Chongjie Ye, Yinyu Nie, Jiahao Chang, Yuantao Chen, YiHao Zhi, Xiaoguang Han

We present GauStudio, a novel modular framework for modeling 3D Gaussian Splatting (3DGS) to provide standardized, plug-and-play components for users to easily customize and implement a 3DGS pipeline.

Novel View Synthesis Surface Reconstruction

IPoD: Implicit Field Learning with Point Diffusion for Generalizable 3D Object Reconstruction from Single RGB-D Images

no code implementations • CVPR 2024 • Yushuang Wu, Luyue Shi, Junhao Cai, Weihao Yuan, Lingteng Qiu, Zilong Dong, Liefeng Bo, Shuguang Cui, Xiaoguang Han

This approach treats the query points for implicit field learning as a noisy point cloud for iterative denoising, allowing for their dynamic adaptation to the target object shape.

3D Object Reconstruction Denoising

PICTURE: PhotorealistIC virtual Try-on from UnconstRained dEsigns

no code implementations • CVPR 2024 • Shuliang Ning, Duomin Wang, Yipeng Qin, Zirong Jin, Baoyuan Wang, Xiaoguang Han

Unlike prior art constrained by specific input types, our method allows flexible specification of style (text or image) and texture (full garment, cropped sections, or texture patches) conditions.

Disentanglement Human Parsing +1

RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3D

no code implementations • CVPR 2024 • Lingteng Qiu, GuanYing Chen, Xiaodong Gu, Qi Zuo, Mutian Xu, Yushuang Wu, Weihao Yuan, Zilong Dong, Liefeng Bo, Xiaoguang Han

Lifting 2D diffusion for 3D generation is a challenging problem due to the lack of geometric prior and the complex entanglement of materials and lighting in natural images.

3D Generation Text to 3D

HAVE-FUN: Human Avatar Reconstruction from Few-Shot Unconstrained Images

no code implementations • CVPR 2024 • Xihe Yang, Xingyu Chen, Daiheng Gao, Shaohui Wang, Xiaoguang Han, Baoyuan Wang

As for human avatar reconstruction, contemporary techniques commonly necessitate the acquisition of costly data and struggle to achieve satisfactory results from a small number of casual images.

FIRST: A Million-Entry Dataset for Text-Driven Fashion Synthesis and Design

no code implementations • 13 Nov 2023 • Zhen Huang, Yihao Li, Dong Pei, Jiapeng Zhou, Xuliang Ning, Jianlin Han, Xiaoguang Han, Xuejun Chen

Text-driven fashion synthesis and design is an extremely valuable part of artificial intelligence generated content (AIGC), which has the potential to propel a tremendous revolution in the traditional fashion industry.

Fashion Synthesis

SeamlessNeRF: Stitching Part NeRFs with Gradient Propagation

no code implementations • 30 Oct 2023 • Bingchen Gong, Yuehao Wang, Xiaoguang Han, Qi Dou

To fill this gap, we propose SeamlessNeRF, a novel approach for seamless appearance blending of multiple NeRFs.

Activate and Reject: Towards Safe Domain Generalization under Category Shift

no code implementations • ICCV 2023 • Chaoqi Chen, Luyao Tang, Leitian Tao, Hong-Yu Zhou, Yue Huang, Xiaoguang Han, Yizhou Yu

Despite their notable performance on in-domain test points, it is non-trivial for deep neural networks to attain satisfactory accuracy when deployed in the open world, where novel domains and object classes often occur.

Domain Generalization Image Classification +3

EMS: 3D Eyebrow Modeling from Single-view Images

no code implementations • 22 Sep 2023 • Chenghong Li, Leyang Jin, Yujian Zheng, Yizhou Yu, Xiaoguang Han

Three modules are then carefully designed: RootFinder first localizes the fiber root positions, which indicate where to grow; OriPredictor predicts an orientation field in 3D space to guide the growth of fibers; FiberEnder determines when to stop the growth of each fiber.

Efficient View Synthesis with Neural Radiance Distribution Field

no code implementations • ICCV 2023 • Yushuang Wu, Xiao Li, Jinglu Wang, Xiaoguang Han, Shuguang Cui, Yan Lu

Specifically, we use a small network similar to NeRF while preserving the rendering speed with a single network forwarding per pixel as in NeLF.

Free-ATM: Exploring Unsupervised Learning on Diffusion-Generated Images with Free Attention Masks

no code implementations • 13 Aug 2023 • David Junhao Zhang, Mutian Xu, Chuhui Xue, Wenqing Zhang, Xiaoguang Han, Song Bai, Mike Zheng Shou

Despite the rapid advancement of unsupervised learning in visual representation, it requires training on large-scale datasets that demand costly data collection and pose additional challenges due to concerns regarding data privacy.

Contrastive Learning Image Classification +2

Universal Semi-supervised Model Adaptation via Collaborative Consistency Training

no code implementations • 7 Jul 2023 • Zizheng Yan, Yushuang Wu, Yipeng Qin, Xiaoguang Han, Shuguang Cui, Guanbin Li

In this paper, we introduce a realistic and challenging domain adaptation problem called Universal Semi-supervised Model Adaptation (USMA), which i) requires only a pre-trained source model, ii) allows the source and target domain to have different label sets, i.e., they share a common label set and hold their own private label set, and iii) requires only a few labeled samples in each class of the target domain.

Domain Adaptation

SketchMetaFace: A Learning-based Sketching Interface for High-fidelity 3D Character Face Modeling

no code implementations • 3 Jul 2023 • Zhongjin Luo, Dong Du, Heming Zhu, Yizhou Yu, Hongbo Fu, Xiaoguang Han

User studies demonstrate the superiority of our system over existing modeling tools in terms of ease of use and visual quality of results.

3D Keypoint Estimation Using Implicit Representation Learning

no code implementations • 20 Jun 2023 • Xiangyu Zhu, Dong Du, Haibin Huang, Chongyang Ma, Xiaoguang Han

Inspired by the recent success of advanced implicit representation in reconstruction tasks, we explore the idea of using an implicit field to represent keypoints.

Keypoint Estimation Representation Learning

AUGUST: an Automatic Generation Understudy for Synthesizing Conversational Recommendation Datasets

no code implementations • 16 Jun 2023 • Yu Lu, Junwei Bao, Zichen Ma, Xiaoguang Han, Youzheng Wu, Shuguang Cui, Xiaodong He

High-quality data is essential for conversational recommendation systems and serves as the cornerstone of the network architecture development and training strategy design.

Conversational Recommendation Knowledge Graphs +1

From NeRFLiX to NeRFLiX++: A General NeRF-Agnostic Restorer Paradigm

1 code implementation • 10 Jun 2023 • Kun Zhou, Wenbo Li, Nianjuan Jiang, Xiaoguang Han, Jiangbo Lu

To address this, we propose NeRFLiX, a general NeRF-agnostic restorer paradigm that learns a degradation-driven inter-viewpoint mixer.

Computational Efficiency Novel View Synthesis

REC-MV: REconstructing 3D Dynamic Cloth from Monocular Videos

1 code implementation • CVPR 2023 • Lingteng Qiu, GuanYing Chen, Jiapeng Zhou, Mutian Xu, Junle Wang, Xiaoguang Han

To address the above limitations, in this paper, we formulate this task as an optimization problem of 3D garment feature curves and surface reconstruction from monocular video.

Garment Reconstruction Neural Rendering +1

FashionTex: Controllable Virtual Try-on with Text and Texture

1 code implementation • 8 May 2023 • Anran Lin, Nanxuan Zhao, Shuliang Ning, Yuda Qiu, Baoyuan Wang, Xiaoguang Han

Virtual try-on attracts increasing research attention as a promising way to enhance the user experience of online clothing shopping.

Virtual Try-on

NerVE: Neural Volumetric Edges for Parametric Curve Extraction from Point Cloud

no code implementations • CVPR 2023 • Xiangyu Zhu, Dong Du, Weikai Chen, Zhiyou Zhao, Yinyu Nie, Xiaoguang Han

We show that a simple network based on NerVE can already outperform the previous state-of-the-art methods by a great margin.

Keypoint Detection

RaBit: Parametric Modeling of 3D Biped Cartoon Characters with a Topological-consistent Dataset

no code implementations • CVPR 2023 • Zhongjin Luo, Shengcai Cai, Jinguo Dong, Ruibo Ming, Liangdong Qiu, Xiaohang Zhan, Xiaoguang Han

However, none of the prior works focus on modeling 3D biped cartoon characters, which are also in great demand in gaming and filming.

HairStep: Transfer Synthetic to Real Using Strand and Depth Maps for Single-View 3D Hair Modeling

1 code implementation • CVPR 2023 • Yujian Zheng, Zirong Jin, Moran Li, Haibin Huang, Chongyang Ma, Shuguang Cui, Xiaoguang Han

We firmly think an intermediate representation is essential, but we argue that the orientation maps produced by the dominant filtering-based methods are sensitive to uncertain noise and far from a competent representation.

Get3DHuman: Lifting StyleGAN-Human into a 3D Generative Model using Pixel-aligned Reconstruction Priors

no code implementations • ICCV 2023 • Zhangyang Xiong, Di Kang, Derong Jin, Weikai Chen, Linchao Bao, Shuguang Cui, Xiaoguang Han

Specifically, we bridge the latent space of Get3DHuman with that of StyleGAN-Human via a specially-designed prior network, where the input latent code is mapped to the shape and texture feature volumes spanned by the pixel-aligned 3D reconstructor.

Diversity

RecolorNeRF: Layer Decomposed Radiance Fields for Efficient Color Editing of 3D Scenes

no code implementations • 19 Jan 2023 • Bingchen Gong, Yuehao Wang, Xiaoguang Han, Qi Dou

We present RecolorNeRF, a novel user-friendly color editing approach for the neural radiance fields.

Color Manipulation

MM-3DScene: 3D Scene Understanding by Customizing Masked Modeling with Informative-Preserved Reconstruction and Self-Distilled Consistency

no code implementations • CVPR 2023 • Mingye Xu, Mutian Xu, Tong He, Wanli Ouyang, Yali Wang, Xiaoguang Han, Yu Qiao

Besides, such scenes with progressive masking ratios can also serve to self-distill their intrinsic spatial consistency, which requires learning consistent representations from unmasked areas.

object-detection Object Detection +2

Which Pixel to Annotate: a Label-Efficient Nuclei Segmentation Framework

1 code implementation • 20 Dec 2022 • Wei Lou, Haofeng Li, Guanbin Li, Xiaoguang Han, Xiang Wan

Recently, deep neural networks, which require a large amount of annotated samples, have been widely applied in nuclei instance segmentation of H&E stained pathology images.

Instance Segmentation Segmentation +1

MIMO Is All You Need: A Strong Multi-In-Multi-Out Baseline for Video Prediction

1 code implementation • 9 Dec 2022 • Shuliang Ning, Mengcheng Lan, Yanran Li, Chaofeng Chen, Qian Chen, Xunlai Chen, Xiaoguang Han, Shuguang Cui

Mainstream approaches to video prediction build their models on a Single-In-Single-Out (SISO) architecture, which takes the current frame as input to predict the next frame in a recursive manner.

Video Prediction

Mutual Guidance and Residual Integration for Image Enhancement

no code implementations • 25 Nov 2022 • Kun Zhou, Kenkun Liu, Wenbo Li, Xiaoguang Han, Jiangbo Lu

To address those issues, we propose a novel mutual guidance network (MGN) to perform effective bidirectional global-local information exchange while keeping a compact architecture.

Computational Efficiency Image Enhancement +1

Learning 3D Scene Priors with 2D Supervision

no code implementations • CVPR 2023 • Yinyu Nie, Angela Dai, Xiaoguang Han, Matthias Nießner

Holistic 3D scene understanding entails estimation of both layout configuration and object geometry in a 3D environment.

Decoder Scene Understanding

Point Cloud Scene Completion with Joint Color and Semantic Estimation from Single RGB-D Image

no code implementations • 12 Oct 2022 • Zhaoxuan Zhang, Xiaoguang Han, Bo Dong, Tong Li, BaoCai Yin, Xin Yang

Given a single RGB-D image, our method first predicts its semantic segmentation map and goes through the 3D volume branch to obtain a volumetric scene reconstruction as a guide to the next view inpainting step, which attempts to make up the missing information; the third step involves projecting the volume under the same view of the input, concatenating them to complete the current view RGB-D and segmentation map, and integrating all RGB-D and segmentation maps into the point cloud.

Image Inpainting Segmentation +1

A Survey on Graph Neural Networks and Graph Transformers in Computer Vision: A Task-Oriented Perspective

no code implementations • 27 Sep 2022 • Chaoqi Chen, Yushuang Wu, Qiyuan Dai, Hong-Yu Zhou, Mutian Xu, Sibei Yang, Xiaoguang Han, Yizhou Yu

Graph Neural Networks (GNNs) have gained momentum in graph representation learning and boosted the state of the art in a variety of areas, such as data mining (e.g., social network analysis and recommender systems), computer vision (e.g., object detection and point cloud learning), and natural language processing (e.g., relation extraction and sequence learning), to name a few.

Graph Representation Learning object-detection +3

Towards High-Fidelity Single-view Holistic Reconstruction of Indoor Scenes

1 code implementation • 18 Jul 2022 • Haolin Liu, Yujian Zheng, GuanYing Chen, Shuguang Cui, Xiaoguang Han

We present a new framework to reconstruct holistic 3D indoor scenes including both room background and indoor objects from single-view images.

Object Reconstruction Vocal Bursts Intensity Prediction

Relation Matters: Foreground-aware Graph-based Relational Reasoning for Domain Adaptive Object Detection

no code implementations • 6 Jun 2022 • Chaoqi Chen, Jiongcheng Li, Hong-Yu Zhou, Xiaoguang Han, Yue Huang, Xinghao Ding, Yizhou Yu

However, both the global and local alignment approaches fail to capture the topological relations among different foreground objects as the explicit dependencies and interactions between and within domains are neglected.

Domain Adaptation Graph Attention +5

Multi-level Consistency Learning for Semi-supervised Domain Adaptation

1 code implementation • 9 May 2022 • Zizheng Yan, Yushuang Wu, Guanbin Li, Yipeng Qin, Xiaoguang Han, Shuguang Cui

Semi-supervised domain adaptation (SSDA) aims to apply knowledge learned from a fully labeled source domain to a scarcely labeled target domain.

Domain Adaptation Semi-supervised Domain Adaptation

DArch: Dental Arch Prior-assisted 3D Tooth Instance Segmentation

no code implementations • 25 Apr 2022 • Liangdong Qiu, Chongjie Ye, Pei Chen, Yunbi Liu, Xiaoguang Han, Shuguang Cui

Experimental results on 4,773 dental models have shown our DArch can accurately segment each tooth of a dental model, and its performance is superior to the state-of-the-art methods.

Instance Segmentation Segmentation +1

Registering Explicit to Implicit: Towards High-Fidelity Garment mesh Reconstruction from Single Images

no code implementations • CVPR 2022 • Heming Zhu, Lingteng Qiu, Yuda Qiu, Xiaoguang Han

Fueled by the power of deep learning techniques and implicit shape learning, recent advances in single-image human digitalization have reached unprecedented accuracy and could recover fine-grained surface details such as garment wrinkles.

Garment Reconstruction

SharpContour: A Contour-based Boundary Refinement Approach for Efficient and Accurate Instance Segmentation

no code implementations • CVPR 2022 • Chenming Zhu, Xuanye Zhang, Yanran Li, Liangdong Qiu, Kai Han, Xiaoguang Han

Contour-based models are efficient and generic enough to be incorporated into any existing segmentation method, but they often generate over-smoothed contours and tend to fail in corner areas.

Instance Segmentation Segmentation +1

TO-Scene: A Large-scale Dataset for Understanding 3D Tabletop Scenes

1 code implementation • 17 Mar 2022 • Mutian Xu, Pei Chen, Haolin Liu, Xiaoguang Han

Experiments show that the algorithms trained on TO-Scene indeed work on the realistic test data, and our proposed tabletop-aware learning strategy greatly improves the state-of-the-art results on both 3D semantic segmentation and object detection tasks.

3D Semantic Segmentation object-detection +2

Real-World Blind Super-Resolution via Feature Matching with Implicit High-Resolution Priors

2 code implementations • 26 Feb 2022 • Chaofeng Chen, Xinyu Shi, Yipeng Qin, Xiaoming Li, Xiaoguang Han, Tao Yang, Shihui Guo

Unlike image-space methods, our FeMaSR restores HR images by matching distorted LR image features to their distortion-free HR counterparts in our pretrained HR priors, and decoding the matched features to obtain realistic HR images.

Blind Super-Resolution Decoder +3

PointMatch: A Consistency Training Framework for Weakly Supervised Semantic Segmentation of 3D Point Clouds

no code implementations • 22 Feb 2022 • Yushuang Wu, Zizheng Yan, Shengcai Cai, Guanbin Li, Yizhou Yu, Xiaoguang Han, Shuguang Cui

Semantic segmentation of point clouds usually relies on dense annotation, which is exhausting and costly, so solutions for the weakly supervised scheme with only sparse points annotated have attracted wide attention.

Representation Learning Weakly supervised Semantic Segmentation +1

DArch: Dental Arch Prior-Assisted 3D Tooth Instance Segmentation With Weak Annotations

no code implementations • CVPR 2022 • Liangdong Qiu, Chongjie Ye, Pei Chen, Yunbi Liu, Xiaoguang Han, Shuguang Cui

Experimental results on 4,773 dental models have shown our DArch can accurately segment each tooth of a dental model, and its performance is superior to the state-of-the-art methods.

Instance Segmentation Segmentation +1

ETHSeg: An Amodel Instance Segmentation Network and a Real-World Dataset for X-Ray Waste Inspection

no code implementations • CVPR 2022 • Lingteng Qiu, Zhangyang Xiong, Xuhao Wang, Kenkun Liu, Yihan Li, GuanYing Chen, Xiaoguang Han, Shuguang Cui

Inspired by the fact that X-ray has a strong penetrating power to see through the bag and overlapping objects, we propose to perform waste inspection efficiently using X-ray images without the need to open the bag.

Instance Segmentation Segmentation +1

Pose2Room: Understanding 3D Scenes from Human Activities

no code implementations • 1 Dec 2021 • Yinyu Nie, Angela Dai, Xiaoguang Han, Matthias Nießner

To this end, we propose P2R-Net to learn a probabilistic 3D model of the objects in a scene characterized by their class categories and oriented 3D bounding boxes, based on an input observed human trajectory in the environment.

Object

SketchHairSalon: Deep Sketch-based Hair Image Synthesis

no code implementations • 16 Sep 2021 • Chufeng Xiao, Deng Yu, Xiaoguang Han, Youyi Zheng, Hongbo Fu

At the second stage, another network is trained to synthesize the structure and appearance of hair images from the input sketch and the generated matte.

Image Generation

Preservational Learning Improves Self-supervised Medical Image Models by Reconstructing Diverse Contexts

2 code implementations • ICCV 2021 • Hong-Yu Zhou, Chixiang Lu, Sibei Yang, Xiaoguang Han, Yizhou Yu

From this perspective, we introduce Preservational Learning to reconstruct diverse image contexts in order to preserve more information in learned representations.

Contrastive Learning Representation Learning +1

ME-PCN: Point Completion Conditioned on Mask Emptiness

1 code implementation • ICCV 2021 • Bingchen Gong, Yinyu Nie, Yiqun Lin, Xiaoguang Han, Yizhou Yu

Main-stream methods predict the missing shapes by decoding a global feature learned from the input point cloud, which often leads to deficient results in preserving topology consistency and surface details.

SimpModeling: Sketching Implicit Field to Guide Mesh Modeling for 3D Animalmorphic Head Design

1 code implementation • 5 Aug 2021 • Zhongjin Luo, Jie Zhou, Heming Zhu, Dong Du, Xiaoguang Han, Hongbo Fu

In this work, we propose SimpModeling, a novel sketch-based system for helping users, especially amateur users, easily model 3D animalmorphic heads, a prevalent kind of head in character design.

From Single to Multiple: Leveraging Multi-level Prediction Spaces for Video Forecasting

no code implementations • 21 Jul 2021 • Mengcheng Lan, Shuliang Ning, Yanran Li, Qian Chen, Xunlai Chen, Xiaoguang Han, Shuguang Cui

Although video forecasting has been a widely explored topic in recent years, most existing work still limits its models to a single prediction space and completely neglects how to leverage multiple prediction spaces.

Video Prediction

Transformer with Peak Suppression and Knowledge Guidance for Fine-grained Image Recognition

no code implementations • 14 Jul 2021 • Xinda Liu, Lili Wang, Xiaoguang Han

In this paper, we analyze the difficulties of fine-grained image recognition from a new perspective and propose a transformer architecture with the peak suppression module and knowledge guidance module, which respects the diversification of discriminative features in a single image and the aggregation of discriminative clues among multiple images.

Fine-Grained Image Classification Fine-Grained Image Recognition

Task-Aware Sampling Layer for Point-Wise Analysis

no code implementations • 9 Jul 2021 • Yiqun Lin, Lichang Chen, Haibin Huang, Chongyang Ma, Xiaoguang Han, Shuguang Cui

Sampling, grouping, and aggregation are three important components in the multi-scale analysis of point clouds.

Keypoint Detection Point Cloud Completion +1

RfD-Net: Point Scene Understanding by Semantic Instance Reconstruction

1 code implementation • CVPR 2021 • Yinyu Nie, Ji Hou, Xiaoguang Han, Matthias Nießner

In this work, we introduce RfD-Net that jointly detects and reconstructs dense object surfaces directly from raw point clouds.

3D geometry Object +5

A deep learning based interactive sketching system for fashion images design

no code implementations • 9 Oct 2020 • Yao Li, Xianggang Yu, Xiaoguang Han, Nianjuan Jiang, Kui Jia, Jiangbo Lu

In this work, we propose an interactive system to design diverse high-quality garment images from fashion sketches and the texture information.

Intrinsic Image Decomposition Texture Synthesis

Reinforcement Learning for Weakly Supervised Temporal Grounding of Natural Language in Untrimmed Videos

no code implementations • 18 Sep 2020 • Jie Wu, Guanbin Li, Xiaoguang Han, Liang Lin

Temporal grounding of natural language in untrimmed videos is a fundamental yet challenging multimedia task facilitating cross-media visual content retrieval.

cross-modal alignment reinforcement-learning +3

Ultrasound Liver Fibrosis Diagnosis using Multi-indicator guided Deep Neural Networks

no code implementations • 10 Sep 2020 • Jiali Liu, Wenxuan Wang, Tianyao Guan, Ningbo Zhao, Xiaoguang Han, Zhen Li

An indicator-guided learning mechanism is further proposed to ease the training of the proposed model.

SkeletonNet: A Topology-Preserving Solution for Learning Mesh Reconstruction of Object Surfaces from RGB Images

1 code implementation • 13 Aug 2020 • Jiapeng Tang, Xiaoguang Han, Mingkui Tan, Xin Tong, Kui Jia

However, they all have their own drawbacks, and cannot properly reconstruct the surface shapes of complex topologies, arguably due to a lack of constraints on the topological structures in their learning frameworks.

Surface Reconstruction

Deep Fashion3D: A Dataset and Benchmark for 3D Garment Reconstruction from Single Images

2 code implementations • ECCV 2020 • Heming Zhu, Yu Cao, Hang Jin, Weikai Chen, Dong Du, Zhangye Wang, Shuguang Cui, Xiaoguang Han

High-fidelity clothing reconstruction is the key to achieving photorealism in a wide range of applications including human digitization, virtual try-on, etc.

Garment Reconstruction Virtual Try-on

Learning Inverse Rendering of Faces from Real-world Videos

1 code implementation • 26 Mar 2020 • Yuda Qiu, Zhangyang Xiong, Kai Han, Zhongyuan Wang, Zixiang Xiong, Xiaoguang Han

To alleviate this problem, we propose a weakly supervised training approach to train our model on real face videos, based on the assumption of consistency of albedo and normal across different frames, thus bridging the gap between real and synthetic face images.

Inverse Rendering

Peeking into occluded joints: A novel framework for crowd pose estimation

1 code implementation • ECCV 2020 • Lingteng Qiu, Xuanye Zhang, Yan-ran Li, Guanbin Li, Xiao-Jun Wu, Zixiang Xiong, Xiaoguang Han, Shuguang Cui

Although occlusion widely exists in nature and remains a fundamental challenge for pose estimation, existing heatmap-based approaches suffer serious degradation on occlusions.

Pose Estimation

Shallow2Deep: Indoor Scene Modeling by Single Image Understanding

no code implementations • 22 Feb 2020 • Yinyu Nie, Shihui Guo, Jian Chang, Xiaoguang Han, Jiahui Huang, Shi-Min Hu, Jian Jun Zhang

Particularly, we design a shallow-to-deep architecture on the basis of convolutional networks for semantic scene understanding and modeling.

3D geometry Relation Network +1

Self-Enhanced Convolutional Network for Facial Video Hallucination

no code implementations • 23 Nov 2019 • Chaowei Fang, Guanbin Li, Xiaoguang Han, Yizhou Yu

It further recurrently exploits the reconstructed results and intermediate features of a sequence of preceding frames to improve the initial super-resolution of the current frame by modelling the coherence of structural facial features across frames.

Hallucination Video Super-Resolution

Deep Reinforcement Learning of Volume-guided Progressive View Inpainting for 3D Point Scene Completion from a Single Depth Image

no code implementations • CVPR 2019 • Xiaoguang Han, Zhaoxuan Zhang, Dong Du, Mingdai Yang, Jingming Yu, Pan Pan, Xin Yang, Ligang Liu, Zixiang Xiong, Shuguang Cui

Given a single depth image, our method first goes through the 3D volume branch to obtain a volumetric scene reconstruction as a guide to the next view inpainting step, which attempts to make up the missing information; the third step involves projecting the volume under the same view of the input, concatenating them to complete the current view depth, and integrating all depth into the point cloud.

Two-phase Hair Image Synthesis by Self-Enhancing Generative Model

no code implementations • 28 Feb 2019 • Haonan Qiu, Chuan Wang, Hang Zhu, Xiangyu Zhu, Jinjin Gu, Xiaoguang Han

Generating a plausible hair image given limited guidance, such as sparse sketches or a low-resolution image, has been made possible with the rise of Generative Adversarial Networks (GANs).

Image-to-Image Translation Super-Resolution +2

Learning Mutually Local-global U-nets For High-resolution Retinal Lesion Segmentation in Fundus Images

no code implementations • 18 Jan 2019 • Zizheng Yan, Xiaoguang Han, Changmiao Wang, Yuda Qiu, Zixiang Xiong, Shuguang Cui

Because fundus images are high-resolution and lesion regions are small, applying existing methods, such as U-Nets, to perform segmentation on fundus photography is very challenging.

Decoder Lesion Segmentation +1

Deep RBFNet: Point Cloud Feature Learning using Radial Basis Functions

no code implementations • 11 Dec 2018 • Weikai Chen, Xiaoguang Han, Guanbin Li, Chao Chen, Jun Xing, Yajie Zhao, Hao Li

Three-dimensional object recognition has recently achieved great progress thanks to the development of effective point cloud-based learning frameworks, such as PointNet and its extensions.

3D Object Recognition

Adversarial 3D Human Pose Estimation via Multimodal Depth Supervision

no code implementations • 21 Sep 2018 • Kun Zhou, Jinmiao Cai, Yao Li, Yulong Shi, Xiaoguang Han, Nianjuan Jiang, Kui Jia, Jiangbo Lu

In this paper, a novel deep-learning based framework is proposed to infer 3D human poses from a single image.

3D Human Pose Estimation

CaricatureShop: Personalized and Photorealistic Caricature Sketching

no code implementations • 24 Jul 2018 • Xiaoguang Han, Kangcheng Hou, Dong Du, Yuda Qiu, Yizhou Yu, Kun Zhou, Shuguang Cui

To construct the mapping between 2D sketches and a vertex-wise scaling field, a novel deep learning architecture is developed.

Caricature Face Model

FBI-Pose: Towards Bridging the Gap between 2D Images and 3D Human Poses using Forward-or-Backward Information

no code implementations • 25 Jun 2018 • Yulong Shi, Xiaoguang Han, Nianjuan Jiang, Kun Zhou, Kui Jia, Jiangbo Lu

Although significant advances have been made in the area of human pose estimation from images using deep Convolutional Neural Networks (ConvNets), it remains a big challenge to perform 3D pose inference in-the-wild.

3D Human Pose Estimation

Video Inpainting by Jointly Learning Temporal Structure and Spatial Details

no code implementations • 22 Jun 2018 • Chuan Wang, Haibin Huang, Xiaoguang Han, Jue Wang

We present a new data-driven video inpainting method for recovering missing regions of video frames.

Video Inpainting

High-Resolution Shape Completion Using Deep Neural Networks for Global Structure and Local Geometry Inference

no code implementations • ICCV 2017 • Xiaoguang Han, Zhen Li, Haibin Huang, Evangelos Kalogerakis, Yizhou Yu

Our method is based on a new deep learning architecture consisting of two sub-networks: a global structure inference network and a local geometry refinement network.

Decoder

DeepSketch2Face: A Deep Learning Based Sketching System for 3D Face and Caricature Modeling

no code implementations • 7 Jun 2017 • Xiaoguang Han, Chang Gao, Yizhou Yu

This system has a labor-efficient sketching interface that allows the user to draw freehand, imprecise yet expressive 2D lines representing the contours of facial features.

Caricature
