Search Results for author: Hongwen Zhang

Found 44 papers, 21 papers with code

Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives

1 code implementation15 Mar 2024 Ronghui Li, Yuxiang Zhang, Yachao Zhang, Hongwen Zhang, Jie Guo, Yan Zhang, Yebin Liu, Xiu Li

In contrast, the second-stage is the local diffusion, which parallelly generates detailed motion sequences under the guidance of the dance primitives and choreographic rules.

Motion Synthesis

Ins-HOI: Instance Aware Human-Object Interactions Recovery

1 code implementation15 Dec 2023 Jiajun Zhang, Yuxiang Zhang, Hongwen Zhang, Xiao Zhou, Boyao Zhou, Ruizhi Shao, Zonghai Hu, Yebin Liu

To address this, we further propose a complementary training strategy that leverages synthetic data to introduce instance-level shape priors, enabling the disentanglement of occupancy fields for different instances.

Descriptive Disentanglement +3

Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians

1 code implementation5 Dec 2023 Yuelang Xu, Benwang Chen, Zhe Li, Hongwen Zhang, Lizhen Wang, Zerong Zheng, Yebin Liu

Creating high-fidelity 3D head avatars has always been a research hotspot, but there remains a great challenge under lightweight sparse view setups.

2k

FG-MDM: Towards Zero-Shot Human Motion Generation via Fine-Grained Descriptions

no code implementations5 Dec 2023 Xu Shi, Wei Yao, Chuanchen Luo, Junran Peng, Hongwen Zhang, Yunlian Sun

By adopting a divide-and-conquer strategy, we propose a new framework named Fine-Grained Human Motion Diffusion Model (FG-MDM) for zero-shot human motion generation.

Language Modelling Large Language Model

GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians

1 code implementation4 Dec 2023 Liangxiao Hu, Hongwen Zhang, Yuxiang Zhang, Boyao Zhou, Boning Liu, Shengping Zhang, Liqiang Nie

We present GaussianAvatar, an efficient approach to creating realistic human avatars with dynamic 3D appearances from a single video.

Motion Estimation

W-HMR: Human Mesh Recovery in World Space with Weak-supervised Camera Calibration and Orientation Correction

1 code implementation29 Nov 2023 Wei Yao, Hongwen Zhang, Yunlian Sun, Jinhui Tang

We propose a novel orientation correction module to allow the reconstructed human body to remain normal in world space.

 Ranked #1 on 3D Human Pose Estimation on SPEC-MTP (using extra training data)

3D Human Pose Estimation Camera Calibration +1

HumanNorm: Learning Normal Diffusion Model for High-quality and Realistic 3D Human Generation

no code implementations2 Oct 2023 Xin Huang, Ruizhi Shao, Qi Zhang, Hongwen Zhang, Ying Feng, Yebin Liu, Qing Wang

The main idea is to enhance the model's 2D perception of 3D geometry by learning a normal-adapted diffusion model and a normal-aligned diffusion model.

Text to 3D Texture Synthesis

HAvatar: High-fidelity Head Avatar via Facial Model Conditioned Neural Radiance Field

no code implementations29 Sep 2023 Xiaochen Zhao, Lizhen Wang, Jingxiang Sun, Hongwen Zhang, Jinli Suo, Yebin Liu

The problem of modeling an animatable 3D human head avatar under light-weight setups is of significant importance but has not been well solved.

Image-to-Image Translation

Leveraging Intrinsic Properties for Non-Rigid Garment Alignment

no code implementations ICCV 2023 Siyou Lin, Boyao Zhou, Zerong Zheng, Hongwen Zhang, Yebin Liu

To achieve wrinkle-level as well as texture-level alignment, we present a novel coarse-to-fine two-stage method that leverages intrinsic manifold properties with two neural deformation fields, in the 3D space and the intrinsic space, respectively.

CaPhy: Capturing Physical Properties for Animatable Human Avatars

no code implementations ICCV 2023 Zhaoqi Su, Liangxiao Hu, Siyou Lin, Hongwen Zhang, Shengping Zhang, Justus Thies, Yebin Liu

In contrast to previous work on 3D avatar reconstruction, our method is able to generalize to novel poses with realistic dynamic cloth deformations.

ProxyCap: Real-time Monocular Full-body Capture in World Space via Human-Centric Proxy-to-Motion Learning

no code implementations3 Jul 2023 Yuxiang Zhang, Hongwen Zhang, Liangxiao Hu, Jiajun Zhang, Hongwei Yi, Shengping Zhang, Yebin Liu

For more accurate and physically plausible predictions in world space, our network is designed to learn human motions from a human-centric perspective, which enables the understanding of the same motion captured with different camera trajectories.

3D Human Pose Estimation

Control4D: Efficient 4D Portrait Editing with Text

no code implementations31 May 2023 Ruizhi Shao, Jingxiang Sun, Cheng Peng, Zerong Zheng, Boyao Zhou, Hongwen Zhang, Yebin Liu

We introduce Control4D, an innovative framework for editing dynamic 4D portraits using text instructions.

Learning Explicit Contact for Implicit Reconstruction of Hand-held Objects from Monocular Images

no code implementations31 May 2023 Junxing Hu, Hongwen Zhang, Zerui Chen, Mengcheng Li, Yunlong Wang, Yebin Liu, Zhenan Sun

In the second part, we introduce a novel method to diffuse estimated contact states from the hand mesh surface to nearby 3D space and leverage diffused contact probabilities to construct the implicit neural representation for the manipulated object.

Object

AvatarReX: Real-time Expressive Full-body Avatars

no code implementations8 May 2023 Zerong Zheng, Xiaochen Zhao, Hongwen Zhang, Boning Liu, Yebin Liu

We present AvatarReX, a new method for learning NeRF-based full-body avatars from video data.

Disentanglement

LatentAvatar: Learning Latent Expression Code for Expressive Neural Head Avatar

no code implementations2 May 2023 Yuelang Xu, Hongwen Zhang, Lizhen Wang, Xiaochen Zhao, Han Huang, GuoJun Qi, Yebin Liu

Existing approaches to animatable NeRF-based head avatars are either built upon face templates or use the expression coefficients of templates as the driving signal.

StyleAvatar: Real-time Photo-realistic Portrait Avatar from a Single Video

1 code implementation1 May 2023 Lizhen Wang, Xiaochen Zhao, Jingxiang Sun, Yuxiang Zhang, Hongwen Zhang, Tao Yu, Yebin Liu

Results and experiments demonstrate the superiority of our method in terms of image quality, full portrait video generation, and real-time re-animation compared to existing facial reenactment methods.

Face Reenactment Translation +1

CloSET: Modeling Clothed Humans on Continuous Surface with Explicit Template Decomposition

no code implementations CVPR 2023 Hongwen Zhang, Siyou Lin, Ruizhi Shao, Yuxiang Zhang, Zerong Zheng, Han Huang, Yandong Guo, Yebin Liu

In this way, the clothing deformations are disentangled such that the pose-dependent wrinkles can be better learned and applied to unseen poses.

Narrator: Towards Natural Control of Human-Scene Interaction Generation via Relationship Reasoning

no code implementations ICCV 2023 Haibiao Xuan, Xiongzheng Li, Jinsong Zhang, Hongwen Zhang, Yebin Liu, Kun Li

Also, we model global and local spatial relationships in a 3D scene and a textual description respectively based on the scene graph, and introduce a partlevel action mechanism to represent interactions as atomic body part states.

Delving Deep into Pixel Alignment Feature for Accurate Multi-view Human Mesh Recovery

no code implementations15 Jan 2023 Kai Jia, Hongwen Zhang, Liang An, Yebin Liu

The key components of a typical regressor lie in the feature extraction of input views and the fusion of multi-view features.

Human Mesh Recovery regression

Tensor4D: Efficient Neural 4D Decomposition for High-Fidelity Dynamic Reconstruction and Rendering

no code implementations CVPR 2023 Ruizhi Shao, Zerong Zheng, Hanzhang Tu, Boning Liu, Hongwen Zhang, Yebin Liu

The key of our solution is an efficient 4D tensor decomposition method so that the dynamic scene can be directly represented as a 4D spatio-temporal tensor.

Dynamic Reconstruction Tensor Decomposition

AvatarMAV: Fast 3D Head Avatar Reconstruction Using Motion-Aware Neural Voxels

no code implementations23 Nov 2022 Yuelang Xu, Lizhen Wang, Xiaochen Zhao, Hongwen Zhang, Yebin Liu

AvatarMAV is the first to model both the canonical appearance and the decoupled expression motion by neural voxels for head avatar.

Next3D: Generative Neural Texture Rasterization for 3D-Aware Head Avatars

2 code implementations CVPR 2023 Jingxiang Sun, Xuan Wang, Lizhen Wang, Xiaoyu Li, Yong Zhang, Hongwen Zhang, Yebin Liu

We propose a novel 3D GAN framework for unsupervised learning of generative, high-quality and 3D-consistent facial avatars from unstructured 2D images.

Face Model

Tensor4D : Efficient Neural 4D Decomposition for High-fidelity Dynamic Reconstruction and Rendering

1 code implementation21 Nov 2022 Ruizhi Shao, Zerong Zheng, Hanzhang Tu, Boning Liu, Hongwen Zhang, Yebin Liu

The key of our solution is an efficient 4D tensor decomposition method so that the dynamic scene can be directly represented as a 4D spatio-temporal tensor.

Dynamic Reconstruction Tensor Decomposition

DiffuStereo: High Quality Human Reconstruction via Diffusion-based Stereo Using Sparse Cameras

no code implementations16 Jul 2022 Ruizhi Shao, Zerong Zheng, Hongwen Zhang, Jingxiang Sun, Yebin Liu

At its core is a novel diffusion-based stereo module, which introduces diffusion models, a type of powerful generative models, into the iterative stereo matching network.

3D Human Reconstruction 4k +2

Learning Implicit Templates for Point-Based Clothed Human Modeling

1 code implementation14 Jul 2022 Siyou Lin, Hongwen Zhang, Zerong Zheng, Ruizhi Shao, Yebin Liu

We present FITE, a First-Implicit-Then-Explicit framework for modeling human avatars in clothing.

PyMAF-X: Towards Well-aligned Full-body Model Regression from Monocular Images

1 code implementation13 Jul 2022 Hongwen Zhang, Yating Tian, Yuxiang Zhang, Mengcheng Li, Liang An, Zhenan Sun, Yebin Liu

To address these issues, we propose a Pyramidal Mesh Alignment Feedback (PyMAF) loop in our regression network for well-aligned human mesh recovery and extend it as PyMAF-X for the recovery of expressive full-body models.

Ranked #6 on 3D Human Pose Estimation on AGORA (using extra training data)

3D human pose and shape estimation Human Mesh Recovery +2

AvatarCap: Animatable Avatar Conditioned Monocular Human Volumetric Capture

1 code implementation5 Jul 2022 Zhe Li, Zerong Zheng, Hongwen Zhang, Chaonan Ji, Yebin Liu

Then given a monocular RGB video of this subject, our method integrates information from both the image observation and the avatar prior, and accordingly recon-structs high-fidelity 3D textured models with dynamic details regardless of the visibility.

Structured Local Radiance Fields for Human Avatar Modeling

no code implementations CVPR 2022 Zerong Zheng, Han Huang, Tao Yu, Hongwen Zhang, Yandong Guo, Yebin Liu

These local radiance fields not only leverage the flexibility of implicit representation in shape and appearance modeling, but also factorize cloth deformations into skeleton motions, node residual translations and the dynamic detail variations inside each individual radiance field.

Recovering 3D Human Mesh from Monocular Images: A Survey

1 code implementation3 Mar 2022 Yating Tian, Hongwen Zhang, Yebin Liu, LiMin Wang

Since the release of statistical body models, 3D human mesh recovery has been drawing broader attention.

3D human pose and shape estimation Human Mesh Recovery

DoubleField: Bridging the Neural Surface and Radiance Fields for High-fidelity Human Reconstruction and Rendering

no code implementations CVPR 2022 Ruizhi Shao, Hongwen Zhang, He Zhang, Mingjia Chen, YanPei Cao, Tao Yu, Yebin Liu

We introduce DoubleField, a novel framework combining the merits of both surface field and radiance field for high-fidelity human reconstruction and rendering.

Transfer Learning

Rethinking Pseudo-LiDAR Representation

1 code implementation ECCV 2020 Xinzhu Ma, Shinan Liu, Zhiyi Xia, Hongwen Zhang, Xingyu Zeng, Wanli Ouyang

Based on this observation, we design an image based CNN detector named Patch-Net, which is more generalized and can be instantiated as pseudo-LiDAR based 3D detectors.

Learning 3D Human Shape and Pose from Dense Body Parts

1 code implementation31 Dec 2019 Hongwen Zhang, Jie Cao, Guo Lu, Wanli Ouyang, Zhenan Sun

Reconstructing 3D human shape and pose from monocular images is challenging despite the promising results achieved by the most recent learning-based methods.

Ranked #75 on 3D Human Pose Estimation on 3DPW (MPJPE metric)

3D human pose and shape estimation 3D Human Reconstruction +3

Learning a High Fidelity Pose Invariant Model for High-resolution Face Frontalization

no code implementations NeurIPS 2018 Jie Cao, Yibo Hu, Hongwen Zhang, Ran He, Zhenan Sun

We decompose the prerequisite of warping into dense correspondence field estimation and facial texture map recovering, which are both well addressed by deep networks.

Dictionary Learning Face Recognition +2

Android Malware Detection using Large-scale Network Representation Learning

no code implementations13 Jun 2018 Rui Zhu, Chenglin Li, Di Niu, Hongwen Zhang, Husam Kinawi

With the growth of mobile devices and applications, the number of malicious software, or malware, is rapidly increasing in recent years, which calls for the development of advanced and effective malware detection approaches.

Cryptography and Security

Android Malware Detection based on Factorization Machine

no code implementations30 May 2018 Chenglin Li, Keith Mills, Rui Zhu, Di Niu, Hongwen Zhang, Husam Kinawi

As the popularity of Android smart phones has increased in recent years, so too has the number of malicious applications.

Cryptography and Security

Joint Voxel and Coordinate Regression for Accurate 3D Facial Landmark Localization

no code implementations28 Jan 2018 Hongwen Zhang, Qi Li, Zhenan Sun

Then, a stacked hourglass network is adopted to estimate the volumetric representation from coarse to fine, followed by a 3D convolution network that takes the estimated volume as input and regresses 3D coordinates of the face shape.

3D Facial Landmark Localization Depth Estimation +2

Combining Data-driven and Model-driven Methods for Robust Facial Landmark Detection

1 code implementation30 Nov 2016 Hongwen Zhang, Qi Li, Zhenan Sun, Yunfan Liu

This Estimation-Correction-Tuning process perfectly combines the advantages of the global robustness of data-driven method (FCN), outlier correction capability of model-driven method (PDM) and non-parametric optimization of RLMS.

Facial Landmark Detection

Cannot find the paper you are looking for? You can Submit a new open access paper.