Search Results for author: Hongwen Zhang

Found 44 papers, 21 papers with code

Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives

1 code implementation • 15 Mar 2024 • Ronghui Li, Yuxiang Zhang, Yachao Zhang, Hongwen Zhang, Jie Guo, Yan Zhang, Yebin Liu, Xiu Li

In contrast, the second-stage is the local diffusion, which parallelly generates detailed motion sequences under the guidance of the dance primitives and choreographic rules.

Ranked #1 on Motion Synthesis on FineDance

Motion Synthesis

Paper
Code

STAF: 3D Human Mesh Recovery from Video with Spatio-Temporal Alignment Fusion

1 code implementation • 3 Jan 2024 • Wei Yao, Hongwen Zhang, Yunlian Sun, Jinhui Tang

This method can remarkably improve the smoothness of recovery results from video.

Ranked #45 on 3D Human Pose Estimation on 3DPW (using extra training data)

3D Human Pose Estimation Human Mesh Recovery

Paper
Code

Ins-HOI: Instance Aware Human-Object Interactions Recovery

1 code implementation • 15 Dec 2023 • Jiajun Zhang, Yuxiang Zhang, Hongwen Zhang, Xiao Zhou, Boyao Zhou, Ruizhi Shao, Zonghai Hu, Yebin Liu

To address this, we further propose a complementary training strategy that leverages synthetic data to introduce instance-level shape priors, enabling the disentanglement of occupancy fields for different instances.

Descriptive Disentanglement +3

Paper
Code

Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians

1 code implementation • 5 Dec 2023 • Yuelang Xu, Benwang Chen, Zhe Li, Hongwen Zhang, Lizhen Wang, Zerong Zheng, Yebin Liu

Creating high-fidelity 3D head avatars has always been a research hotspot, but there remains a great challenge under lightweight sparse view setups.

618

Paper
Code

FG-MDM: Towards Zero-Shot Human Motion Generation via Fine-Grained Descriptions

no code implementations • 5 Dec 2023 • Xu Shi, Wei Yao, Chuanchen Luo, Junran Peng, Hongwen Zhang, Yunlian Sun

By adopting a divide-and-conquer strategy, we propose a new framework named Fine-Grained Human Motion Diffusion Model (FG-MDM) for zero-shot human motion generation.

Language Modelling Large Language Model

Paper
Add Code

GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians

1 code implementation • 4 Dec 2023 • Liangxiao Hu, Hongwen Zhang, Yuxiang Zhang, Boyao Zhou, Boning Liu, Shengping Zhang, Liqiang Nie

We present GaussianAvatar, an efficient approach to creating realistic human avatars with dynamic 3D appearances from a single video.

Motion Estimation

274

Paper
Code

W-HMR: Human Mesh Recovery in World Space with Weak-supervised Camera Calibration and Orientation Correction

1 code implementation • 29 Nov 2023 • Wei Yao, Hongwen Zhang, Yunlian Sun, Jinhui Tang

We propose a novel orientation correction module to allow the reconstructed human body to remain normal in world space.

Ranked #1 on 3D Human Pose Estimation on SPEC-MTP (using extra training data)

3D Human Pose Estimation Camera Calibration +1

Paper
Code

HumanNorm: Learning Normal Diffusion Model for High-quality and Realistic 3D Human Generation

no code implementations • 2 Oct 2023 • Xin Huang, Ruizhi Shao, Qi Zhang, Hongwen Zhang, Ying Feng, Yebin Liu, Qing Wang

The main idea is to enhance the model's 2D perception of 3D geometry by learning a normal-adapted diffusion model and a normal-aligned diffusion model.

Text to 3D Texture Synthesis

Paper
Add Code

HAvatar: High-fidelity Head Avatar via Facial Model Conditioned Neural Radiance Field

no code implementations • 29 Sep 2023 • Xiaochen Zhao, Lizhen Wang, Jingxiang Sun, Hongwen Zhang, Jinli Suo, Yebin Liu

The problem of modeling an animatable 3D human head avatar under light-weight setups is of significant importance but has not been well solved.

Image-to-Image Translation

Paper
Add Code

Leveraging Intrinsic Properties for Non-Rigid Garment Alignment

no code implementations • ICCV 2023 • Siyou Lin, Boyao Zhou, Zerong Zheng, Hongwen Zhang, Yebin Liu

To achieve wrinkle-level as well as texture-level alignment, we present a novel coarse-to-fine two-stage method that leverages intrinsic manifold properties with two neural deformation fields, in the 3D space and the intrinsic space, respectively.

Paper
Add Code

CaPhy: Capturing Physical Properties for Animatable Human Avatars

no code implementations • ICCV 2023 • Zhaoqi Su, Liangxiao Hu, Siyou Lin, Hongwen Zhang, Shengping Zhang, Justus Thies, Yebin Liu

In contrast to previous work on 3D avatar reconstruction, our method is able to generalize to novel poses with realistic dynamic cloth deformations.

Paper
Add Code

ProxyCap: Real-time Monocular Full-body Capture in World Space via Human-Centric Proxy-to-Motion Learning

no code implementations • 3 Jul 2023 • Yuxiang Zhang, Hongwen Zhang, Liangxiao Hu, Jiajun Zhang, Hongwei Yi, Shengping Zhang, Yebin Liu

For more accurate and physically plausible predictions in world space, our network is designed to learn human motions from a human-centric perspective, which enables the understanding of the same motion captured with different camera trajectories.

Ranked #208 on 3D Human Pose Estimation on Human3.6M

3D Human Pose Estimation

Paper
Add Code

Control4D: Efficient 4D Portrait Editing with Text

no code implementations • 31 May 2023 • Ruizhi Shao, Jingxiang Sun, Cheng Peng, Zerong Zheng, Boyao Zhou, Hongwen Zhang, Yebin Liu

We introduce Control4D, an innovative framework for editing dynamic 4D portraits using text instructions.

Paper
Add Code

Learning Explicit Contact for Implicit Reconstruction of Hand-held Objects from Monocular Images

no code implementations • 31 May 2023 • Junxing Hu, Hongwen Zhang, Zerui Chen, Mengcheng Li, Yunlong Wang, Yebin Liu, Zhenan Sun

In the second part, we introduce a novel method to diffuse estimated contact states from the hand mesh surface to nearby 3D space and leverage diffused contact probabilities to construct the implicit neural representation for the manipulated object.

Object

Paper
Add Code

AvatarReX: Real-time Expressive Full-body Avatars

no code implementations • 8 May 2023 • Zerong Zheng, Xiaochen Zhao, Hongwen Zhang, Boning Liu, Yebin Liu

We present AvatarReX, a new method for learning NeRF-based full-body avatars from video data.

Disentanglement

Paper
Add Code

LatentAvatar: Learning Latent Expression Code for Expressive Neural Head Avatar

no code implementations • 2 May 2023 • Yuelang Xu, Hongwen Zhang, Lizhen Wang, Xiaochen Zhao, Han Huang, GuoJun Qi, Yebin Liu

Existing approaches to animatable NeRF-based head avatars are either built upon face templates or use the expression coefficients of templates as the driving signal.

Paper
Add Code

StyleAvatar: Real-time Photo-realistic Portrait Avatar from a Single Video

1 code implementation • 1 May 2023 • Lizhen Wang, Xiaochen Zhao, Jingxiang Sun, Yuxiang Zhang, Hongwen Zhang, Tao Yu, Yebin Liu

Results and experiments demonstrate the superiority of our method in terms of image quality, full portrait video generation, and real-time re-animation compared to existing facial reenactment methods.

Face Reenactment Translation +1

352

Paper
Code

CloSET: Modeling Clothed Humans on Continuous Surface with Explicit Template Decomposition

no code implementations • CVPR 2023 • Hongwen Zhang, Siyou Lin, Ruizhi Shao, Yuxiang Zhang, Zerong Zheng, Han Huang, Yandong Guo, Yebin Liu

In this way, the clothing deformations are disentangled such that the pose-dependent wrinkles can be better learned and applied to unseen poses.

Paper
Add Code

Narrator: Towards Natural Control of Human-Scene Interaction Generation via Relationship Reasoning

no code implementations • ICCV 2023 • Haibiao Xuan, Xiongzheng Li, Jinsong Zhang, Hongwen Zhang, Yebin Liu, Kun Li

Also, we model global and local spatial relationships in a 3D scene and a textual description respectively based on the scene graph, and introduce a partlevel action mechanism to represent interactions as atomic body part states.

Paper
Add Code

Delving Deep into Pixel Alignment Feature for Accurate Multi-view Human Mesh Recovery

no code implementations • 15 Jan 2023 • Kai Jia, Hongwen Zhang, Liang An, Yebin Liu

The key components of a typical regressor lie in the feature extraction of input views and the fusion of multi-view features.

Human Mesh Recovery regression

Paper
Add Code

Tensor4D: Efficient Neural 4D Decomposition for High-Fidelity Dynamic Reconstruction and Rendering

no code implementations • CVPR 2023 • Ruizhi Shao, Zerong Zheng, Hanzhang Tu, Boning Liu, Hongwen Zhang, Yebin Liu

The key of our solution is an efficient 4D tensor decomposition method so that the dynamic scene can be directly represented as a 4D spatio-temporal tensor.

Dynamic Reconstruction Tensor Decomposition

Paper
Add Code

AvatarMAV: Fast 3D Head Avatar Reconstruction Using Motion-Aware Neural Voxels

no code implementations • 23 Nov 2022 • Yuelang Xu, Lizhen Wang, Xiaochen Zhao, Hongwen Zhang, Yebin Liu

AvatarMAV is the first to model both the canonical appearance and the decoupled expression motion by neural voxels for head avatar.

Paper
Add Code

Next3D: Generative Neural Texture Rasterization for 3D-Aware Head Avatars

2 code implementations • CVPR 2023 • Jingxiang Sun, Xuan Wang, Lizhen Wang, Xiaoyu Li, Yong Zhang, Hongwen Zhang, Yebin Liu

We propose a novel 3D GAN framework for unsupervised learning of generative, high-quality and 3D-consistent facial avatars from unstructured 2D images.

Face Model

458

Paper
Code

Tensor4D : Efficient Neural 4D Decomposition for High-fidelity Dynamic Reconstruction and Rendering

1 code implementation • 21 Nov 2022 • Ruizhi Shao, Zerong Zheng, Hanzhang Tu, Boning Liu, Hongwen Zhang, Yebin Liu

The key of our solution is an efficient 4D tensor decomposition method so that the dynamic scene can be directly represented as a 4D spatio-temporal tensor.

Dynamic Reconstruction Tensor Decomposition

179

Paper
Code

DiffuStereo: High Quality Human Reconstruction via Diffusion-based Stereo Using Sparse Cameras

no code implementations • 16 Jul 2022 • Ruizhi Shao, Zerong Zheng, Hongwen Zhang, Jingxiang Sun, Yebin Liu

At its core is a novel diffusion-based stereo module, which introduces diffusion models, a type of powerful generative models, into the iterative stereo matching network.

3D Human Reconstruction 4k +2

Paper
Add Code

Learning Implicit Templates for Point-Based Clothed Human Modeling

1 code implementation • 14 Jul 2022 • Siyou Lin, Hongwen Zhang, Zerong Zheng, Ruizhi Shao, Yebin Liu

We present FITE, a First-Implicit-Then-Explicit framework for modeling human avatars in clothing.

Paper
Code

PyMAF-X: Towards Well-aligned Full-body Model Regression from Monocular Images

1 code implementation • 13 Jul 2022 • Hongwen Zhang, Yating Tian, Yuxiang Zhang, Mengcheng Li, Liang An, Zhenan Sun, Yebin Liu

To address these issues, we propose a Pyramidal Mesh Alignment Feedback (PyMAF) loop in our regression network for well-aligned human mesh recovery and extend it as PyMAF-X for the recovery of expressive full-body models.

Ranked #6 on 3D Human Pose Estimation on AGORA (using extra training data)

3D human pose and shape estimation Human Mesh Recovery +2

576

Paper
Code

AvatarCap: Animatable Avatar Conditioned Monocular Human Volumetric Capture

1 code implementation • 5 Jul 2022 • Zhe Li, Zerong Zheng, Hongwen Zhang, Chaonan Ji, Yebin Liu

Then given a monocular RGB video of this subject, our method integrates information from both the image observation and the avatar prior, and accordingly recon-structs high-fidelity 3D textured models with dynamic details regardless of the visibility.

177

Paper
Code

Structured Local Radiance Fields for Human Avatar Modeling

no code implementations • CVPR 2022 • Zerong Zheng, Han Huang, Tao Yu, Hongwen Zhang, Yandong Guo, Yebin Liu

These local radiance fields not only leverage the flexibility of implicit representation in shape and appearance modeling, but also factorize cloth deformations into skeleton motions, node residual translations and the dynamic detail variations inside each individual radiance field.

Paper
Add Code

Interacting Attention Graph for Single Image Two-Hand Reconstruction

1 code implementation • CVPR 2022 • Mengcheng Li, Liang An, Hongwen Zhang, Lianpeng Wu, Feng Chen, Tao Yu, Yebin Liu

To solve occlusion and interaction challenges of two-hand reconstruction, we introduce two novel attention based modules in each upsampling step of the original GCN.

Ranked #4 on 3D Interacting Hand Pose Estimation on InterHand2.6M

3D Interacting Hand Pose Estimation Vocal Bursts Valence Prediction

258

Paper
Code

Recovering 3D Human Mesh from Monocular Images: A Survey

1 code implementation • 3 Mar 2022 • Yating Tian, Hongwen Zhang, Yebin Liu, LiMin Wang

Since the release of statistical body models, 3D human mesh recovery has been drawing broader attention.

3D human pose and shape estimation Human Mesh Recovery

329

Paper
Code

DoubleField: Bridging the Neural Surface and Radiance Fields for High-fidelity Human Reconstruction and Rendering

no code implementations • CVPR 2022 • Ruizhi Shao, Hongwen Zhang, He Zhang, Mingjia Chen, YanPei Cao, Tao Yu, Yebin Liu

We introduce DoubleField, a novel framework combining the merits of both surface field and radiance field for high-fidelity human reconstruction and rendering.

Transfer Learning

Paper
Add Code

PyMAF: 3D Human Pose and Shape Regression with Pyramidal Mesh Alignment Feedback Loop

2 code implementations • ICCV 2021 • Hongwen Zhang, Yating Tian, Xinchi Zhou, Wanli Ouyang, Yebin Liu, LiMin Wang, Zhenan Sun

Regression-based methods have recently shown promising results in reconstructing human meshes from monocular images.

Ranked #5 on 3D Human Pose Estimation on AGORA (using extra training data)

3D human pose and shape estimation 3D Human Reconstruction +2

576

Paper
Code

Evolving Search Space for Neural Architecture Search

1 code implementation • ICCV 2021 • Yuanzheng Ci, Chen Lin, Ming Sun, BoYu Chen, Hongwen Zhang, Wanli Ouyang

The automation of neural architecture design has been a coveted alternative to human experts.

Knowledge Distillation Neural Architecture Search

Paper
Code

Rethinking Pseudo-LiDAR Representation

1 code implementation • ECCV 2020 • Xinzhu Ma, Shinan Liu, Zhiyi Xia, Hongwen Zhang, Xingyu Zeng, Wanli Ouyang

Based on this observation, we design an image based CNN detector named Patch-Net, which is more generalized and can be instantiated as pseudo-LiDAR based 3D detectors.

161

Paper
Code

Cheaper Pre-training Lunch: An Efficient Paradigm for Object Detection

no code implementations • ECCV 2020 • Dongzhan Zhou, Xinchi Zhou, Hongwen Zhang, Shuai Yi, Wanli Ouyang

In this paper, we propose a general and efficient pre-training paradigm, Montage pre-training, for object detection.

object-detection Object Detection

Paper
Add Code

Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition

4 code implementations • CVPR 2020 • Ziyu Liu, Hongwen Zhang, Zhenghao Chen, Zhiyong Wang, Wanli Ouyang

Spatial-temporal graphs have been widely used by skeleton-based action recognition algorithms to model human action dynamics.

Ranked #4 on 3D Action Recognition on Assembly101

Long-range modeling Skeleton Based Action Recognition

854

Paper
Code

Learning 3D Human Shape and Pose from Dense Body Parts

1 code implementation • 31 Dec 2019 • Hongwen Zhang, Jie Cao, Guo Lu, Wanli Ouyang, Zhenan Sun

Reconstructing 3D human shape and pose from monocular images is challenging despite the promising results achieved by the most recent learning-based methods.

Ranked #75 on 3D Human Pose Estimation on 3DPW (MPJPE metric)

3D human pose and shape estimation 3D Human Reconstruction +3

217

Paper
Code

Adversarial Learning Semantic Volume for 2D/3D Face Shape Regression in the Wild

1 code implementation • IEEE Transactions on Image Processing 2019 • Hongwen Zhang, Qi Li, Zhenan Sun

Then, an end-to-end pipeline is designed to jointly regress the proposed volumetric representation and the coordinate vector.

Ranked #3 on Face Alignment on AFLW2000-3D

3D Face Alignment 3D Facial Landmark Localization +2

151

Paper
Code

Learning a High Fidelity Pose Invariant Model for High-resolution Face Frontalization

no code implementations • NeurIPS 2018 • Jie Cao, Yibo Hu, Hongwen Zhang, Ran He, Zhenan Sun

We decompose the prerequisite of warping into dense correspondence field estimation and facial texture map recovering, which are both well addressed by deep networks.

Dictionary Learning Face Recognition +2

Paper
Add Code

Android Malware Detection using Large-scale Network Representation Learning

no code implementations • 13 Jun 2018 • Rui Zhu, Chenglin Li, Di Niu, Hongwen Zhang, Husam Kinawi

With the growth of mobile devices and applications, the number of malicious software, or malware, is rapidly increasing in recent years, which calls for the development of advanced and effective malware detection approaches.

Cryptography and Security

Paper
Add Code

Android Malware Detection based on Factorization Machine

no code implementations • 30 May 2018 • Chenglin Li, Keith Mills, Rui Zhu, Di Niu, Hongwen Zhang, Husam Kinawi

As the popularity of Android smart phones has increased in recent years, so too has the number of malicious applications.

Cryptography and Security

Paper
Add Code

Joint Voxel and Coordinate Regression for Accurate 3D Facial Landmark Localization

no code implementations • 28 Jan 2018 • Hongwen Zhang, Qi Li, Zhenan Sun

Then, a stacked hourglass network is adopted to estimate the volumetric representation from coarse to fine, followed by a 3D convolution network that takes the estimated volume as input and regresses 3D coordinates of the face shape.

Ranked #1 on 3D Facial Landmark Localization on AFLW2000-3D

3D Facial Landmark Localization Depth Estimation +2

Paper
Add Code

Combining Data-driven and Model-driven Methods for Robust Facial Landmark Detection

1 code implementation • 30 Nov 2016 • Hongwen Zhang, Qi Li, Zhenan Sun, Yunfan Liu

This Estimation-Correction-Tuning process perfectly combines the advantages of the global robustness of data-driven method (FCN), outlier correction capability of model-driven method (PDM) and non-parametric optimization of RLMS.

Facial Landmark Detection

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.