Search Results for author: Bang Zhang

Found 20 papers, 7 papers with code

EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

no code implementations 27 Feb 2024 Linrui Tian, Qi Wang, Bang Zhang, Liefeng Bo

In this work, we tackle the challenge of enhancing the realism and expressiveness in talking head video generation by focusing on the dynamic and nuanced relationship between audio cues and facial movements.

Video Generation

MaTe3D: Mask-guided Text-based 3D-aware Portrait Editing

no code implementations 12 Dec 2023 Kangneng Zhou, Daiheng Gao, Xuan Wang, Jie Zhang, Peng Zhang, Xusen Sun, Longhao Zhang, Shiqi Yang, Bang Zhang, Liefeng Bo, Yaxing Wang

To address this limitation, we propose MaTe3D: mask-guided text-based 3D-aware portrait editing.

VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior

no code implementations 4 Dec 2023 Xusen Sun, Longhao Zhang, Hao Zhu, Peng Zhang, Bang Zhang, Xinya Ji, Kangneng Zhou, Daiheng Gao, Liefeng Bo, Xun Cao

Audio-driven talking head generation has drawn much attention in recent years, and many efforts have been made in lip-sync, expressive facial expressions, natural head pose generation, and high video quality.

Talking Head Generation

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

1 code implementation 28 Nov 2023 Li Hu, Xin Gao, Peng Zhang, Ke Sun, Bang Zhang, Liefeng Bo

Character Animation aims to generate character videos from still images through driving signals.

RenderIH: A Large-scale Synthetic Dataset for 3D Interacting Hand Pose Estimation

1 code implementation ICCV 2023 Lijun Li, Linrui Tian, Xindi Zhang, Qi Wang, Bang Zhang, Mengyuan Liu, Chen Chen

The current interacting hand (IH) datasets are relatively simplistic in terms of background and texture, with hand joints being annotated by a machine annotator, which may result in inaccuracies, and the diversity of pose distribution is limited.

3D Interacting Hand Pose Estimation, Hand Pose Estimation

Cloth2Tex: A Customized Cloth Texture Generation Pipeline for 3D Virtual Try-On

no code implementations 8 Aug 2023 Daiheng Gao, Xu Chen, Xindi Zhang, Qi Wang, Ke Sun, Bang Zhang, Liefeng Bo, QiXing Huang

Traditional warping-based texture generation methods require a significant number of control points to be manually selected for each type of garment, which can be a time-consuming and tedious process.

Texture Synthesis, Virtual Try-on

DiffHand: End-to-End Hand Mesh Reconstruction via Diffusion Models

no code implementations 23 May 2023 Lijun Li, Li'an Zhuo, Bang Zhang, Liefeng Bo, Chen Chen

Hand mesh reconstruction from a monocular image is a challenging task due to depth ambiguity and severe occlusion; there remains a non-unique mapping between the monocular image and the hand mesh.

Denoising

Gloss-Free End-to-End Sign Language Translation

1 code implementation 22 May 2023 Kezhou Lin, Xiaohan Wang, Linchao Zhu, Ke Sun, Bang Zhang, Yi Yang

In this paper, we tackle the problem of sign language translation (SLT) without gloss annotations.

Sign Language Translation, Translation

Towards Stable Human Pose Estimation via Cross-View Fusion and Foot Stabilization

no code implementations CVPR 2023 Li'an Zhuo, Jian Cao, Qi Wang, Bang Zhang, Liefeng Bo

Then the optimization-based method is introduced to reconstruct the foot pose and foot-ground contact for the general multi-view datasets including AIST++ and Human3.6M.

Pose Estimation

DART: Articulated Hand Model with Diverse Accessories and Rich Textures

1 code implementation 14 Oct 2022 Daiheng Gao, Yuliang Xiu, Kailin Li, Lixin Yang, Feng Wang, Peng Zhang, Bang Zhang, Cewu Lu, Ping Tan

A Unity GUI is also provided to generate synthetic hand data with user-defined settings, e.g., pose, camera, background, lighting, textures, and accessories.

Hand Pose Estimation, Unity

One-stage Action Detection Transformer

no code implementations 21 Jun 2022 Lijun Li, Li'an Zhuo, Bang Zhang

In this work, we introduce our solution to the EPIC-KITCHENS-100 2022 Action Detection challenge.

Action Detection

Multi-View Consistent Generative Adversarial Networks for 3D-aware Image Synthesis

1 code implementation CVPR 2022 Xuanmeng Zhang, Zhedong Zheng, Daiheng Gao, Bang Zhang, Pan Pan, Yi Yang

To address this challenge, we propose Multi-View Consistent Generative Adversarial Networks (MVCGAN) for high-quality 3D-aware image synthesis with geometry constraints.

3D-Aware Image Synthesis

Divide and Rule: Recurrent Partitioned Network for Dynamic Processes

no code implementations 1 Jun 2021 Qianyu Feng, Bang Zhang, Yi Yang

Differently, our goal is to represent a system with a part-whole hierarchy and discover the implied dependencies among intra-system variables: inferring the interactions that possess causal effects on the sub-system behavior with REcurrent partItioned Network (REIN).

Temporal Sequences

VidFace: A Full-Transformer Solver for Video Face Hallucination with Unaligned Tiny Snapshots

1 code implementation 31 May 2021 Yuan Gan, Yawei Luo, Xin Yu, Bang Zhang, Yi Yang

In this paper, we investigate the task of hallucinating an authentic high-resolution (HR) human face from multiple low-resolution (LR) video snapshots.

Face Hallucination, Hallucination

OR-Net: Pointwise Relational Inference for Data Completion under Partial Observation

no code implementations 2 May 2021 Qianyu Feng, Linchao Zhu, Bang Zhang, Pan Pan, Yi Yang

Specifically, we aim to approximate the real joint distribution over the partial observation and latent variables, and thus infer the unseen targets.

Stable Learning in Coding Space for Multi-Class Decoding and Its Extension for Multi-Class Hypothesis Transfer Learning

no code implementations CVPR 2014 Bang Zhang, Yi Wang, Yang Wang, Fang Chen

Many prevalent multi-class classification approaches can be unified and generalized by the output coding framework which usually consists of three phases: (1) coding, (2) learning binary classifiers, and (3) decoding.

General Classification, Multi-class Classification
