Search Results for author: Jiahe Li

Found 8 papers, 4 papers with code

TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting

no code implementations23 Apr 2024 Jiahe Li, Jiawei Zhang, Xiao Bai, Jin Zheng, Xin Ning, Jun Zhou, Lin Gu

Leveraging the point-based Gaussian Splatting, facial motions can be represented in our method by applying smooth and continuous deformations to persistent Gaussian primitives, without requiring to learn the difficult appearance change like previous methods.

Robust Synthetic-to-Real Transfer for Stereo Matching

1 code implementation12 Mar 2024 Jiawei Zhang, Jiahe Li, Lei Huang, Xiaohan Yu, Lin Gu, Jin Zheng, Xiao Bai

With advancements in domain generalized stereo matching networks, models pre-trained on synthetic data demonstrate strong robustness to unseen domains.

Domain Generalization Pseudo Label +1

DNGaussian: Optimizing Sparse-View 3D Gaussian Radiance Fields with Global-Local Depth Normalization

1 code implementation11 Mar 2024 Jiahe Li, Jiawei Zhang, Xiao Bai, Jin Zheng, Xin Ning, Jun Zhou, Lin Gu

Our motivation stems from the highly efficient representation and surprising quality of the recent 3D Gaussian Splatting, despite it will encounter a geometry degradation when input views decrease.

Novel View Synthesis

Self-supervised Learning of Implicit Shape Representation with Dense Correspondence for Deformable Objects

no code implementations ICCV 2023 Baowen Zhang, Jiahe Li, Xiaoming Deng, yinda zhang, Cuixia Ma, Hongan Wang

In this paper, we propose a novel self-supervised approach to learn neural implicit shape representation for deformable objects, which can represent shapes with a template shape and dense correspondence in 3D.

3D Shape Representation Self-Supervised Learning

Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis

1 code implementation ICCV 2023 Jiahe Li, Jiawei Zhang, Xiao Bai, Jun Zhou, Lin Gu

This paper presents ER-NeRF, a novel conditional Neural Radiance Fields (NeRF) based architecture for talking portrait synthesis that can concurrently achieve fast convergence, real-time rendering, and state-of-the-art performance with small model size.

Accelerating Dynamic Network Embedding with Billions of Parameter Updates to Milliseconds

1 code implementation15 Jun 2023 Haoran Deng, Yang Yang, Jiahe Li, Haoyang Cai, ShiLiang Pu, Weihao Jiang

Network embedding, a graph representation learning method illustrating network topology by mapping nodes into lower-dimension vectors, is challenging to accommodate the ever-changing dynamic graphs in practice.

Graph Reconstruction Graph Representation Learning +3

EvHandPose: Event-based 3D Hand Pose Estimation with Sparse Supervision

no code implementations6 Mar 2023 Jianping Jiang, Jiahe Li, Baowen Zhang, Xiaoming Deng, Boxin Shi

Experiments on EvRealHands demonstrate that EvHandPose outperforms previous event-based methods under all evaluation scenes, achieves accurate and stable hand pose estimation with high temporal resolution in fast motion and strong light scenes compared with RGB-based methods, generalizes well to outdoor scenes and another type of event camera, and shows the potential for the hand gesture recognition task.

3D Hand Pose Estimation Hand Gesture Recognition +1

STPrivacy: Spatio-Temporal Privacy-Preserving Action Recognition

no code implementations ICCV 2023 Ming Li, Xiangyu Xu, Hehe Fan, Pan Zhou, Jun Liu, Jia-Wei Liu, Jiahe Li, Jussi Keppo, Mike Zheng Shou, Shuicheng Yan

For the first time, we introduce vision Transformers into PPAR by treating a video as a tubelet sequence, and accordingly design two complementary mechanisms, i. e., sparsification and anonymization, to remove privacy from a spatio-temporal perspective.

Action Recognition Facial Expression Recognition (FER) +2

Cannot find the paper you are looking for? You can Submit a new open access paper.