Search Results for author: Xiaohu Guo

Found 16 papers, 2 papers with code

NASM: Neural Anisotropic Surface Meshing

no code implementations 30 Oct 2024 Hongbo Li, Haikuan Zhu, Sikai Zhong, Ningna Wang, Cheng Lin, Xiaohu Guo, Shiqing Xin, Wenping Wang, Jing Hua, Zichun Zhong

To our knowledge, this is the first time that a deep learning framework and a large dataset are proposed to construct a high-d Euclidean embedding space for 3D anisotropic surface meshing.

Graph Neural Network

DiffTED: One-shot Audio-driven TED Talk Video Generation with Diffusion-based Co-speech Gestures

no code implementations 11 Sep 2024 Steven Hogue, Chenxu Zhang, Hamza Daruger, Yapeng Tian, Xiaohu Guo

Audio-driven talking video generation has advanced significantly, but existing methods often depend on video-to-video translation techniques and traditional generative networks like GANs, and they typically generate talking heads and co-speech gestures separately, leading to less coherent outputs.

Diversity Talking Head Generation +1

SurgicalGaussian: Deformable 3D Gaussians for High-Fidelity Surgical Scene Reconstruction

no code implementations 6 Jul 2024 Weixing Xie, Junfeng Yao, Xianpeng Cao, Qiqin Lin, Zerui Tang, Xiao Dong, Xiaohu Guo

However, based on implicit representation, NeRFs struggle to capture the intricate details of objects in the scene and cannot achieve real-time rendering.

Dynamic Reconstruction

HO-Cap: A Capture System and Dataset for 3D Reconstruction and Pose Tracking of Hand-Object Interaction

no code implementations 10 Jun 2024 Jikai Wang, Qifan Zhang, Yu-Wei Chao, Bowen Wen, Xiaohu Guo, Yu Xiang

We introduce a data capture system and a new dataset named HO-Cap that can be used to study 3D reconstruction and pose tracking of hands and objects in videos.

3D Reconstruction Pose Tracking +1

CWF: Consolidating Weak Features in High-quality Mesh Simplification

no code implementations 24 Apr 2024 Rui Xu, Longdu Liu, Ningna Wang, Shuangmin Chen, Shiqing Xin, Xiaohu Guo, Zichun Zhong, Taku Komura, Wenping Wang, Changhe Tu

In mesh simplification, common requirements like accuracy, triangle quality, and feature alignment are often considered as a trade-off.

Robust Active Speaker Detection in Noisy Environments

no code implementations 27 Mar 2024 Siva Sai Nagender Vasireddy, Chenxu Zhang, Xiaohu Guo, Yapeng Tian

Experiments demonstrate that non-speech audio noises significantly impact ASD models, and our proposed approach improves ASD performance in noisy environments.

Active Speaker Detection Speech Separation

DRSM: efficient neural 4d decomposition for dynamic reconstruction in stationary monocular cameras

no code implementations 1 Feb 2024 Weixing Xie, Xiao Dong, Yong Yang, Qiqin Lin, Jingze Chen, Junfeng Yao, Xiaohu Guo

With the popularity of monocular videos generated by video sharing and live broadcasting applications, reconstructing and editing dynamic scenes in stationary monocular cameras has become a special but anticipated technology.

Dynamic Reconstruction Neural Rendering

MusicFace: Music-driven Expressive Singing Face Synthesis

no code implementations 24 Mar 2023 PengFei Liu, Wenjin Deng, Hengda Li, Jintai Wang, Yinglin Zheng, Yiwei Ding, Xiaohu Guo, Ming Zeng

In this paper, we present a method for this task with natural motions of the lip, facial expression, head pose, and eye states.

Face Generation

Layered-Garment Net: Generating Multiple Implicit Garment Layers from a Single Image

no code implementations 22 Nov 2022 Alakh Aggarwal, Jikai Wang, Steven Hogue, Saifeng Ni, Madhukar Budagavi, Xiaohu Guo

To the best of our knowledge, LGN is the first research work to generate intersection-free multiple layers of garments on the human body from a single image.

FACIAL: Synthesizing Dynamic Talking Face with Implicit Attribute Learning

1 code implementation ICCV 2021 Chenxu Zhang, Yifan Zhao, Yifei HUANG, Ming Zeng, Saifeng Ni, Madhukar Budagavi, Xiaohu Guo

In this paper, we propose a talking face generation method that takes an audio signal as input and a short target video clip as reference, and synthesizes a photo-realistic video of the target face with natural lip motions, head poses, and eye blinks that are in-sync with the input audio signal.

3D Face Animation Attribute +2

Topology-Change-Aware Volumetric Fusion for Dynamic Scene Reconstruction

no code implementations ECCV 2020 Chao Li, Xiaohu Guo

In the classic volumetric fusion-based framework, a mesh is usually extracted from the TSDF volume as the canonical surface representation to help estimate the deformation field.

4D reconstruction

Efficient Plane-Based Optimization of Geometry and Texture for Indoor RGB-D Reconstruction

1 code implementation 21 May 2019 Chao Wang, Xiaohu Guo

We propose a novel approach to reconstruct RGB-D indoor scenes based on plane primitives.

RGB-D Reconstruction

ArticulatedFusion: Real-time Reconstruction of Motion, Geometry and Segmentation Using a Single Depth Camera

no code implementations ECCV 2018 Chao Li, Zheheng Zhao, Xiaohu Guo

This paper proposes a real-time dynamic scene reconstruction method capable of reproducing the motion, geometry, and segmentation simultaneously given a live depth stream from a single RGB-D camera.

Segmentation
