Search Results for author: Qianyi Wu

Found 18 papers, 11 papers with code

Taming Stable Diffusion for Text to 360° Panorama Image Generation

2 code implementations • 11 Apr 2024 • Cheng Zhang, Qianyi Wu, Camilo Cruz Gambardella, Xiaoshui Huang, Dinh Phung, Wanli Ouyang, Jianfei Cai

Generative models, e.g., Stable Diffusion, have enabled the creation of photorealistic images from text prompts.

Denoising • Image Generation

131


ClusteringSDF: Self-Organized Neural Implicit Surfaces for 3D Decomposition

no code implementations • 21 Mar 2024 • Tianhao Wu, Chuanxia Zheng, Tat-Jen Cham, Qianyi Wu

3D decomposition/segmentation still remains a challenge as large-scale 3D annotated data is not readily available.

Segmentation


HAC: Hash-grid Assisted Context for 3D Gaussian Splatting Compression

1 code implementation • 21 Mar 2024 • Yihang Chen, Qianyi Wu, Jianfei Cai, Mehrtash Harandi, Weiyao Lin

3D Gaussian Splatting (3DGS) has emerged as a promising framework for novel view synthesis, boasting rapid rendering speed with high fidelity.

Attribute • Novel View Synthesis • +1


ObjectSDF++: Improved Object-Compositional Neural Implicit Surfaces

1 code implementation • ICCV 2023 • Qianyi Wu, Kaisiyuan Wang, Kejie Li, Jianmin Zheng, Jianfei Cai

Unlike traditional multi-view stereo approaches, the neural implicit surface-based methods leverage neural networks to represent 3D scenes as signed distance functions (SDFs).

3D Reconstruction • Multi-View 3D Reconstruction • +3

135


Explicit Correspondence Matching for Generalizable Neural Radiance Fields

1 code implementation • 24 Apr 2023 • Yuedong Chen, Haofei Xu, Qianyi Wu, Chuanxia Zheng, Tat-Jen Cham, Jianfei Cai

The key to our approach lies in the explicitly modeled correspondence matching information, so as to provide the geometry prior to the prediction of NeRF color and density for volume rendering.

Novel View Synthesis

159


Make Your Brief Stroke Real and Stereoscopic: 3D-Aware Simplified Sketch to Portrait Generation

no code implementations • 14 Feb 2023 • Yasheng Sun, Qianyi Wu, Hang Zhou, Kaisiyuan Wang, Tianshu Hu, Chen-Chieh Liao, Shio Miyafuji, Ziwei Liu, Hideki Koike

Creating photo-realistic versions of people's sketched portraits is useful for various entertainment purposes.


Masked Lip-Sync Prediction by Audio-Visual Contextual Exploitation in Transformers

no code implementations • 9 Dec 2022 • Yasheng Sun, Hang Zhou, Kaisiyuan Wang, Qianyi Wu, Zhibin Hong, Jingtuo Liu, Errui Ding, Jingdong Wang, Ziwei Liu, Hideki Koike

This requires masking a large percentage of the original image and seamlessly inpainting it with the aid of audio and reference frames.


Audio-Driven Co-Speech Gesture Video Generation

no code implementations • 5 Dec 2022 • Xian Liu, Qianyi Wu, Hang Zhou, Yuanqi Du, Wayne Wu, Dahua Lin, Ziwei Liu

Our key insight is that the co-speech gestures can be decomposed into common motion patterns and subtle rhythmic dynamics.

Video Generation


Object-Compositional Neural Implicit Surfaces

1 code implementation • 20 Jul 2022 • Qianyi Wu, Xian Liu, Yuedong Chen, Kejie Li, Chuanxia Zheng, Jianfei Cai, Jianmin Zheng

This paper proposes a novel framework, ObjectSDF, to build an object-compositional neural implicit representation with high fidelity in 3D reconstruction and object representation.

3D Reconstruction • Novel View Synthesis • +1

181


EAMM: One-Shot Emotional Talking Face via Audio-Based Emotion-Aware Motion Model

no code implementations • 30 May 2022 • Xinya Ji, Hang Zhou, Kaisiyuan Wang, Qianyi Wu, Wayne Wu, Feng Xu, Xun Cao

Although significant progress has been made in audio-driven talking face generation, existing methods either neglect facial emotion or cannot be applied to arbitrary subjects.

Talking Face Generation


TransEditor: Transformer-Based Dual-Space GAN for Highly Controllable Facial Editing

1 code implementation • CVPR 2022 • Yanbo Xu, Yueqin Yin, Liming Jiang, Qianyi Wu, Chengyao Zheng, Chen Change Loy, Bo Dai, Wayne Wu

In this study, we highlight the importance of interaction in a dual-space GAN for more controllable editing.

Attribute • Disentanglement • +1

172


Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation

1 code implementation • CVPR 2022 • Xian Liu, Qianyi Wu, Hang Zhou, Yinghao Xu, Rui Qian, Xinyi Lin, Xiaowei Zhou, Wayne Wu, Bo Dai, Bolei Zhou

To enhance the quality of synthesized gestures, we develop a contrastive learning strategy based on audio-text alignment for better audio representations.

Ranked #3 on Gesture Generation on TED Gesture Dataset

Contrastive Learning • Gesture Generation

117


Sem2NeRF: Converting Single-View Semantic Masks to Neural Radiance Fields

1 code implementation • 21 Mar 2022 • Yuedong Chen, Qianyi Wu, Chuanxia Zheng, Tat-Jen Cham, Jianfei Cai

In light of recent advances in NeRF-based 3D-aware generative models, we introduce a new task, Semantic-to-NeRF translation, that aims to reconstruct a 3D scene modelled by NeRF, conditioned on one single-view semantic mask as input.

Ranked #1 on 3D-Aware Image Synthesis on CelebAMask-HQ

3D-Aware Image Synthesis • Translation

122


Exploring Smoothness and Class-Separation for Semi-supervised Medical Image Segmentation

2 code implementations • 2 Mar 2022 • Yicheng Wu, Zhonghua Wu, Qianyi Wu, ZongYuan Ge, Jianfei Cai

The pixel-level smoothness forces the model to generate invariant results under adversarial perturbations.

Image Segmentation • Semantic Segmentation • +1

1,984


Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation

no code implementations • 19 Jan 2022 • Xian Liu, Yinghao Xu, Qianyi Wu, Hang Zhou, Wayne Wu, Bolei Zhou

Moreover, to enable portrait rendering in one unified neural radiance field, a Torso Deformation module is designed to stabilize the large-scale non-rigid torso motions.


AOT: Appearance Optimal Transport Based Identity Swapping for Forgery Detection

no code implementations • NeurIPS 2020 • Hao Zhu, Chaoyou Fu, Qianyi Wu, Wayne Wu, Chen Qian, Ran He

However, due to the lack of Deepfakes datasets with large variance in appearance, which can hardly be produced by recent identity swapping methods, the detection algorithm may fail in this situation.


Disentangled Representation Learning for 3D Face Shape

1 code implementation • CVPR 2019 • Zi-Hang Jiang, Qianyi Wu, Keyu Chen, Juyong Zhang

In this paper, we present a novel strategy to design disentangled 3D face shape representation.

Attribute • Representation Learning

226


Alive Caricature from 2D to 3D

1 code implementation • CVPR 2018 • Qianyi Wu, Juyong Zhang, Yu-Kun Lai, Jianmin Zheng, Jianfei Cai

Caricature is an art form that expresses subjects in an abstract, simple and exaggerated view.

Caricature

