Search Results for author: Song-Hai Zhang

Found 34 papers, 9 papers with code

Texture-GS: Disentangling the Geometry and Texture for 3D Gaussian Splatting Editing

no code implementations15 Mar 2024 Tian-Xing Xu, WenBo Hu, Yu-Kun Lai, Ying Shan, Song-Hai Zhang

3D Gaussian splatting, emerging as a groundbreaking approach, has drawn increasing attention for its capabilities of high-fidelity reconstruction and real-time rendering.

Disentanglement

MAL: Motion-Aware Loss with Temporal and Distillation Hints for Self-Supervised Depth Estimation

no code implementations18 Feb 2024 Yup-Jiang Dong, Fang-Lue Zhang, Song-Hai Zhang

To address this issue, we present Motion-Aware Loss, which leverages the temporal relation among consecutive input frames and a novel distillation scheme between the teacher and student networks in the multi-frame self-supervised depth estimation methods.

Monocular Depth Estimation

PI3D: Efficient Text-to-3D Generation with Pseudo-Image Diffusion

no code implementations14 Dec 2023 Ying-Tian Liu, Guan Luo, Heyi Sun, Wei Yin, Yuan-Chen Guo, Song-Hai Zhang

In this paper, we introduce PI3D, a novel and efficient framework that utilizes the pre-trained text-to-image diffusion models to generate high-quality 3D shapes in minutes.

Text to 3D

Text-to-3D with Classifier Score Distillation

no code implementations30 Oct 2023 Xin Yu, Yuan-Chen Guo, Yangguang Li, Ding Liang, Song-Hai Zhang, Xiaojuan Qi

In this paper, we re-evaluate the role of classifier-free guidance in score distillation and discover a surprising finding: the guidance alone is enough for effective text-to-3D generation tasks.

Text to 3D Texture Synthesis

Wonder3D: Single Image to 3D using Cross-Domain Diffusion

no code implementations23 Oct 2023 Xiaoxiao Long, Yuan-Chen Guo, Cheng Lin, YuAn Liu, Zhiyang Dou, Lingjie Liu, Yuexin Ma, Song-Hai Zhang, Marc Habermann, Christian Theobalt, Wenping Wang

In this work, we introduce Wonder3D, a novel method for efficiently generating high-fidelity textured meshes from single-view images. Recent methods based on Score Distillation Sampling (SDS) have shown the potential to recover 3D geometry from 2D diffusion priors, but they typically suffer from time-consuming per-shape optimization and inconsistent geometry.

Image to 3D

Sparse3D: Distilling Multiview-Consistent Diffusion for Object Reconstruction from Sparse Views

no code implementations27 Aug 2023 Zi-Xin Zou, Weihao Cheng, Yan-Pei Cao, Shi-Sheng Huang, Ying Shan, Song-Hai Zhang

While recent techniques employ image diffusion models for generating plausible images at novel viewpoints or for distilling pre-trained diffusion priors into 3D representations using score distillation sampling (SDS), these methods often struggle to simultaneously achieve high-quality, consistent, and detailed results for both novel-view synthesis (NVS) and geometry.

3D Reconstruction Novel View Synthesis +1

Neural Point-based Volumetric Avatar: Surface-guided Neural Points for Efficient and Photorealistic Volumetric Head Avatar

no code implementations11 Jul 2023 Cong Wang, Di Kang, Yan-Pei Cao, Linchao Bao, Ying Shan, Song-Hai Zhang

Rendering photorealistic and dynamically moving human heads is crucial for ensuring a pleasant and immersive experience in AR/VR and video conferencing applications.

PanoGRF: Generalizable Spherical Radiance Fields for Wide-baseline Panoramas

no code implementations NeurIPS 2023 Zheng Chen, Yan-Pei Cao, Yuan-Chen Guo, Chen Wang, Ying Shan, Song-Hai Zhang

Unlike generalizable radiance fields trained on perspective images, PanoGRF avoids the information loss from panorama-to-perspective conversion and directly aggregates geometry and appearance features of 3D sample points from each panoramic view based on spherical projection.

Depth Estimation

VMesh: Hybrid Volume-Mesh Representation for Efficient View Synthesis

no code implementations28 Mar 2023 Yuan-Chen Guo, Yan-Pei Cao, Chen Wang, Yu He, Ying Shan, XiaoHu Qie, Song-Hai Zhang

With the emergence of neural radiance fields (NeRFs), view synthesis quality has reached an unprecedented level.

MBPTrack: Improving 3D Point Cloud Tracking with Memory Networks and Box Priors

no code implementations ICCV 2023 Tian-Xing Xu, Yuan-Chen Guo, Yu-Kun Lai, Song-Hai Zhang

To address these issues, we present MBPTrack, which adopts a Memory mechanism to utilize past information and formulates localization in a coarse-to-fine scheme using Box Priors given in the first frame.

3D Single Object Tracking Autonomous Driving +1

Joint Implicit Neural Representation for High-fidelity and Compact Vector Fonts

no code implementations ICCV 2023 Chia-Hao Chen, Ying-Tian Liu, Zhifei Zhang, Yuan-Chen Guo, Song-Hai Zhang

Existing vector font generation approaches either struggle to preserve high-frequency corner details of the glyph or produce vector shapes that have redundant segments, which hinders their applications in practical scenarios.

Font Generation

Gradient-based Point Cloud Denoising with Uniformity

no code implementations21 Jul 2022 Tian-Xing Xu, Yuan-Chen Guo, Yong-Liang Yang, Song-Hai Zhang

Point clouds captured by depth sensors are often contaminated by noises, obstructing further analysis and applications.

Denoising Surface Reconstruction

DeepPortraitDrawing: Generating Human Body Images from Freehand Sketches

no code implementations4 May 2022 Xian Wu, Chen Wang, Hongbo Fu, Ariel Shamir, Song-Hai Zhang, Shi-Min Hu

Researchers have explored various ways to generate realistic images from freehand sketches, e. g., for objects and human faces.

Image Generation Sketch-to-Image Translation

NeRF-SR: High-Quality Neural Radiance Fields using Supersampling

1 code implementation3 Dec 2021 Chen Wang, Xian Wu, Yuan-Chen Guo, Song-Hai Zhang, Yu-Wing Tai, Shi-Min Hu

We present NeRF-SR, a solution for high-resolution (HR) novel view synthesis with mostly low-resolution (LR) inputs.

Novel View Synthesis Vocal Bursts Intensity Prediction

NeRFReN: Neural Radiance Fields with Reflections

no code implementations CVPR 2022 Yuan-Chen Guo, Di Kang, Linchao Bao, Yu He, Song-Hai Zhang

Specifically, we propose to split a scene into transmitted and reflected components, and model the two components with separate neural radiance fields.

Depth Estimation Novel View Synthesis

Deep Image Synthesis from Intuitive User Input: A Review and Perspectives

no code implementations9 Jul 2021 Yuan Xue, Yuan-Chen Guo, Han Zhang, Tao Xu, Song-Hai Zhang, Xiaolei Huang

In many applications of computer graphics, art and design, it is desirable for a user to provide intuitive non-image input, such as text, sketch, stroke, graph or layout, and have a computer system automatically generate photo-realistic images that adhere to the input content.

Image Generation Image Retrieval +1

Learning Implicit Glyph Shape Representation

no code implementations16 Jun 2021 Ying-Tian Liu, Yuan-Chen Guo, Yi-Xiao Li, Chen Wang, Song-Hai Zhang

In this paper, we present a novel implicit glyph shape representation, which models glyphs as shape primitives enclosed by quadratic curves, and naturally enables generating glyph images at arbitrary high resolutions.

Font Style Transfer Vector Graphics

Sketch2Model: View-Aware 3D Modeling from Single Free-Hand Sketches

1 code implementation CVPR 2021 Song-Hai Zhang, Yuan-Chen Guo, Qing-Wen Gu

We investigate the problem of generating 3D meshes from single free-hand sketches, aiming at fast 3D modeling for novice users.

A new dataset of dog breed images and a benchmark for fine-grained classification

1 code implementation1 Oct 2020 Ding-Nan Zo, Song-Hai Zhang, Tai-Jiang M, Min Zhang

It is currently the largest dataset for fine-grained classification of dogs, including130 dog breeds and 70, 428 real-world images.

Benchmarking Classification +3

Example-Guided Style Consistent Image Synthesis from Semantic Labeling

1 code implementation4 Jun 2019 Miao Wang, Guo-Ye Yang, Rui-Long Li, Run-Ze Liang, Song-Hai Zhang, Peter. M. Hall, Shi-Min Hu

Example-guided image synthesis aims to synthesize an image from a semantic label map and an exemplary image indicating style.

Image Generation Scene Segmentation

What and Where: A Context-based Recommendation System for Object Insertion

no code implementations24 Nov 2018 Song-Hai Zhang, Zhengping Zhou, Bin Liu, Xin Dong, Dun Liang, Peter Hall, Shi-Min Hu

In this work, we propose a novel topic consisting of two dual tasks: 1) given a scene, recommend objects to insert, 2) given an object category, retrieve suitable background scenes.

Object

Pose2Seg: Detection Free Human Instance Segmentation

6 code implementations CVPR 2019 Song-Hai Zhang, Rui-Long Li, Xin Dong, Paul L. Rosin, Zixi Cai, Han Xi, Dingcheng Yang, Hao-Zhi Huang, Shi-Min Hu

We demonstrate that our pose-based framework can achieve better accuracy than the state-of-art detection-based approach on the human instance segmentation problem, and can moreover better handle occlusion.

2D Human Pose Estimation Human Instance Segmentation +5

Deep Online Video Stabilization

3 code implementations22 Feb 2018 Miao Wang, Guo-Ye Yang, Jin-Kun Lin, Ariel Shamir, Song-Hai Zhang, Shao-Ping Lu, Shi-Min Hu

In this paper, we solve the video stabilization problem using a convolutional neural network (ConvNet).

Graphics

A Comparative Study of Algorithms for Realtime Panoramic Video Blending

no code implementations1 Jun 2016 Zhe Zhu, Jiaming Lu, Minxuan Wang, Song-Hai Zhang, Ralph Martin, Hantao Liu, Shi-Min Hu

In this paper, we investigate 6 popular blending algorithms---feather blending, multi-band blending, modified Poisson blending, mean value coordinate blending, multi-spline blending and convolution pyramid blending.

Cannot find the paper you are looking for? You can Submit a new open access paper.