Search Results for author: Gang Zeng

Found 40 papers, 20 papers with code

EdgeRunner: Auto-regressive Auto-encoder for Artistic Mesh Generation

no code implementations26 Sep 2024 Jiaxiang Tang, Zhaoshuo Li, Zekun Hao, Xian Liu, Gang Zeng, Ming-Yu Liu, Qinsheng Zhang

Current auto-regressive mesh generation methods suffer from issues such as incompleteness, insufficient detail, and poor generalization.

Diversity

Superpoint Gaussian Splatting for Real-Time High-Fidelity Dynamic Scene Reconstruction

1 code implementation6 Jun 2024 Diwen Wan, Ruijie Lu, Gang Zeng

Empowered by these superpoints, our method manages to extend 3D Gaussian splatting to dynamic scenes with only a slight increase in computational expense.

RoomTex: Texturing Compositional Indoor Scenes via Iterative Inpainting

no code implementations4 Jun 2024 Qi Wang, Ruijie Lu, Xudong Xu, Jingbo Wang, Michael Yu Wang, Bo Dai, Gang Zeng, Dan Xu

In the coarse stage, RoomTex first unwraps the scene mesh to a panoramic depth map and leverages ControlNet to generate a room panorama, which is regarded as the coarse reference to ensure the global texture consistency.

Edge Detection Text to 3D

InTeX: Interactive Text-to-texture Synthesis via Unified Depth-aware Inpainting

no code implementations18 Mar 2024 Jiaxiang Tang, Ruijie Lu, Xiaokang Chen, Xiang Wen, Gang Zeng, Ziwei Liu

Text-to-texture synthesis has become a new frontier in 3D content creation thanks to the recent advances in text-to-image models.

Texture Synthesis

LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation

1 code implementation7 Feb 2024 Jiaxiang Tang, Zhaoxi Chen, Xiaokang Chen, Tengfei Wang, Gang Zeng, Ziwei Liu

2) 3D Backbone: We present an asymmetric U-Net as a high-throughput backbone operating on multi-view images, which can be produced from text or single-view image input by leveraging multi-view diffusion models.

DreamGaussian4D: Generative 4D Gaussian Splatting

1 code implementation28 Dec 2023 Jiawei Ren, Liang Pan, Jiaxiang Tang, Chi Zhang, Ang Cao, Gang Zeng, Ziwei Liu

Specifically, we propose an integral framework with two major modules: 1) Image-to-4D GS - we initially generate static GS with DreamGaussianHD, followed by HexPlane-based dynamic generation with Gaussian deformation; and 2) Video-to-Video Texture Refinement - we refine the generated UV-space texture maps and meanwhile enhance their temporal consistency by utilizing a pre-trained image-to-video diffusion model.

Video Generation

HumanGaussian: Text-Driven 3D Human Generation with Gaussian Splatting

no code implementations CVPR 2024 Xian Liu, Xiaohang Zhan, Jiaxiang Tang, Ying Shan, Gang Zeng, Dahua Lin, Xihui Liu, Ziwei Liu

In this paper, we propose an efficient yet effective framework, HumanGaussian, that generates high-quality 3D humans with fine-grained geometry and realistic appearance.

DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation

1 code implementation28 Sep 2023 Jiaxiang Tang, Jiawei Ren, Hang Zhou, Ziwei Liu, Gang Zeng

In contrast to the occupancy pruning used in Neural Radiance Fields, we demonstrate that the progressive densification of 3D Gaussians converges significantly faster for 3D generative tasks.

3D Generation

Interactive Segment Anything NeRF with Feature Imitation

no code implementations25 May 2023 Xiaokang Chen, Jiaxiang Tang, Diwen Wan, Jingbo Wang, Gang Zeng

We propose to imitate the backbone feature of off-the-shelf perception models to achieve zero-shot semantic segmentation with NeRF.

Decoder Segmentation +2

Real-time 3D Semantic Scene Completion Via Feature Aggregation and Conditioned Prediction

no code implementations20 Mar 2023 Xiaokang Chen, Yajie Xing, Gang Zeng

In this paper, we propose a real-time semantic scene completion method with a feature aggregation strategy and conditioned prediction module.

3D Semantic Scene Completion

Graph Signal Sampling for Inductive One-Bit Matrix Completion: a Closed-form Solution

1 code implementation8 Feb 2023 Chao Chen, Haoyu Geng, Gang Zeng, Zhaobing Han, Hua Chai, Xiaokang Yang, Junchi Yan

Inductive one-bit matrix completion is motivated by modern applications such as recommender systems, where new users would appear at test stage with the ratings consisting of only ones and no zeros.

Matrix Completion Recommendation Systems

Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition

1 code implementation22 Nov 2022 Jiaxiang Tang, Kaisiyuan Wang, Hang Zhou, Xiaokang Chen, Dongliang He, Tianshu Hu, Jingtuo Liu, Gang Zeng, Jingdong Wang

While dynamic Neural Radiance Fields (NeRF) have shown success in high-fidelity 3D modeling of talking portraits, the slow training and inference speed severely obstruct their potential usage.

Talking Face Generation

D$^3$ETR: Decoder Distillation for Detection Transformer

no code implementations17 Nov 2022 Xiaokang Chen, Jiahui Chen, Yan Liu, Gang Zeng

Specifically, Adaptive Matching applies bipartite matching to adaptively match the outputs of the teacher and the student in each decoder layer, while Fixed Matching fixes the correspondence between the outputs of the teacher and the student with the same object queries, with the teacher's fixed object queries fed to the decoder of the student as an auxiliary group.

Decoder Knowledge Distillation

JVLDLoc: a Joint Optimization of Visual-LiDAR Constraints and Direction Priors for Localization in Driving Scenario

no code implementations21 Aug 2022 Longrui Dong, Gang Zeng

The ability for a moving agent to localize itself in environment is the basic demand for emerging applications, such as autonomous driving, etc.

Autonomous Driving

Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment

2 code implementations ICCV 2023 Qiang Chen, Xiaokang Chen, Jian Wang, Shan Zhang, Kun Yao, Haocheng Feng, Junyu Han, Errui Ding, Gang Zeng, Jingdong Wang

Detection transformer (DETR) relies on one-to-one assignment, assigning one ground-truth object to one prediction, for end-to-end detection without NMS post-processing.

Data Augmentation Decoder +3

Conditional DETR V2: Efficient Detection Transformer with Box Queries

no code implementations18 Jul 2022 Xiaokang Chen, Fangyun Wei, Gang Zeng, Jingdong Wang

Inspired by Conditional DETR, an improved DETR with fast training convergence, that presented box queries (originally called spatial queries) for internal decoder layers, we reformulate the object query into the format of the box query that is a composition of the embeddings of the reference point and the transformation of the box with respect to the reference point.

Decoder Object +2

Compressible-composable NeRF via Rank-residual Decomposition

2 code implementations30 May 2022 Jiaxiang Tang, Xiaokang Chen, Jingbo Wang, Gang Zeng

To circumvent the hurdle, in this paper, we present an explicit neural field representation that enables efficient and convenient manipulation of models.

Point Scene Understanding via Disentangled Instance Mesh Reconstruction

1 code implementation31 Mar 2022 Jiaxiang Tang, Xiaokang Chen, Jingbo Wang, Gang Zeng

Semantic scene reconstruction from point cloud is an essential and challenging task for 3D scene understanding.

Retrieval Scene Understanding

MaskGroup: Hierarchical Point Grouping and Masking for 3D Instance Segmentation

no code implementations28 Mar 2022 Min Zhong, Xinghao Chen, Xiaokang Chen, Gang Zeng, Yunhe Wang

For instance, our approach achieves a 66. 4\% mAP with the 0. 5 IoU threshold on the ScanNetV2 test set, which is 1. 9\% higher than the state-of-the-art method.

3D Instance Segmentation Semantic Segmentation

Context Autoencoder for Self-Supervised Representation Learning

6 code implementations7 Feb 2022 Xiaokang Chen, Mingyu Ding, Xiaodi Wang, Ying Xin, Shentong Mo, Yunhao Wang, Shumin Han, Ping Luo, Gang Zeng, Jingdong Wang

The pretraining tasks include two tasks: masked representation prediction - predict the representations for the masked patches, and masked patch reconstruction - reconstruct the masked patches.

Decoder Instance Segmentation +6

Not All Voxels Are Equal: Semantic Scene Completion from the Point-Voxel Perspective

no code implementations24 Dec 2021 Xiaokang Chen, Jiaxiang Tang, Jingbo Wang, Gang Zeng

Firstly, we transfer the voxelized scenes to point clouds by removing these visible empty voxels and adopt a deep point stream to capture semantic information from the scene efficiently.

3D Semantic Scene Completion

Conditional DETR for Fast Training Convergence

4 code implementations ICCV 2021 Depu Meng, Xiaokang Chen, Zejia Fan, Gang Zeng, Houqiang Li, Yuhui Yuan, Lei Sun, Jingdong Wang

Our approach, named conditional DETR, learns a conditional spatial query from the decoder embedding for decoder multi-head cross-attention.

Decoder Object +2

Joint Implicit Image Function for Guided Depth Super-Resolution

1 code implementation19 Jul 2021 Jiaxiang Tang, Xiaokang Chen, Gang Zeng

Inspired by the recent progress in implicit neural representation, we propose to formulate the guided super-resolution as a neural implicit image interpolation problem, where we take the form of a general image interpolation but use a novel Joint Implicit Image Function (JIIF) representation to learn both the interpolation weights and values.

Graph Attention Super-Resolution

Neural Style Transfer via Meta Networks

no code implementations CVPR 2018 Falong Shen, Shuicheng Yan, Gang Zeng

Recent works on style transfer typically need to train image transformation networks for every new style, and the style is encoded in the network parameters by enormous iterations of stochastic gradient descent, which lacks the generalization ability to new style in the inference stage.

Style Transfer

Meta Networks for Neural Style Transfer

1 code implementation13 Sep 2017 Falong Shen, Shuicheng Yan, Gang Zeng

Recent works on style transfer typically need to train image transformation networks for every new style, and the style is encoded in the network parameters by enormous iterations of stochastic gradient descent.

Style Transfer

Weighted Residuals for Very Deep Networks

no code implementations28 May 2016 Falong Shen, Gang Zeng

The weighted residual network is able to learn to combine residuals from different layers effectively and efficiently.

Fast Semantic Image Segmentation with High Order Context and Guided Filtering

no code implementations13 May 2016 Falong Shen, Gang Zeng

This paper describes a fast and accurate semantic image segmentation approach that encodes not only the discriminative features from deep neural networks, but also the high-order context compatibility among adjacent objects as well as low level image features.

Image Segmentation Semantic Segmentation +1

Fast Approximate $K$-Means via Cluster Closures

no code implementations11 Dec 2013 Jingdong Wang, Jing Wang, Qifa Ke, Gang Zeng, Shipeng Li

Traditional $k$-means is an iterative algorithm---in each iteration new cluster centers are computed and each data point is re-assigned to its nearest center.

Clustering Image Retrieval +1

Scalable $k$-NN graph construction

no code implementations30 Jul 2013 Jingdong Wang, Jing Wang, Gang Zeng, Zhuowen Tu, Rui Gan, Shipeng Li

The $k$-NN graph has played a central role in increasingly popular data-driven techniques for various learning and vision tasks; yet, finding an efficient and effective way to construct $k$-NN graphs remains a challenge, especially for large-scale high-dimensional data.

graph construction

Cannot find the paper you are looking for? You can Submit a new open access paper.