Search Results for author: Gang Zeng

Found 37 papers, 19 papers with code

InTeX: Interactive Text-to-texture Synthesis via Unified Depth-aware Inpainting

no code implementations • 18 Mar 2024 • Jiaxiang Tang, Ruijie Lu, Xiaokang Chen, Xiang Wen, Gang Zeng, Ziwei Liu

Text-to-texture synthesis has become a new frontier in 3D content creation thanks to the recent advances in text-to-image models.

Texture Synthesis

Paper
Add Code

LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation

1 code implementation • 7 Feb 2024 • Jiaxiang Tang, Zhaoxi Chen, Xiaokang Chen, Tengfei Wang, Gang Zeng, Ziwei Liu

2) 3D Backbone: We present an asymmetric U-Net as a high-throughput backbone operating on multi-view images, which can be produced from text or single-view image input by leveraging multi-view diffusion models.

1,158

Paper
Code

DreamGaussian4D: Generative 4D Gaussian Splatting

1 code implementation • 28 Dec 2023 • Jiawei Ren, Liang Pan, Jiaxiang Tang, Chi Zhang, Ang Cao, Gang Zeng, Ziwei Liu

Remarkable progress has been made in 4D content generation recently.

394

Paper
Code

HumanGaussian: Text-Driven 3D Human Generation with Gaussian Splatting

no code implementations • 28 Nov 2023 • Xian Liu, Xiaohang Zhan, Jiaxiang Tang, Ying Shan, Gang Zeng, Dahua Lin, Xihui Liu, Ziwei Liu

In this paper, we propose an efficient yet effective framework, HumanGaussian, that generates high-quality 3D humans with fine-grained geometry and realistic appearance.

Paper
Add Code

DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation

1 code implementation • 28 Sep 2023 • Jiaxiang Tang, Jiawei Ren, Hang Zhou, Ziwei Liu, Gang Zeng

In contrast to the occupancy pruning used in Neural Radiance Fields, we demonstrate that the progressive densification of 3D Gaussians converges significantly faster for 3D generative tasks.

3D Generation

3,616

Paper
Code

Interactive Segment Anything NeRF with Feature Imitation

no code implementations • 25 May 2023 • Xiaokang Chen, Jiaxiang Tang, Diwen Wan, Jingbo Wang, Gang Zeng

We propose to imitate the backbone feature of off-the-shelf perception models to achieve zero-shot semantic segmentation with NeRF.

Segmentation Semantic Segmentation +1

Paper
Add Code

VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks

2 code implementations • NeurIPS 2023 • Wenhai Wang, Zhe Chen, Xiaokang Chen, Jiannan Wu, Xizhou Zhu, Gang Zeng, Ping Luo, Tong Lu, Jie zhou, Yu Qiao, Jifeng Dai

We hope this model can set a new baseline for generalist vision and language models.

Language Modelling Large Language Model

3,121

Paper
Code

Real-time 3D Semantic Scene Completion Via Feature Aggregation and Conditioned Prediction

no code implementations • 20 Mar 2023 • Xiaokang Chen, Yajie Xing, Gang Zeng

In this paper, we propose a real-time semantic scene completion method with a feature aggregation strategy and conditioned prediction module.

3D Semantic Scene Completion

Paper
Add Code

Delicate Textured Mesh Recovery from NeRF via Adaptive Surface Refinement

1 code implementation • ICCV 2023 • Jiaxiang Tang, Hang Zhou, Xiaokang Chen, Tianshu Hu, Errui Ding, Jingdong Wang, Gang Zeng

Neural Radiance Fields (NeRF) have constituted a remarkable breakthrough in image-based 3D reconstruction.

3D Reconstruction

847

Paper
Code

Graph Signal Sampling for Inductive One-Bit Matrix Completion: a Closed-form Solution

1 code implementation • 8 Feb 2023 • Chao Chen, Haoyu Geng, Gang Zeng, Zhaobing Han, Hua Chai, Xiaokang Yang, Junchi Yan

Inductive one-bit matrix completion is motivated by modern applications such as recommender systems, where new users would appear at test stage with the ratings consisting of only ones and no zeros.

Matrix Completion Recommendation Systems

Paper
Code

Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition

1 code implementation • 22 Nov 2022 • Jiaxiang Tang, Kaisiyuan Wang, Hang Zhou, Xiaokang Chen, Dongliang He, Tianshu Hu, Jingtuo Liu, Gang Zeng, Jingdong Wang

While dynamic Neural Radiance Fields (NeRF) have shown success in high-fidelity 3D modeling of talking portraits, the slow training and inference speed severely obstruct their potential usage.

Talking Face Generation

817

Paper
Code

D$^3$ETR: Decoder Distillation for Detection Transformer

no code implementations • 17 Nov 2022 • Xiaokang Chen, Jiahui Chen, Yan Liu, Gang Zeng

Specifically, Adaptive Matching applies bipartite matching to adaptively match the outputs of the teacher and the student in each decoder layer, while Fixed Matching fixes the correspondence between the outputs of the teacher and the student with the same object queries, with the teacher's fixed object queries fed to the decoder of the student as an auxiliary group.

Knowledge Distillation

Paper
Add Code

JVLDLoc: a Joint Optimization of Visual-LiDAR Constraints and Direction Priors for Localization in Driving Scenario

no code implementations • 21 Aug 2022 • Longrui Dong, Gang Zeng

The ability for a moving agent to localize itself in environment is the basic demand for emerging applications, such as autonomous driving, etc.

Autonomous Driving

Paper
Add Code

Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment

2 code implementations • ICCV 2023 • Qiang Chen, Xiaokang Chen, Jian Wang, Shan Zhang, Kun Yao, Haocheng Feng, Junyu Han, Errui Ding, Gang Zeng, Jingdong Wang

Detection transformer (DETR) relies on one-to-one assignment, assigning one ground-truth object to one prediction, for end-to-end detection without NMS post-processing.

Data Augmentation Object +2

12,059

Paper
Code

Conditional DETR V2: Efficient Detection Transformer with Box Queries

no code implementations • 18 Jul 2022 • Xiaokang Chen, Fangyun Wei, Gang Zeng, Jingdong Wang

Inspired by Conditional DETR, an improved DETR with fast training convergence, that presented box queries (originally called spatial queries) for internal decoder layers, we reformulate the object query into the format of the box query that is a composition of the embeddings of the reference point and the transformation of the box with respect to the reference point.

Object object-detection +1

Paper
Add Code

Compressible-composable NeRF via Rank-residual Decomposition

2 code implementations • 30 May 2022 • Jiaxiang Tang, Xiaokang Chen, Jingbo Wang, Gang Zeng

To circumvent the hurdle, in this paper, we present an explicit neural field representation that enables efficient and convenient manipulation of models.

2,004

Paper
Code

Point Scene Understanding via Disentangled Instance Mesh Reconstruction

1 code implementation • 31 Mar 2022 • Jiaxiang Tang, Xiaokang Chen, Jingbo Wang, Gang Zeng

Semantic scene reconstruction from point cloud is an essential and challenging task for 3D scene understanding.

Retrieval Scene Understanding

Paper
Code

MaskGroup: Hierarchical Point Grouping and Masking for 3D Instance Segmentation

no code implementations • 28 Mar 2022 • Min Zhong, Xinghao Chen, Xiaokang Chen, Gang Zeng, Yunhe Wang

For instance, our approach achieves a 66. 4\% mAP with the 0. 5 IoU threshold on the ScanNetV2 test set, which is 1. 9\% higher than the state-of-the-art method.

Ranked #6 on 3D Instance Segmentation on S3DIS

3D Instance Segmentation Semantic Segmentation

Paper
Add Code

Context Autoencoder for Self-Supervised Representation Learning

6 code implementations • 7 Feb 2022 • Xiaokang Chen, Mingyu Ding, Xiaodi Wang, Ying Xin, Shentong Mo, Yunhao Wang, Shumin Han, Ping Luo, Gang Zeng, Jingdong Wang

The pretraining tasks include two tasks: masked representation prediction - predict the representations for the masked patches, and masked patch reconstruction - reconstruct the masked patches.

Ranked #14 on Self-Supervised Image Classification on ImageNet (finetuned)

Instance Segmentation object-detection +5

3,082

Paper
Code

Not All Voxels Are Equal: Semantic Scene Completion from the Point-Voxel Perspective

no code implementations • 24 Dec 2021 • Xiaokang Chen, Jiaxiang Tang, Jingbo Wang, Gang Zeng

Firstly, we transfer the voxelized scenes to point clouds by removing these visible empty voxels and adopt a deep point stream to capture semantic information from the scene efficiently.

Ranked #4 on 3D Semantic Scene Completion on NYUv2

3D Semantic Scene Completion

Paper
Add Code

Conditional DETR for Fast Training Convergence

3 code implementations • ICCV 2021 • Depu Meng, Xiaokang Chen, Zejia Fan, Gang Zeng, Houqiang Li, Yuhui Yuan, Lei Sun, Jingdong Wang

Our approach, named conditional DETR, learns a conditional spatial query from the decoder embedding for decoder multi-head cross-attention.

Object object-detection +1

124,984

Paper
Code

Joint Implicit Image Function for Guided Depth Super-Resolution

1 code implementation • 19 Jul 2021 • Jiaxiang Tang, Xiaokang Chen, Gang Zeng

Inspired by the recent progress in implicit neural representation, we propose to formulate the guided super-resolution as a neural implicit image interpolation problem, where we take the form of a general image interpolation but use a novel Joint Implicit Image Function (JIIF) representation to learn both the interpolation weights and values.

Graph Attention Super-Resolution

Paper
Code

Semi-Supervised Semantic Segmentation with Cross Pseudo Supervision

3 code implementations • CVPR 2021 • Xiaokang Chen, Yuhui Yuan, Gang Zeng, Jingdong Wang

Our approach imposes the consistency on two segmentation networks perturbed with different initialization for the same input image.

Ranked #2 on Semi-Supervised Semantic Segmentation on WoodScape

Segmentation Semi-Supervised Semantic Segmentation

475

Paper
Code

Semantic Point Completion Network for 3D Semantic Scene Completion

no code implementations • ECAI 2020 • Min Zhong, Gang Zeng

In this work, a Semantic Point Completion Network (SPCNet) is proposed to address SSC in the point cloud space.

Ranked #8 on 3D Semantic Scene Completion on NYUv2

3D Semantic Scene Completion Semantic Segmentation

Paper
Add Code

Malleable 2.5D Convolution: Learning Receptive Fields along the Depth-axis for RGB-D Scene Parsing

2 code implementations • ECCV 2020 • Yajie Xing, Jingbo Wang, Gang Zeng

In this paper, we propose a novel operator called malleable 2. 5D convolution to learn the receptive field along the depth-axis.

Ranked #46 on Semantic Segmentation on NYU Depth v2

Scene Parsing Semantic Segmentation

273

Paper
Code

Bi-directional Cross-Modality Feature Propagation with Separation-and-Aggregation Gate for RGB-D Semantic Segmentation

2 code implementations • ECCV 2020 • Xiaokang Chen, Kwan-Yee Lin, Jingbo Wang, Wayne Wu, Chen Qian, Hongsheng Li, Gang Zeng

Depth information has proven to be a useful cue in the semantic segmentation of RGB-D images for providing a geometric counterpart to the RGB representation.

Ranked #3 on Semantic Segmentation on Event-based Segmentation Dataset

Segmentation Semantic Segmentation +2

273

Paper
Code

3D Sketch-aware Semantic Scene Completion via Semi-supervised Structure Prior

2 code implementations • CVPR 2020 • Xiaokang Chen, Kwan-Yee Lin, Chen Qian, Gang Zeng, Hongsheng Li

To this end, we first propose a novel 3D sketch-aware feature embedding to explicitly encode geometric information effectively and efficiently.

Ranked #3 on 3D Semantic Scene Completion from a single RGB image on NYUv2

3D Semantic Scene Completion from a single RGB image Hallucination

Paper
Code

Neural Style Transfer via Meta Networks

no code implementations • CVPR 2018 • Falong Shen, Shuicheng Yan, Gang Zeng

Recent works on style transfer typically need to train image transformation networks for every new style, and the style is encoded in the network parameters by enormous iterations of stochastic gradient descent, which lacks the generalization ability to new style in the inference stage.

Style Transfer

Paper
Add Code

Meta Networks for Neural Style Transfer

1 code implementation • 13 Sep 2017 • Falong Shen, Shuicheng Yan, Gang Zeng

Style Transfer

127

Paper
Code

Semantic Segmentation via Structured Patch Prediction, Context CRF and Guidance CRF

1 code implementation • CVPR 2017 • Falong Shen, Rui Gan, Shuicheng Yan, Gang Zeng

The proposed joint model also employs a guidance CRF to further enhance the segmentation performance.

Image Segmentation Scene Parsing +2

Paper
Code

Weighted Residuals for Very Deep Networks

no code implementations • 28 May 2016 • Falong Shen, Gang Zeng

The weighted residual network is able to learn to combine residuals from different layers effectively and efficiently.

Paper
Add Code

Fast Semantic Image Segmentation with High Order Context and Guided Filtering

no code implementations • 13 May 2016 • Falong Shen, Gang Zeng

This paper describes a fast and accurate semantic image segmentation approach that encodes not only the discriminative features from deep neural networks, but also the high-order context compatibility among adjacent objects as well as low level image features.

Image Segmentation Semantic Segmentation +1

Paper
Add Code

Similarity-Aware Patchwork Assembly for Depth Image Super-Resolution

no code implementations • CVPR 2014 • Jing Li, Zhichao Lu, Gang Zeng, Rui Gan, Hongbin Zha

This paper describes a patchwork assembly algorithm for depth image super-resolution.

Image Super-Resolution

Paper
Add Code

Fast Approximate $K$-Means via Cluster Closures

no code implementations • 11 Dec 2013 • Jingdong Wang, Jing Wang, Qifa Ke, Gang Zeng, Shipeng Li

Traditional $k$-means is an iterative algorithm---in each iteration new cluster centers are computed and each data point is re-assigned to its nearest center.

Clustering Image Retrieval +1

Paper
Add Code

Fast Neighborhood Graph Search using Cartesian Concatenation

no code implementations • 11 Dec 2013 • Jingdong Wang, Jing Wang, Gang Zeng, Rui Gan, Shipeng Li, Baining Guo

This structure augments the neighborhood graph with a bridge graph.

Paper
Add Code

Scalable $k$-NN graph construction

no code implementations • 30 Jul 2013 • Jingdong Wang, Jing Wang, Gang Zeng, Zhuowen Tu, Rui Gan, Shipeng Li

The $k$-NN graph has played a central role in increasingly popular data-driven techniques for various learning and vision tasks; yet, finding an efficient and effective way to construct $k$-NN graphs remains a challenge, especially for large-scale high-dimensional data.

graph construction

Paper
Add Code

Supervised Kernel Descriptors for Visual Recognition

no code implementations • CVPR 2013 • Peng Wang, Jingdong Wang, Gang Zeng, Weiwei Xu, Hongbin Zha, Shipeng Li

In visual recognition tasks, the design of low level image feature representation is fundamental.

General Classification Image Classification

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.