Search Results for author: Qingshan Xu

Found 22 papers, 7 papers with code

CARE Transformer: Mobile-Friendly Linear Visual Transformer via Decoupled Dual Interaction

no code implementations25 Nov 2024 Yuan Zhou, Qingshan Xu, Jiequan Cui, Junbao Zhou, Jing Zhang, Richang Hong, Hanwang Zhang

In this paper, we propose a new de\textbf{C}oupled du\textbf{A}l-interactive linea\textbf{R} att\textbf{E}ntion (CARE) mechanism, revealing that features' decoupling and interaction can fully unleash the power of linear attention.

Inductive Bias

Mining and Transferring Feature-Geometry Coherence for Unsupervised Point Cloud Registration

1 code implementation4 Nov 2024 Kezheng Xiong, Haoen Xiang, Qingshan Xu, Chenglu Wen, Siqi Shen, Jonathan Li, Cheng Wang

Point cloud registration, a fundamental task in 3D vision, has achieved remarkable success with learning-based methods in outdoor environments.

Contrastive Learning Point Cloud Registration +1

Few-shot NeRF by Adaptive Rendering Loss Regularization

no code implementations23 Oct 2024 Qingshan Xu, Xuanyu Yi, Jianyao Xu, Wenbing Tao, Yew-Soon Ong, Hanwang Zhang

In this work, we reveal that there exists an inconsistency between the frequency regularization of PE and rendering loss.

Novel View Synthesis

MVGamba: Unify 3D Content Generation as State Space Sequence Modeling

1 code implementation10 Jun 2024 Xuanyu Yi, Zike Wu, Qiuhong Shen, Qingshan Xu, Pan Zhou, Joo-Hwee Lim, Shuicheng Yan, Xinchao Wang, Hanwang Zhang

Recent 3D large reconstruction models (LRMs) can generate high-quality 3D content in sub-seconds by integrating multi-view diffusion models with scalable multi-view reconstructors.

3D Generation Attribute

PGAHum: Prior-Guided Geometry and Appearance Learning for High-Fidelity Animatable Human Reconstruction

no code implementations22 Apr 2024 Hao Wang, Qingshan Xu, Hongyuan Chen, Rui Ma

In this work, we introduce PGAHum, a prior-guided geometry and appearance learning framework for high-fidelity animatable human reconstruction.

Neural Rendering Representation Learning

Diffusion Time-step Curriculum for One Image to 3D Generation

1 code implementation CVPR 2024 Xuanyu Yi, Zike Wu, Qingshan Xu, Pan Zhou, Joo-Hwee Lim, Hanwang Zhang

Score distillation sampling~(SDS) has been widely adopted to overcome the absence of unseen views in reconstructing 3D objects from a \textbf{single} image.

3D Generation Image to 3D +1

Precise-Physics Driven Text-to-3D Generation

no code implementations19 Mar 2024 Qingshan Xu, Jiao Liu, Melvin Wong, Caishun Chen, Yew-Soon Ong

However, existing generative methods mostly focus on geometric or visual plausibility while ignoring precise physics perception for the generated 3D shapes.

3D Generation Text to 3D

TaylorGrid: Towards Fast and High-Quality Implicit Field Learning via Direct Taylor-based Grid Optimization

no code implementations22 Feb 2024 Renyi Mao, Qingshan Xu, Peng Zheng, Ye Wang, Tieru Wu, Rui Ma

In this paper, we aim for both fast and high-quality implicit field learning, and propose TaylorGrid, a novel implicit field representation which can be efficiently computed via direct Taylor expansion optimization on 2D or 3D grids.

3D geometry Novel View Synthesis

PSDF: Prior-Driven Neural Implicit Surface Learning for Multi-view Reconstruction

no code implementations23 Jan 2024 Wanjuan Su, Chen Zhang, Qingshan Xu, Wenbing Tao

While NISR has shown impressive results on simple scenes, it remains challenging to recover delicate geometry from uncontrolled real-world scenes which is caused by its underconstrained optimization.

Surface Reconstruction

Global and Hierarchical Geometry Consistency Priors for Few-shot NeRFs in Indoor Scenes

1 code implementation CVPR 2024 Xiaotian Sun, Qingshan Xu, Xinjie Yang, Yu Zang, Cheng Wang

In this work we present P\textsuperscript 2 NeRF to capture global and hierarchical geometry consistency priors from pretrained models thus facilitating few-shot NeRFs in 360^\circ outward-facing indoor scenes.

Depth Prediction Monocular Depth Estimation

SPEAL: Skeletal Prior Embedded Attention Learning for Cross-Source Point Cloud Registration

no code implementations14 Dec 2023 Kezheng Xiong, Maoji Zheng, Qingshan Xu, Chenglu Wen, Siqi Shen, Cheng Wang

To the best of our knowledge, our approach is the first to facilitate point cloud registration with skeletal geometric priors.

Benchmarking Point Cloud Registration

PG-NeuS: Robust and Efficient Point Guidance for Multi-View Neural Surface Reconstruction

no code implementations12 Oct 2023 Chen Zhang, Wanjuan Su, Qingshan Xu, Wenbing Tao

Recently, learning multi-view neural surface reconstruction with the supervision of point clouds or depth maps has been a promising way.

Surface Reconstruction

Hierarchical Prior Mining for Non-local Multi-View Stereo

no code implementations ICCV 2023 Chunlin Ren, Qingshan Xu, Shikun Zhang, Jiaqi Yang

3) A Hierarchical Prior Mining (HPM) framework, which is used to mine extensive non-local prior information at different scales to assist 3D model recovery, this strategy can achieve a considerable balance between the reconstruction of details and low-textured areas.

3D geometry

Geo-Neus: Geometry-Consistent Neural Implicit Surfaces Learning for Multi-view Reconstruction

1 code implementation31 May 2022 Qiancheng Fu, Qingshan Xu, Yew-Soon Ong, Wenbing Tao

Recently, neural implicit surfaces learning by volume rendering has become popular for multi-view reconstruction.

Surface Reconstruction

Non-local Recurrent Regularization Networks for Multi-view Stereo

no code implementations13 Oct 2021 Qingshan Xu, Martin R. Oswald, Wenbing Tao, Marc Pollefeys, Zhaopeng Cui

However, existing recurrent methods only model the local dependencies in the depth domain, which greatly limits the capability of capturing the global scene context along the depth dimension.

Depth Estimation

PVSNet: Pixelwise Visibility-Aware Multi-View Stereo Network

no code implementations15 Jul 2020 Qingshan Xu, Wenbing Tao

We present a pixelwise visibility network to learn the visibility information for different neighboring images before computing the multi-view similarity, and then construct an adaptive weighted cost volume with the visibility information.

3D Reconstruction

Learning Inverse Depth Regression for Multi-View Stereo with Correlation Cost Volume

2 code implementations26 Dec 2019 Qingshan Xu, Wenbing Tao

This can be attributed to the memory-consuming cost volume representation and inappropriate depth inference.

regression Stereo Matching

Planar Prior Assisted PatchMatch Multi-View Stereo

1 code implementation26 Dec 2019 Qingshan Xu, Wenbing Tao

In detail, we utilize a probabilistic graphical model to embed planar models into PatchMatch multi-view stereo and contribute a novel multi-view aggregated matching cost.

Depth Estimation Multi-View 3D Reconstruction

Multi-Scale Geometric Consistency Guided Multi-View Stereo

no code implementations CVPR 2019 Qingshan Xu, Wenbing Tao

For the depth estimation of low-textured areas, we further propose to combine ACMH with multi-scale geometric consistency guidance (ACMM) to obtain the reliable depth estimates for low-textured areas at coarser scales and guarantee that they can be propagated to finer scales.

Depth Estimation Multi-View 3D Reconstruction +1

Multi-View Stereo with Asymmetric Checkerboard Propagation and Multi-Hypothesis Joint View Selection

no code implementations21 May 2018 Qingshan Xu, Wenbing Tao

In computer vision domain, how to fast and accurately perform multiview stereo (MVS) is still a challenging problem.

Cannot find the paper you are looking for? You can Submit a new open access paper.