Search Results for author: Wenbing Tao

Found 36 papers, 12 papers with code

A Comprehensive Survey and Taxonomy on Point Cloud Registration Based on Deep Learning

1 code implementation22 Apr 2024 Yu-Xin Zhang, Jie Gui, Xiaofeng Cong, Xin Gong, Wenbing Tao

Point cloud registration (PCR) involves determining a rigid transformation that aligns one point cloud to another.

Point Cloud Registration

Delving into the Trajectory Long-tail Distribution for Muti-object Tracking

1 code implementation7 Mar 2024 Sijia Chen, En Yu, Jinyang Li, Wenbing Tao

In this study, we pioneer an exploration into the distribution patterns of tracking data and identify a pronounced long-tail distribution issue within existing MOT datasets.

Data Augmentation Multiple Object Tracking +1

PSDF: Prior-Driven Neural Implicit Surface Learning for Multi-view Reconstruction

no code implementations23 Jan 2024 Wanjuan Su, Chen Zhang, Qingshan Xu, Wenbing Tao

While NISR has shown impressive results on simple scenes, it remains challenging to recover delicate geometry from uncontrolled real-world scenes which is caused by its underconstrained optimization.

Surface Reconstruction

DiffusionPCR: Diffusion Models for Robust Multi-Step Point Cloud Registration

no code implementations5 Dec 2023 Zhi Chen, Yufan Ren, Tong Zhang, Zheng Dang, Wenbing Tao, Sabine Süsstrunk, Mathieu Salzmann

We propose formulating PCR as a denoising diffusion probabilistic process, mapping noisy transformations to the ground truth.

Denoising Point Cloud Registration

Merlin:Empowering Multimodal LLMs with Foresight Minds

no code implementations30 Nov 2023 En Yu, Liang Zhao, Yana Wei, Jinrong Yang, Dongming Wu, Lingyu Kong, Haoran Wei, Tiancai Wang, Zheng Ge, Xiangyu Zhang, Wenbing Tao

Then, FIT requires MLLMs to first predict trajectories of related objects and then reason about potential future events based on them.

Visual Question Answering

PG-NeuS: Robust and Efficient Point Guidance for Multi-View Neural Surface Reconstruction

no code implementations12 Oct 2023 Chen Zhang, Wanjuan Su, Qingshan Xu, Wenbing Tao

Recently, learning multi-view neural surface reconstruction with the supervision of point clouds or depth maps has been a promising way.

Surface Reconstruction

MOTRv3: Release-Fetch Supervision for End-to-End Multi-Object Tracking

no code implementations23 May 2023 En Yu, Tiancai Wang, Zhuoling Li, Yuang Zhang, Xiangyu Zhang, Wenbing Tao

Although end-to-end multi-object trackers like MOTR enjoy the merits of simplicity, they suffer from the conflict between detection and association seriously, resulting in unsatisfactory convergence dynamics.

Denoising Multi-Object Tracking +1

Deep Seam Prediction for Image Stitching Based on Selection Consistency Loss

no code implementations10 Feb 2023 Senmao Cheng, Fan Yang, Zhi Chen, Nanjun Yuan, Wenbing Tao

To our knowledge, the proposed DSeam is the first deep learning based seam prediction method for image stitching.

Image Stitching

DMNet: Delaunay Meshing Network for 3D Shape Representation

1 code implementation ICCV 2023 Chen Zhang, Ganzhangqin Yuan, Wenbing Tao

We model the Delaunay triangulation as a dual graph, extract local geometric information from the points, and embed it into the structural representation of Delaunay triangulation in an organic way, benefiting fine-grained details reconstruction.

3D Shape Representation

Generalizing Multiple Object Tracking to Unseen Domains by Introducing Natural Language Representation

no code implementations3 Dec 2022 En Yu, Songtao Liu, Zhuoling Li, Jinrong Yang, Zeming Li, Shoudong Han, Wenbing Tao

VLM joints the information in the generated visual prompts and the textual prompts from a pre-defined Trackbook to obtain instance-level pseudo textual description, which is domain invariant to different tracking scenes.

Domain Generalization Multi-Object Tracking +1

Quality Matters: Embracing Quality Clues for Robust 3D Multi-Object Tracking

no code implementations23 Aug 2022 Jinrong Yang, En Yu, Zeming Li, Xiaoping Li, Wenbing Tao

Recent advanced works generally employ a series of object attributes, e. g., position, size, velocity, and appearance, to provide the clues for the association in 3D MOT.

3D Multi-Object Tracking 3D Object Detection +2

Geo-Neus: Geometry-Consistent Neural Implicit Surfaces Learning for Multi-view Reconstruction

1 code implementation31 May 2022 Qiancheng Fu, Qingshan Xu, Yew-Soon Ong, Wenbing Tao

Recently, neural implicit surfaces learning by volume rendering has become popular for multi-view reconstruction.

Surface Reconstruction

SC^2-PCR: A Second Order Spatial Compatibility for Efficient and Robust Point Cloud Registration

1 code implementation28 Mar 2022 Zhi Chen, Kun Sun, Fan Yang, Wenbing Tao

In this paper, we present a second order spatial compatibility (SC^2) measure based method for efficient and robust point cloud registration (PCR), called SC^2-PCR.

Point Cloud Registration

SC2-PCR: A Second Order Spatial Compatibility for Efficient and Robust Point Cloud Registration

no code implementations CVPR 2022 Zhi Chen, Kun Sun, Fan Yang, Wenbing Tao

In this paper, we present a second order spatial compatibility (SC^2) measure based method for efficient and robust point cloud registration (PCR), called SC^2-PCR.

Image to Point Cloud Registration

DetarNet: Decoupling Translation and Rotation by Siamese Network for Point Cloud Registration

1 code implementation28 Dec 2021 Zhi Chen, Fan Yang, Wenbing Tao

In this paper, we propose a neural network named DetarNet to decouple the translation $t$ and rotation $R$, so as to overcome the performance degradation due to their mutual interference in point cloud registration.

Point Cloud Registration Translation

Non-local Recurrent Regularization Networks for Multi-view Stereo

no code implementations13 Oct 2021 Qingshan Xu, Martin R. Oswald, Wenbing Tao, Marc Pollefeys, Zhaopeng Cui

However, existing recurrent methods only model the local dependencies in the depth domain, which greatly limits the capability of capturing the global scene context along the depth dimension.

Depth Estimation

LIF-Seg: LiDAR and Camera Image Fusion for 3D LiDAR Semantic Segmentation

no code implementations17 Aug 2021 Lin Zhao, Hui Zhou, Xinge Zhu, Xiao Song, Hongsheng Li, Wenbing Tao

However, two major issues of the fusion between camera and LiDAR hinder its performance, \ie, how to effectively fuse these two modalities and how to precisely align them (suffering from the weak spatiotemporal synchronization problem).

Autonomous Driving LIDAR Semantic Segmentation +1

Cascade Network with Guided Loss and Hybrid Attention for Finding Good Correspondences

1 code implementation31 Jan 2021 Zhi Chen, Fan Yang, Wenbing Tao

We then propose a hybrid attention block to extract feature, which integrates the Bayesian attentive context normalization (BACN) and channel-wise attention (CA).

DeepDT: Learning Geometry From Delaunay Triangulation for Surface Reconstruction

1 code implementation25 Jan 2021 Yiming Luo, Zhenxing Mi, Wenbing Tao

DeepDT learns to predict inside/outside labels of Delaunay tetrahedrons directly from a point cloud and corresponding Delaunay triangulation.

Surface Reconstruction

PVSNet: Pixelwise Visibility-Aware Multi-View Stereo Network

no code implementations15 Jul 2020 Qingshan Xu, Wenbing Tao

We present a pixelwise visibility network to learn the visibility information for different neighboring images before computing the multi-view similarity, and then construct an adaptive weighted cost volume with the visibility information.

3D Reconstruction

Cascade Network with Guided Loss and Hybrid Attention for Two-view Geometry

no code implementations11 Jul 2020 Zhi Chen, Fan Yang, Wenbing Tao

We then propose a hybrid attention block to extract feature, which integrates the bayesian attentive context normalization (BACN) and channel-wise attention (CA).

Planar Prior Assisted PatchMatch Multi-View Stereo

1 code implementation26 Dec 2019 Qingshan Xu, Wenbing Tao

In detail, we utilize a probabilistic graphical model to embed planar models into PatchMatch multi-view stereo and contribute a novel multi-view aggregated matching cost.

Depth Estimation

Learning Inverse Depth Regression for Multi-View Stereo with Correlation Cost Volume

2 code implementations26 Dec 2019 Qingshan Xu, Wenbing Tao

This can be attributed to the memory-consuming cost volume representation and inappropriate depth inference.

regression Stereo Matching

JSNet: Joint Instance and Semantic Segmentation of 3D Point Clouds

2 code implementations20 Dec 2019 Lin Zhao, Wenbing Tao

In this paper, we propose a novel joint instance and semantic segmentation approach, which is called JSNet, in order to address the instance and semantic segmentation of 3D point clouds simultaneously.

3D Instance Segmentation Clustering +2

IoU-uniform R-CNN: Breaking Through the Limitations of RPN

1 code implementation11 Dec 2019 Li Zhu, Zihao Xie, Liman Liu, Bo Tao, Wenbing Tao

Region Proposal Network (RPN) is the cornerstone of two-stage object detectors, it generates a sparse set of object proposals and alleviates the extrem foregroundbackground class imbalance problem during training.

Object object-detection +2

SSRNet: Scalable 3D Surface Reconstruction Network

no code implementations CVPR 2020 Zhenxing Mi, Yiming Luo, Wenbing Tao

Existing learning-based surface reconstruction methods from point clouds are still facing challenges in terms of scalability and preservation of details on large-scale point clouds.

Surface Reconstruction

Localization-aware Channel Pruning for Object Detection

no code implementations6 Nov 2019 Zihao Xie, Wenbing Tao, Li Zhu, Lin Zhao

In this paper, based on discrimination-aware channel pruning (DCP) which is state-of-the-art pruning method for classification, we propose a localization-aware auxiliary network to find out the channels with key information for classification and regression so that we can conduct channel pruning directly for object detection, which saves lots of time and computing resources.

Classification General Classification +5

GLA-Net: An Attention Network with Guided Loss for Mismatch Removal

no code implementations28 Sep 2019 Zhi Chen, Fan Yang, Wenbing Tao

To establish the link between Fn-score and loss, we propose to guide the loss with the Fn-score directly.

Binary Classification

Multi-Scale Geometric Consistency Guided Multi-View Stereo

no code implementations CVPR 2019 Qingshan Xu, Wenbing Tao

For the depth estimation of low-textured areas, we further propose to combine ACMH with multi-scale geometric consistency guidance (ACMM) to obtain the reliable depth estimates for low-textured areas at coarser scales and guarantee that they can be propagated to finer scales.

Depth Estimation Point Clouds

GPU Accelerated Cascade Hashing Image Matching for Large Scale 3D Reconstruction

no code implementations23 May 2018 Tao Xu, Kun Sun, Wenbing Tao

In this paper, we proposed a GPU accelerated image matching method with improved Cascade Hashing.

3D Reconstruction

Multi-View Stereo with Asymmetric Checkerboard Propagation and Multi-Hypothesis Joint View Selection

no code implementations21 May 2018 Qingshan Xu, Wenbing Tao

In computer vision domain, how to fast and accurately perform multiview stereo (MVS) is still a challenging problem.

Trilaminar Multiway Reconstruction Tree for Efficient Large Scale Structure from Motion

no code implementations21 Dec 2016 Kun Sun, Wenbing Tao

Accuracy and efficiency are two key problems in large scale incremental Structure from Motion (SfM).

Convolutional Regression for Visual Tracking

no code implementations14 Nov 2016 Kai Chen, Wenbing Tao

In this paper, we propose a Convolutional Regression framework for visual tracking (CRT).

regression Visual Object Tracking +1

Asymmetrical Gauss Mixture Models for Point Sets Matching

no code implementations CVPR 2014 Wenbing Tao, Kun Sun

The probabilistic methods based on Symmetrical Gauss Mixture Model (SGMM) have achieved great success in point sets registration, but are seldom used to find the correspondences between two images due to the complexity of the non-rigid transformation and too many outliers.

Cannot find the paper you are looking for? You can Submit a new open access paper.