Search Results for author: Wenbing Tao

Found 36 papers, 12 papers with code

A Comprehensive Survey and Taxonomy on Point Cloud Registration Based on Deep Learning

1 code implementation • 22 Apr 2024 • Yu-Xin Zhang, Jie Gui, Xiaofeng Cong, Xin Gong, Wenbing Tao

Point cloud registration (PCR) involves determining a rigid transformation that aligns one point cloud to another.

Paper
Code

Delving into the Trajectory Long-tail Distribution for Muti-object Tracking

1 code implementation • 7 Mar 2024 • Sijia Chen, En Yu, Jinyang Li, Wenbing Tao

In this study, we pioneer an exploration into the distribution patterns of tracking data and identify a pronounced long-tail distribution issue within existing MOT datasets.

Data Augmentation Multiple Object Tracking +1

Paper
Code

PSDF: Prior-Driven Neural Implicit Surface Learning for Multi-view Reconstruction

no code implementations • 23 Jan 2024 • Wanjuan Su, Chen Zhang, Qingshan Xu, Wenbing Tao

While NISR has shown impressive results on simple scenes, it remains challenging to recover delicate geometry from uncontrolled real-world scenes which is caused by its underconstrained optimization.

Surface Reconstruction

Paper
Add Code

PR-NeuS: A Prior-based Residual Learning Paradigm for Fast Multi-view Neural Surface Reconstruction

no code implementations • 18 Dec 2023 • Jianyao Xu, Qingshan Xu, Xinyao Liao, Wanjuan Su, Chen Zhang, Yew-Soon Ong, Wenbing Tao

In this work, we propose a prior-based residual learning paradigm for fast multi-view neural surface reconstruction.

Surface Reconstruction

Paper
Add Code

DiffusionPCR: Diffusion Models for Robust Multi-Step Point Cloud Registration

no code implementations • 5 Dec 2023 • Zhi Chen, Yufan Ren, Tong Zhang, Zheng Dang, Wenbing Tao, Sabine Süsstrunk, Mathieu Salzmann

We propose formulating PCR as a denoising diffusion probabilistic process, mapping noisy transformations to the ground truth.

Denoising Point Cloud Registration

Paper
Add Code

Merlin:Empowering Multimodal LLMs with Foresight Minds

no code implementations • 30 Nov 2023 • En Yu, Liang Zhao, Yana Wei, Jinrong Yang, Dongming Wu, Lingyu Kong, Haoran Wei, Tiancai Wang, Zheng Ge, Xiangyu Zhang, Wenbing Tao

Then, FIT requires MLLMs to first predict trajectories of related objects and then reason about potential future events based on them.

Ranked #61 on Visual Question Answering on MM-Vet

Visual Question Answering

Paper
Add Code

PG-NeuS: Robust and Efficient Point Guidance for Multi-View Neural Surface Reconstruction

no code implementations • 12 Oct 2023 • Chen Zhang, Wanjuan Su, Qingshan Xu, Wenbing Tao

Recently, learning multi-view neural surface reconstruction with the supervision of point clouds or depth maps has been a promising way.

Surface Reconstruction

Paper
Add Code

MOTRv3: Release-Fetch Supervision for End-to-End Multi-Object Tracking

no code implementations • 23 May 2023 • En Yu, Tiancai Wang, Zhuoling Li, Yuang Zhang, Xiangyu Zhang, Wenbing Tao

Although end-to-end multi-object trackers like MOTR enjoy the merits of simplicity, they suffer from the conflict between detection and association seriously, resulting in unsatisfactory convergence dynamics.

Denoising Multi-Object Tracking +1

Paper
Add Code

Deep Seam Prediction for Image Stitching Based on Selection Consistency Loss

no code implementations • 10 Feb 2023 • Senmao Cheng, Fan Yang, Zhi Chen, Nanjun Yuan, Wenbing Tao

To our knowledge, the proposed DSeam is the first deep learning based seam prediction method for image stitching.

Image Stitching

Paper
Add Code

DMNet: Delaunay Meshing Network for 3D Shape Representation

1 code implementation • ICCV 2023 • Chen Zhang, Ganzhangqin Yuan, Wenbing Tao

We model the Delaunay triangulation as a dual graph, extract local geometric information from the points, and embed it into the structural representation of Delaunay triangulation in an organic way, benefiting fine-grained details reconstruction.

3D Shape Representation

Paper
Code

Generalizing Multiple Object Tracking to Unseen Domains by Introducing Natural Language Representation

no code implementations • 3 Dec 2022 • En Yu, Songtao Liu, Zhuoling Li, Jinrong Yang, Zeming Li, Shoudong Han, Wenbing Tao

VLM joints the information in the generated visual prompts and the textual prompts from a pre-defined Trackbook to obtain instance-level pseudo textual description, which is domain invariant to different tracking scenes.

Domain Generalization Multi-Object Tracking +1

Paper
Add Code

Quality Matters: Embracing Quality Clues for Robust 3D Multi-Object Tracking

no code implementations • 23 Aug 2022 • Jinrong Yang, En Yu, Zeming Li, Xiaoping Li, Wenbing Tao

Recent advanced works generally employ a series of object attributes, e. g., position, size, velocity, and appearance, to provide the clues for the association in 3D MOT.

3D Multi-Object Tracking 3D Object Detection +2

Paper
Add Code

Geo-Neus: Geometry-Consistent Neural Implicit Surfaces Learning for Multi-view Reconstruction

1 code implementation • 31 May 2022 • Qiancheng Fu, Qingshan Xu, Yew-Soon Ong, Wenbing Tao

Recently, neural implicit surfaces learning by volume rendering has become popular for multi-view reconstruction.

Surface Reconstruction

287

Paper
Code

SC^2-PCR: A Second Order Spatial Compatibility for Efficient and Robust Point Cloud Registration

1 code implementation • 28 Mar 2022 • Zhi Chen, Kun Sun, Fan Yang, Wenbing Tao

In this paper, we present a second order spatial compatibility (SC^2) measure based method for efficient and robust point cloud registration (PCR), called SC^2-PCR.

Point Cloud Registration

134

Paper
Code

SC2-PCR: A Second Order Spatial Compatibility for Efficient and Robust Point Cloud Registration

no code implementations • CVPR 2022 • Zhi Chen, Kun Sun, Fan Yang, Wenbing Tao

In this paper, we present a second order spatial compatibility (SC^2) measure based method for efficient and robust point cloud registration (PCR), called SC^2-PCR.

Ranked #1 on Point Cloud Registration on FP-O-H

Image to Point Cloud Registration

Paper
Add Code

DetarNet: Decoupling Translation and Rotation by Siamese Network for Point Cloud Registration

1 code implementation • 28 Dec 2021 • Zhi Chen, Fan Yang, Wenbing Tao

In this paper, we propose a neural network named DetarNet to decouple the translation $t$ and rotation $R$, so as to overcome the performance degradation due to their mutual interference in point cloud registration.

Point Cloud Registration Translation

Paper
Code

Non-local Recurrent Regularization Networks for Multi-view Stereo

no code implementations • 13 Oct 2021 • Qingshan Xu, Martin R. Oswald, Wenbing Tao, Marc Pollefeys, Zhaopeng Cui

However, existing recurrent methods only model the local dependencies in the depth domain, which greatly limits the capability of capturing the global scene context along the depth dimension.

Depth Estimation

Paper
Add Code

LIF-Seg: LiDAR and Camera Image Fusion for 3D LiDAR Semantic Segmentation

no code implementations • 17 Aug 2021 • Lin Zhao, Hui Zhou, Xinge Zhu, Xiao Song, Hongsheng Li, Wenbing Tao

However, two major issues of the fusion between camera and LiDAR hinder its performance, \ie, how to effectively fuse these two modalities and how to precisely align them (suffering from the weak spatiotemporal synchronization problem).

Autonomous Driving LIDAR Semantic Segmentation +1

Paper
Add Code

Cascade Network with Guided Loss and Hybrid Attention for Finding Good Correspondences

1 code implementation • 31 Jan 2021 • Zhi Chen, Fan Yang, Wenbing Tao

We then propose a hybrid attention block to extract feature, which integrates the Bayesian attentive context normalization (BACN) and channel-wise attention (CA).

Paper
Code

DeepDT: Learning Geometry From Delaunay Triangulation for Surface Reconstruction

1 code implementation • 25 Jan 2021 • Yiming Luo, Zhenxing Mi, Wenbing Tao

DeepDT learns to predict inside/outside labels of Delaunay tetrahedrons directly from a point cloud and corresponding Delaunay triangulation.

Surface Reconstruction

Paper
Code

PVSNet: Pixelwise Visibility-Aware Multi-View Stereo Network

no code implementations • 15 Jul 2020 • Qingshan Xu, Wenbing Tao

We present a pixelwise visibility network to learn the visibility information for different neighboring images before computing the multi-view similarity, and then construct an adaptive weighted cost volume with the visibility information.

3D Reconstruction

Paper
Add Code

Cascade Network with Guided Loss and Hybrid Attention for Two-view Geometry

no code implementations • 11 Jul 2020 • Zhi Chen, Fan Yang, Wenbing Tao

We then propose a hybrid attention block to extract feature, which integrates the bayesian attentive context normalization (BACN) and channel-wise attention (CA).

Paper
Add Code

Planar Prior Assisted PatchMatch Multi-View Stereo

1 code implementation • 26 Dec 2019 • Qingshan Xu, Wenbing Tao

In detail, we utilize a probabilistic graphical model to embed planar models into PatchMatch multi-view stereo and contribute a novel multi-view aggregated matching cost.

Depth Estimation

178

Paper
Code

Learning Inverse Depth Regression for Multi-View Stereo with Correlation Cost Volume

2 code implementations • 26 Dec 2019 • Qingshan Xu, Wenbing Tao

This can be attributed to the memory-consuming cost volume representation and inappropriate depth inference.

regression Stereo Matching

263

Paper
Code

JSNet: Joint Instance and Semantic Segmentation of 3D Point Clouds

2 code implementations • 20 Dec 2019 • Lin Zhao, Wenbing Tao

In this paper, we propose a novel joint instance and semantic segmentation approach, which is called JSNet, in order to address the instance and semantic segmentation of 3D point clouds simultaneously.

Ranked #2 on Semantic Segmentation on ShapeNet

3D Instance Segmentation Clustering +2

100

Paper
Code

IoU-uniform R-CNN: Breaking Through the Limitations of RPN

1 code implementation • 11 Dec 2019 • Li Zhu, Zihao Xie, Liman Liu, Bo Tao, Wenbing Tao

Region Proposal Network (RPN) is the cornerstone of two-stage object detectors, it generates a sparse set of object proposals and alleviates the extrem foregroundbackground class imbalance problem during training.

Object object-detection +2

Paper
Code

SSRNet: Scalable 3D Surface Reconstruction Network

no code implementations • CVPR 2020 • Zhenxing Mi, Yiming Luo, Wenbing Tao

Existing learning-based surface reconstruction methods from point clouds are still facing challenges in terms of scalability and preservation of details on large-scale point clouds.

Surface Reconstruction

Paper
Add Code

Localization-aware Channel Pruning for Object Detection

no code implementations • 6 Nov 2019 • Zihao Xie, Wenbing Tao, Li Zhu, Lin Zhao

In this paper, based on discrimination-aware channel pruning (DCP) which is state-of-the-art pruning method for classification, we propose a localization-aware auxiliary network to find out the channels with key information for classification and regression so that we can conduct channel pruning directly for object detection, which saves lots of time and computing resources.

Classification General Classification +5

Paper
Add Code

GLA-Net: An Attention Network with Guided Loss for Mismatch Removal

no code implementations • 28 Sep 2019 • Zhi Chen, Fan Yang, Wenbing Tao

To establish the link between Fn-score and loss, we propose to guide the loss with the Fn-score directly.

Binary Classification

Paper
Add Code

Multi-Scale Geometric Consistency Guided Multi-View Stereo

no code implementations • CVPR 2019 • Qingshan Xu, Wenbing Tao

For the depth estimation of low-textured areas, we further propose to combine ACMH with multi-scale geometric consistency guidance (ACMM) to obtain the reliable depth estimates for low-textured areas at coarser scales and guarantee that they can be propagated to finer scales.

Ranked #9 on Point Clouds on Tanks and Temples

Depth Estimation Point Clouds

Paper
Add Code

GPU Accelerated Cascade Hashing Image Matching for Large Scale 3D Reconstruction

no code implementations • 23 May 2018 • Tao Xu, Kun Sun, Wenbing Tao

In this paper, we proposed a GPU accelerated image matching method with improved Cascade Hashing.

3D Reconstruction

Paper
Add Code

Multi-View Stereo with Asymmetric Checkerboard Propagation and Multi-Hypothesis Joint View Selection

no code implementations • 21 May 2018 • Qingshan Xu, Wenbing Tao

In computer vision domain, how to fast and accurately perform multiview stereo (MVS) is still a challenging problem.

Paper
Add Code

Trilaminar Multiway Reconstruction Tree for Efficient Large Scale Structure from Motion

no code implementations • 21 Dec 2016 • Kun Sun, Wenbing Tao

Accuracy and efficiency are two key problems in large scale incremental Structure from Motion (SfM).

Paper
Add Code

Convolutional Regression for Visual Tracking

no code implementations • 14 Nov 2016 • Kai Chen, Wenbing Tao

In this paper, we propose a Convolutional Regression framework for visual tracking (CRT).

regression Visual Object Tracking +1

Paper
Add Code

Once for All: a Two-flow Convolutional Neural Network for Visual Tracking

no code implementations • 26 Apr 2016 • Kai Chen, Wenbing Tao

As a result, the model need to be initialized and retrained for different objects.

Object Visual Object Tracking +1

Paper
Add Code

Asymmetrical Gauss Mixture Models for Point Sets Matching

no code implementations • CVPR 2014 • Wenbing Tao, Kun Sun

The probabilistic methods based on Symmetrical Gauss Mixture Model (SGMM) have achieved great success in point sets registration, but are seldom used to find the correspondences between two images due to the complexity of the non-rigid transformation and too many outliers.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.