1 code implementation • 22 Apr 2024 • Yu-Xin Zhang, Jie Gui, Xiaofeng Cong, Xin Gong, Wenbing Tao
Point cloud registration (PCR) involves determining a rigid transformation that aligns one point cloud to another.
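As a rough illustration of what "determining a rigid transformation" means, the sketch below recovers a rotation and translation from two point sets with the classical SVD-based (Kabsch) least-squares solution. It assumes the point correspondences are already known and free of outliers, which real registration pipelines cannot assume, and it is not the method of the paper listed above. The function name `estimate_rigid_transform` is hypothetical.

```python
import numpy as np


def estimate_rigid_transform(source, target):
    """Least-squares R, t such that R @ source[i] + t is close to target[i].

    Assumes source and target are (N, 3) arrays with row-wise correspondences.
    """
    src_centroid = source.mean(axis=0)
    tgt_centroid = target.mean(axis=0)

    # Cross-covariance of the centered point sets.
    H = (source - src_centroid).T @ (target - tgt_centroid)
    U, _, Vt = np.linalg.svd(H)

    # Correct for a possible reflection so that det(R) = +1.
    d = np.sign(np.linalg.det(Vt.T @ U.T))
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    t = tgt_centroid - R @ src_centroid
    return R, t
```

Applying the result as `source @ R.T + t` maps the source cloud onto the target under the stated assumptions.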
1 code implementation • 7 Mar 2024 • Sijia Chen, En Yu, Jinyang Li, Wenbing Tao
In this study, we pioneer an exploration into the distribution patterns of tracking data and identify a pronounced long-tail distribution issue within existing MOT datasets.
no code implementations • 23 Jan 2024 • Wanjuan Su, Chen Zhang, Qingshan Xu, Wenbing Tao
While NISR has shown impressive results on simple scenes, recovering delicate geometry from uncontrolled real-world scenes remains challenging because its optimization is underconstrained.
no code implementations • 18 Dec 2023 • Jianyao Xu, Qingshan Xu, Xinyao Liao, Wanjuan Su, Chen Zhang, Yew-Soon Ong, Wenbing Tao
In this work, we propose a prior-based residual learning paradigm for fast multi-view neural surface reconstruction.
no code implementations • 5 Dec 2023 • Zhi Chen, Yufan Ren, Tong Zhang, Zheng Dang, Wenbing Tao, Sabine Süsstrunk, Mathieu Salzmann
We propose formulating PCR as a denoising diffusion probabilistic process, mapping noisy transformations to the ground truth.
no code implementations • 30 Nov 2023 • En Yu, Liang Zhao, Yana Wei, Jinrong Yang, Dongming Wu, Lingyu Kong, Haoran Wei, Tiancai Wang, Zheng Ge, Xiangyu Zhang, Wenbing Tao
Then, FIT requires MLLMs to first predict trajectories of related objects and then reason about potential future events based on them.
Ranked #61 on Visual Question Answering on MM-Vet
no code implementations • 12 Oct 2023 • Chen Zhang, Wanjuan Su, Qingshan Xu, Wenbing Tao
Recently, learning multi-view neural surface reconstruction with the supervision of point clouds or depth maps has become a promising approach.
no code implementations • 23 May 2023 • En Yu, Tiancai Wang, Zhuoling Li, Yuang Zhang, Xiangyu Zhang, Wenbing Tao
Although end-to-end multi-object trackers like MOTR enjoy the merits of simplicity, they suffer seriously from the conflict between detection and association, resulting in unsatisfactory convergence dynamics.
no code implementations • 10 Feb 2023 • Senmao Cheng, Fan Yang, Zhi Chen, Nanjun Yuan, Wenbing Tao
To our knowledge, the proposed DSeam is the first deep-learning-based seam prediction method for image stitching.
1 code implementation • ICCV 2023 • Chen Zhang, Ganzhangqin Yuan, Wenbing Tao
We model the Delaunay triangulation as a dual graph, extract local geometric information from the points, and embed it into the structural representation of the Delaunay triangulation in an organic way, which benefits the reconstruction of fine-grained details.
no code implementations • 3 Dec 2022 • En Yu, Songtao Liu, Zhuoling Li, Jinrong Yang, Zeming Li, Shoudong Han, Wenbing Tao
VLM joins the information in the generated visual prompts and the textual prompts from a pre-defined Trackbook to obtain instance-level pseudo textual descriptions, which are domain invariant across different tracking scenes.
no code implementations • 23 Aug 2022 • Jinrong Yang, En Yu, Zeming Li, Xiaoping Li, Wenbing Tao
Recent advanced works generally employ a series of object attributes, e.g., position, size, velocity, and appearance, to provide clues for association in 3D MOT.
1 code implementation • 31 May 2022 • Qiancheng Fu, Qingshan Xu, Yew-Soon Ong, Wenbing Tao
Recently, learning neural implicit surfaces by volume rendering has become popular for multi-view reconstruction.
1 code implementation • 28 Mar 2022 • Zhi Chen, Kun Sun, Fan Yang, Wenbing Tao
In this paper, we present SC^2-PCR, an efficient and robust point cloud registration (PCR) method based on a second-order spatial compatibility (SC^2) measure.
no code implementations • CVPR 2022 • Zhi Chen, Kun Sun, Fan Yang, Wenbing Tao
In this paper, we present SC^2-PCR, an efficient and robust point cloud registration (PCR) method based on a second-order spatial compatibility (SC^2) measure.
Ranked #1 on Point Cloud Registration on FP-O-H
1 code implementation • 28 Dec 2021 • Zhi Chen, Fan Yang, Wenbing Tao
In this paper, we propose a neural network named DetarNet to decouple the translation $t$ and rotation $R$, so as to overcome the performance degradation due to their mutual interference in point cloud registration.
no code implementations • 13 Oct 2021 • Qingshan Xu, Martin R. Oswald, Wenbing Tao, Marc Pollefeys, Zhaopeng Cui
However, existing recurrent methods only model the local dependencies in the depth domain, which greatly limits the capability of capturing the global scene context along the depth dimension.
no code implementations • 17 Aug 2021 • Lin Zhao, Hui Zhou, Xinge Zhu, Xiao Song, Hongsheng Li, Wenbing Tao
However, two major issues of the fusion between camera and LiDAR hinder its performance, i.e., how to effectively fuse these two modalities and how to precisely align them (suffering from the weak spatiotemporal synchronization problem).
1 code implementation • 31 Jan 2021 • Zhi Chen, Fan Yang, Wenbing Tao
We then propose a hybrid attention block to extract features, which integrates Bayesian attentive context normalization (BACN) and channel-wise attention (CA).
1 code implementation • 25 Jan 2021 • Yiming Luo, Zhenxing Mi, Wenbing Tao
DeepDT learns to predict inside/outside labels of Delaunay tetrahedrons directly from a point cloud and corresponding Delaunay triangulation.
no code implementations • 15 Jul 2020 • Qingshan Xu, Wenbing Tao
We present a pixelwise visibility network to learn the visibility information for different neighboring images before computing the multi-view similarity, and then construct an adaptive weighted cost volume with the visibility information.
no code implementations • 11 Jul 2020 • Zhi Chen, Fan Yang, Wenbing Tao
We then propose a hybrid attention block to extract features, which integrates Bayesian attentive context normalization (BACN) and channel-wise attention (CA).
1 code implementation • 26 Dec 2019 • Qingshan Xu, Wenbing Tao
In detail, we utilize a probabilistic graphical model to embed planar models into PatchMatch multi-view stereo and contribute a novel multi-view aggregated matching cost.
2 code implementations • 26 Dec 2019 • Qingshan Xu, Wenbing Tao
This can be attributed to the memory-consuming cost volume representation and inappropriate depth inference.
2 code implementations • 20 Dec 2019 • Lin Zhao, Wenbing Tao
In this paper, we propose a novel joint instance and semantic segmentation approach, which is called JSNet, in order to address the instance and semantic segmentation of 3D point clouds simultaneously.
Ranked #2 on Semantic Segmentation on ShapeNet
1 code implementation • 11 Dec 2019 • Li Zhu, Zihao Xie, Liman Liu, Bo Tao, Wenbing Tao
The Region Proposal Network (RPN) is the cornerstone of two-stage object detectors: it generates a sparse set of object proposals and alleviates the extreme foreground-background class imbalance problem during training.
no code implementations • CVPR 2020 • Zhenxing Mi, Yiming Luo, Wenbing Tao
Existing learning-based surface reconstruction methods from point clouds are still facing challenges in terms of scalability and preservation of details on large-scale point clouds.
no code implementations • 6 Nov 2019 • Zihao Xie, Wenbing Tao, Li Zhu, Lin Zhao
In this paper, building on discrimination-aware channel pruning (DCP), a state-of-the-art pruning method for classification, we propose a localization-aware auxiliary network that finds the channels carrying key information for classification and regression, so that channel pruning can be conducted directly for object detection, which saves considerable time and computing resources.
no code implementations • 28 Sep 2019 • Zhi Chen, Fan Yang, Wenbing Tao
To establish the link between Fn-score and loss, we propose to guide the loss with the Fn-score directly.
no code implementations • CVPR 2019 • Qingshan Xu, Wenbing Tao
For the depth estimation of low-textured areas, we further propose to combine ACMH with multi-scale geometric consistency guidance (ACMM) to obtain reliable depth estimates at coarser scales and guarantee that they can be propagated to finer scales.
Ranked #9 on Point Clouds on Tanks and Temples
no code implementations • 23 May 2018 • Tao Xu, Kun Sun, Wenbing Tao
In this paper, we propose a GPU-accelerated image matching method with improved Cascade Hashing.
no code implementations • 21 May 2018 • Qingshan Xu, Wenbing Tao
In the computer vision domain, performing multi-view stereo (MVS) quickly and accurately remains a challenging problem.
no code implementations • 21 Dec 2016 • Kun Sun, Wenbing Tao
Accuracy and efficiency are two key problems in large-scale incremental Structure from Motion (SfM).
no code implementations • 14 Nov 2016 • Kai Chen, Wenbing Tao
In this paper, we propose a Convolutional Regression framework for visual tracking (CRT).
no code implementations • 26 Apr 2016 • Kai Chen, Wenbing Tao
As a result, the model needs to be initialized and retrained for different objects.
no code implementations • CVPR 2014 • Wenbing Tao, Kun Sun
Probabilistic methods based on the Symmetrical Gauss Mixture Model (SGMM) have achieved great success in point set registration, but are seldom used to find correspondences between two images due to the complexity of the non-rigid transformation and the large number of outliers.