1 code implementation • 1 Sep 2024 • Huixin Zhang, Guangming Wang, Xinrui Wu, Chenfeng Xu, Mingyu Ding, Masayoshi Tomizuka, Wei Zhan, Hesheng Wang
It consists of a pyramid structure with a spatial information reuse strategy, a sequential pose initialization module, a gated hierarchical pose refinement module, and a temporal feature propagation module.
no code implementations • 2 Jul 2024 • Tinghuai Wang, Guangming Wang, Kuan Eeik Tan
Convolutional neural networks (CNN) have made significant advances in detecting roads from satellite images.
no code implementations • 2 May 2024 • Guangming Wang, Lei Pan, Songyou Peng, Shaohui Liu, Chenfeng Xu, Yanzi Miao, Wei Zhan, Masayoshi Tomizuka, Marc Pollefeys, Hesheng Wang
Meticulous 3D environment representations have been a longstanding goal in computer vision and robotics fields.
no code implementations • 18 Mar 2024 • Wenhua Wu, Qi Wang, Guangming Wang, JunPing Wang, Tiankun Zhao, Yang Liu, Dongchao Gao, Zhe Liu, Hesheng Wang
To address this, we propose EMIE-MAP, a novel method for large-scale road surface reconstruction based on explicit mesh and implicit encoding.
no code implementations • 18 Mar 2024 • Wenhua Wu, Guangming Wang, Ting Deng, Sebastian Aegidius, Stuart Shanks, Valerio Modugno, Dimitrios Kanoulas, Hesheng Wang
Recent research on Simultaneous Localization and Mapping (SLAM) based on implicit representation has shown promising results in indoor environments.
1 code implementation • 13 Mar 2024 • Hao Shi, Song Wang, Jiaming Zhang, Xiaoting Yin, Zhongdao Wang, Guangming Wang, Jianke Zhu, Kailun Yang, Kaiwei Wang
Vision-based occupancy prediction, also known as 3D Semantic Scene Completion (SSC), presents a significant challenge in computer vision.
1 code implementation • 12 Mar 2024 • Siting Zhu, Renjie Qin, Guangming Wang, Jiuming Liu, Hesheng Wang
We propose SemGauss-SLAM, a dense semantic SLAM system utilizing 3D Gaussian representation, that enables accurate 3D semantic mapping, robust camera tracking, and high-quality rendering simultaneously.
1 code implementation • CVPR 2024 • Chaokang Jiang, Guangming Wang, Jiuming Liu, Hesheng Wang, Zhuang Ma, Zhenqiang Liu, Zhujin Liang, Yi Shan, Dalong Du
We present a novel approach from the perspective of auto-labelling, aiming to generate a large number of 3D scene flow pseudo labels for real-world LiDAR point clouds.
1 code implementation • CVPR 2024 • Jiuming Liu, Guangming Wang, Weicai Ye, Chaokang Jiang, Jinru Han, Zhe Liu, Guofeng Zhang, Dalong Du, Hesheng Wang
Furthermore we also develop an uncertainty estimation module within diffusion to evaluate the reliability of estimated scene flow.
1 code implementation • 29 Nov 2023 • Yu Zheng, Guangming Wang, Jiuming Liu, Marc Pollefeys, Hesheng Wang
Through the hash-based representation, we propose the Spherical Frustum sparse Convolution (SFC) and Frustum Fast Point Sampling (F2PS) to convolve and sample the points stored in spherical frustums respectively.
1 code implementation • 29 Nov 2023 • Jiuming Liu, Guangming Wang, Weicai Ye, Chaokang Jiang, Jinru Han, Zhe Liu, Guofeng Zhang, Dalong Du, Hesheng Wang
Furthermore, we also develop an uncertainty estimation module within diffusion to evaluate the reliability of estimated scene flow.
no code implementations • 10 Sep 2023 • Liang Song, Guangming Wang, Jiuming Liu, Zhenyang Fu, Yanzi Miao, Hesheng
By combining these modules, our approach successfully tackles the challenges of outdoor scene generalization, producing high-quality rendering results.
1 code implementation • ICCV 2023 • Chang Nie, Guangming Wang, Zhe Liu, Luca Cavalli, Marc Pollefeys, Hesheng Wang
Therefore, RLSAC can avoid differentiating to learn the features and the feedback of downstream tasks for end-to-end robust estimation.
1 code implementation • ICCV 2023 • Chensheng Peng, Guangming Wang, Xian Wan Lo, Xinrui Wu, Chenfeng Xu, Masayoshi Tomizuka, Wei Zhan, Hesheng Wang
Previous methods rarely predict scene flow from the entire point clouds of the scene with one-time inference due to the memory inefficiency and heavy overhead from distance calculation and sorting involved in commonly used farthest point sampling, KNN, and ball query algorithms for local feature aggregation.
no code implementations • 20 Jun 2023 • Guangming Wang, Yu Zheng, Yanfeng Guo, Zhe Liu, Yixiang Zhu, Wolfram Burgard, Hesheng Wang
A popular approach to robot localization is based on image-to-point cloud registration, which combines illumination-invariant LiDAR-based mapping with economical image-based localization.
1 code implementation • ICCV 2023 • Jiuming Liu, Guangming Wang, Zhe Liu, Chaokang Jiang, Marc Pollefeys, Hesheng Wang
Specifically, a projection-aware hierarchical transformer is proposed to capture long-range dependencies and filter outliers by extracting point features globally.
no code implementations • 27 Sep 2022 • Chaokang Jiang, Guangming Wang, Yanzi Miao, Hesheng Wang
The proposed method of self-supervised learning of 3D scene flow on real-world images is compared with a variety of methods for learning on the synthesized dataset and learning on LiDAR point clouds.
no code implementations • 15 Sep 2022 • Chaokang Jiang, Guangming Wang, Jinxing Wu, Yanzi Miao, Hesheng Wang
Promising complementarity exists between the texture features of color images and the geometric information of LiDAR point clouds.
no code implementations • 11 Sep 2022 • Guangming Wang, Zhiheng Feng, Chaokang Jiang, Hesheng Wang
Unlike the previous unsupervised learning of scene flow in point clouds, we propose to use odometry information to assist the unsupervised learning of scene flow and use real-world LiDAR data to train our network.
no code implementations • 4 Sep 2022 • Huiying Deng, Guangming Wang, Zhiheng Feng, Chaokang Jiang, Xinrui Wu, Yanzi Miao, Hesheng Wang
In order to make full use of the rich point cloud information provided by the pseudo-LiDAR, a projection-aware dense odometry pipeline is adopted.
1 code implementation • 19 Jul 2022 • Guangming Wang, Yunzhe Hu, Zhe Liu, Yiyang Zhou, Masayoshi Tomizuka, Wei Zhan, Hesheng Wang
Our proposed model surpasses all existing methods by at least 38. 2% on FlyingThings3D dataset and 24. 7% on KITTI Scene Flow dataset for EPE3D metric.
1 code implementation • 8 Jun 2022 • Guangming Wang, Xiaoyu Tian, Ruiqi Ding, Hesheng Wang
Unsupervised learning of scene flow in this paper mainly consists of two parts: (i) depth estimation and camera pose estimation, and (ii) scene flow estimation based on four different loss functions.
no code implementations • 30 Mar 2022 • Guangming Wang, Chensheng Peng, Jinpeng Zhang, Hesheng Wang
Specifically, through multi-scale interactive query and fusion between pixel-level and point-level features, our method, can obtain more distinguishing features to improve the performance of multiple object tracking.
no code implementations • 4 Mar 2022 • Yueling Shen, Guangming Wang, Hesheng Wang
we proposed a 3D MOT framework based on simultaneous optimization of object detection and scene flow estimation.
no code implementations • 6 Dec 2021 • Guangming Wang, Jiquan Zhong, Shijie Zhao, Wenhua Wu, Zhe Liu, Hesheng Wang
In this framework, the depth and pose estimations are hierarchically and mutually coupled to refine the estimated pose layer by layer.
1 code implementation • 3 Nov 2021 • Guangming Wang, Xinrui Wu, Shuyang Jiang, Zhe Liu, Hesheng Wang
An efficient 3D point cloud learning architecture, named EfficientLO-Net, for LiDAR odometry is first proposed in this paper.
no code implementations • 10 Sep 2021 • Guangming Wang, Yunzhe Hu, Xinrui Wu, Hesheng Wang
To solve the first problem, a novel context-aware set convolution layer is proposed in this paper to exploit contextual structure information of Euclidean space and learn soft aggregation weights for local point features.
no code implementations • 8 Jul 2021 • Guangming Wang, Shuaiqi Ren, Hesheng Wang
Then, two novel loss functions are proposed for the unsupervised learning of optical flow based on the geometric laws of non-occlusion.
no code implementations • 28 Jun 2021 • Guangming Wang, Honghao Zeng, Ziliang Wang, Zhe Liu, Hesheng Wang
Ablation studies demonstrate the effectiveness of the proposed inter-frame projection consistency constraints and intra-frame loop constraints.
Ranked #51 on 3D Human Pose Estimation on Human3.6M
1 code implementation • 1 Apr 2021 • Guangming Wang, Hesheng Wang, Yiling Liu, Weidong Chen
A new unsupervised learning method of depth and ego-motion using multiple masks from monocular video is proposed in this paper.
no code implementations • 20 Dec 2020 • Guangming Wang, Muyao Chen, Hanwen Liu, Yehui Yang, Zhe Liu, Hesheng Wang
Then, anchor-based 3D convolution is adopted to aggregate these anchors' features to the core points.
1 code implementation • CVPR 2021 • Guangming Wang, Xinrui Wu, Zhe Liu, Hesheng Wang
A novel 3D point cloud learning model for deep LiDAR odometry, named PWCLO-Net, using hierarchical embedding mask optimization is proposed in this paper.
no code implementations • 27 Nov 2020 • Guangming Wang, Yehui Yang, Huixin Zhang, Zhe Liu, Hesheng Wang
In this paper, a spherical interpolated convolution operator is proposed to replace the traditional grid-shaped 3D convolution operator.
no code implementations • 24 Nov 2020 • Guangming Wang, Minjian Xin, Wenhua Wu, Zhe Liu, Hesheng Wang
Deep Reinforcement Learning (DRL) enables robots to perform some intelligent tasks end-to-end.
no code implementations • 12 Oct 2020 • Guangming Wang, Xinrui Wu, Zhe Liu, Hesheng Wang
In this paper, a novel hierarchical neural network with double attention is proposed for learning the correlation of point features in adjacent frames and refining scene flow from coarse to fine layer by layer.
1 code implementation • arXiv 2020 • Guangming Wang, Chi Zhang, Hesheng Wang, Jingchuan Wang, Yong Wang, Xinlei Wang
In the occluded region, as depth and camera motion can provide more reliable motion estimation, they can be used to instruct unsupervised learning of optical flow.
no code implementations • 20 Jan 2020 • Tinghuai Wang, Guangming Wang, Kuan Eeik Tan, Donghui Tan
Specifically, we design an architecture to encode the multiple spectral contextual information in the form of spectral pyramid of multiple embedding spaces.