Search Results for author: Hesheng Wang

Found 44 papers, 22 papers with code

Cam4DOcc: Benchmark for Camera-Only 4D Occupancy Forecasting in Autonomous Driving Applications

1 code implementation29 Nov 2023 Junyi Ma, Xieyuanli Chen, Jiawei Huang, Jingyi Xu, Zhen Luo, Jintao Xu, Weihao Gu, Rui Ai, Hesheng Wang

Furthermore, the standardized evaluation protocol for preset multiple tasks is also provided to compare the performance of all the proposed baselines on present and future occupancy estimation with respect to objects of interest in autonomous driving scenarios.

Autonomous Driving

LPD-Net: 3D Point Cloud Learning for Large-Scale Place Recognition and Environment Analysis

2 code implementations ICCV 2019 Zhe Liu, Shunbo Zhou, Chuanzhe Suo, Yingtian Liu, Peng Yin, Hesheng Wang, Yun-hui Liu

Point cloud based place recognition is still an open issue due to the difficulty in extracting local features from the raw 3D point cloud and generating the global descriptor, and it's even harder in the large-scale dynamic environments.

3D Place Recognition Point Cloud Retrieval +1

Unsupervised Learning of Depth, Optical Flow and Pose with Occlusion from 3D Geometry

1 code implementation arXiv 2020 Guangming Wang, Chi Zhang, Hesheng Wang, Jingchuan Wang, Yong Wang, Xinlei Wang

In the occluded region, as depth and camera motion can provide more reliable motion estimation, they can be used to instruct unsupervised learning of optical flow.

Autonomous Driving Depth And Camera Motion +3

Unsupervised Learning of Monocular Depth and Ego-Motion Using Multiple Masks

1 code implementation1 Apr 2021 Guangming Wang, Hesheng Wang, Yiling Liu, Weidong Chen

A new unsupervised learning method of depth and ego-motion using multiple masks from monocular video is proposed in this paper.

Depth Estimation Motion Estimation

RegFormer: An Efficient Projection-Aware Transformer Network for Large-Scale Point Cloud Registration

1 code implementation ICCV 2023 Jiuming Liu, Guangming Wang, Zhe Liu, Chaokang Jiang, Marc Pollefeys, Hesheng Wang

Specifically, a projection-aware hierarchical transformer is proposed to capture long-range dependencies and filter outliers by extracting point features globally.

Point Cloud Registration

Point Mamba: A Novel Point Cloud Backbone Based on State Space Model with Octree-Based Ordering Strategy

1 code implementation11 Mar 2024 Jiuming Liu, Ruiji Yu, Yian Wang, Yu Zheng, Tianchen Deng, Weicai Ye, Hesheng Wang

In this paper, we propose a novel SSM-based point cloud processing backbone, named Point Mamba, with a causality-aware ordering mechanism.

Semantic Segmentation

Efficient 3D Deep LiDAR Odometry

1 code implementation3 Nov 2021 Guangming Wang, Xinrui Wu, Shuyang Jiang, Zhe Liu, Hesheng Wang

An efficient 3D point cloud learning architecture, named EfficientLO-Net, for LiDAR odometry is first proposed in this paper.

Pose Estimation

PWCLO-Net: Deep LiDAR Odometry in 3D Point Clouds Using Hierarchical Embedding Mask Optimization

1 code implementation CVPR 2021 Guangming Wang, Xinrui Wu, Zhe Liu, Hesheng Wang

A novel 3D point cloud learning model for deep LiDAR odometry, named PWCLO-Net, using hierarchical embedding mask optimization is proposed in this paper.

Pose Estimation

SeasonDepth: Cross-Season Monocular Depth Prediction Dataset and Benchmark under Multiple Environments

1 code implementation9 Nov 2020 Hanjiang Hu, Baoquan Yang, Zhijian Qiao, Shiqi Liu, Jiacheng Zhu, Zuxin Liu, Wenhao Ding, Ding Zhao, Hesheng Wang

Different environments pose a great challenge to the outdoor robust visual perception for long-term autonomous driving, and the generalization of learning-based algorithms on different environments is still an open problem.

Autonomous Driving Depth Estimation +4

What Matters for 3D Scene Flow Network

1 code implementation19 Jul 2022 Guangming Wang, Yunzhe Hu, Zhe Liu, Yiyang Zhou, Masayoshi Tomizuka, Wei Zhan, Hesheng Wang

Our proposed model surpasses all existing methods by at least 38. 2% on FlyingThings3D dataset and 24. 7% on KITTI Scene Flow dataset for EPE3D metric.

Scene Flow Estimation

Retrieval-based Localization Based on Domain-invariant Feature Learning under Changing Environments

1 code implementation23 Sep 2019 Hanjiang Hu, Hesheng Wang, Zhe Liu, Chenguang Yang, Weidong Chen, Le Xie

To retrieve a target image from the database, the query image is first encoded using the encoder belonging to the query domain to obtain a domain-invariant feature vector.

Autonomous Driving Retrieval +2

3DSFLabelling: Boosting 3D Scene Flow Estimation by Pseudo Auto-labelling

1 code implementation28 Feb 2024 Chaokang Jiang, Guangming Wang, Jiuming Liu, Hesheng Wang, Zhuang Ma, Zhenqiang Liu, Zhujin Liang, Yi Shan, Dalong Du

We present a novel approach from the perspective of auto-labelling, aiming to generate a large number of 3D scene flow pseudo labels for real-world LiDAR point clouds.

Autonomous Driving Data Augmentation +1

DASGIL: Domain Adaptation for Semantic and Geometric-aware Image-based Localization

1 code implementation1 Oct 2020 Hanjiang Hu, Zhijian Qiao, Ming Cheng, Zhe Liu, Hesheng Wang

Long-Term visual localization under changing environments is a challenging problem in autonomous driving and mobile robotics due to season, illumination variance, etc.

Autonomous Driving Domain Adaptation +5

End-to-End 3D Point Cloud Learning for Registration Task Using Virtual Correspondences

1 code implementation30 Nov 2020 Zhijian Qiao, Huanshu Wei, Zhe Liu, Chuanzhe Suo, Hesheng Wang

3D Point cloud registration is still a very challenging topic due to the difficulty in finding the rigid transformation between two point clouds with partial correspondences, and it's even harder in the absence of any initial estimation information.

Point Cloud Registration

RLSAC: Reinforcement Learning enhanced Sample Consensus for End-to-End Robust Estimation

1 code implementation ICCV 2023 Chang Nie, Guangming Wang, Zhe Liu, Luca Cavalli, Marc Pollefeys, Hesheng Wang

Therefore, RLSAC can avoid differentiating to learn the features and the feedback of downstream tasks for end-to-end robust estimation.

reinforcement-learning

A Registration-aided Domain Adaptation Network for 3D Point Cloud Based Place Recognition

1 code implementation9 Dec 2020 Zhijian Qiao, Hanjiang Hu, Weiang Shi, Siyuan Chen, Zhe Liu, Hesheng Wang

In the field of large-scale SLAM for autonomous driving and mobile robotics, 3D point cloud based place recognition has aroused significant research interest due to its robustness to changing environments with drastic daytime and weather variance.

3D Place Recognition Autonomous Driving +2

DELFlow: Dense Efficient Learning of Scene Flow for Large-Scale Point Clouds

1 code implementation ICCV 2023 Chensheng Peng, Guangming Wang, Xian Wan Lo, Xinrui Wu, Chenfeng Xu, Masayoshi Tomizuka, Wei Zhan, Hesheng Wang

Previous methods rarely predict scene flow from the entire point clouds of the scene with one-time inference due to the memory inefficiency and heavy overhead from distance calculation and sorting involved in commonly used farthest point sampling, KNN, and ball query algorithms for local feature aggregation.

Scene Flow Estimation

Unsupervised Learning of 3D Scene Flow from Monocular Camera

1 code implementation8 Jun 2022 Guangming Wang, Xiaoyu Tian, Ruiqi Ding, Hesheng Wang

Unsupervised learning of scene flow in this paper mainly consists of two parts: (i) depth estimation and camera pose estimation, and (ii) scene flow estimation based on four different loss functions.

Depth Estimation Optical Flow Estimation +2

NeSLAM: Neural Implicit Mapping and Self-Supervised Feature Tracking With Depth Completion and Denoising

1 code implementation29 Mar 2024 Tianchen Deng, Yanbo Wang, Hongle Xie, Hesheng Wang, Jingchuan Wang, Danwei Wang, Weidong Chen

Second, the occupancy scene representation is replaced with Signed Distance Field (SDF) hierarchical scene representation for high-quality reconstruction and view synthesis.

3D Reconstruction Denoising +3

Learning Actions from Human Demonstration Video for Robotic Manipulation

no code implementations10 Sep 2019 Shuo Yang, Wei zhang, Weizhi Lu, Hesheng Wang, Yibin Li

However, the general video captioning methods focus more on the understanding of the full frame, lacking of consideration on the specific object of interests in robotic manipulations.

Video Captioning

Hierarchical Attention Learning of Scene Flow in 3D Point Clouds

no code implementations12 Oct 2020 Guangming Wang, Xinrui Wu, Zhe Liu, Hesheng Wang

In this paper, a novel hierarchical neural network with double attention is proposed for learning the correlation of point features in adjacent frames and refining scene flow from coarse to fine layer by layer.

Autonomous Driving Optical Flow Estimation +1

Motion Projection Consistency Based 3D Human Pose Estimation with Virtual Bones from Monocular Videos

no code implementations28 Jun 2021 Guangming Wang, Honghao Zeng, Ziliang Wang, Zhe Liu, Hesheng Wang

Ablation studies demonstrate the effectiveness of the proposed inter-frame projection consistency constraints and intra-frame loop constraints.

3D Human Pose Estimation

NccFlow: Unsupervised Learning of Optical Flow With Non-occlusion from Geometry

no code implementations8 Jul 2021 Guangming Wang, Shuaiqi Ren, Hesheng Wang

Then, two novel loss functions are proposed for the unsupervised learning of optical flow based on the geometric laws of non-occlusion.

Autonomous Driving Blocking +1

Residual 3D Scene Flow Learning with Context-Aware Feature Extraction

no code implementations10 Sep 2021 Guangming Wang, Yunzhe Hu, Xinrui Wu, Hesheng Wang

To solve the first problem, a novel context-aware set convolution layer is proposed in this paper to exploit contextual structure information of Euclidean space and learn soft aggregation weights for local point features.

Autonomous Driving Scene Flow Estimation +1

Interactive Multi-scale Fusion of 2D and 3D Features for Multi-object Tracking

no code implementations30 Mar 2022 Guangming Wang, Chensheng Peng, Jinpeng Zhang, Hesheng Wang

Specifically, through multi-scale interactive query and fusion between pixel-level and point-level features, our method, can obtain more distinguishing features to improve the performance of multiple object tracking.

Autonomous Driving Multi-Object Tracking +1

Pseudo-LiDAR for Visual Odometry

no code implementations4 Sep 2022 Huiying Deng, Guangming Wang, Zhiheng Feng, Chaokang Jiang, Xinrui Wu, Yanzi Miao, Hesheng Wang

In order to make full use of the rich point cloud information provided by the pseudo-LiDAR, a projection-aware dense odometry pipeline is adopted.

Stereo Matching Visual Odometry

Unsupervised Learning of 3D Scene Flow with 3D Odometry Assistance

no code implementations11 Sep 2022 Guangming Wang, Zhiheng Feng, Chaokang Jiang, Hesheng Wang

Unlike the previous unsupervised learning of scene flow in point clouds, we propose to use odometry information to assist the unsupervised learning of scene flow and use real-world LiDAR data to train our network.

Activity Recognition Autonomous Driving +1

FFPA-Net: Efficient Feature Fusion with Projection Awareness for 3D Object Detection

no code implementations15 Sep 2022 Chaokang Jiang, Guangming Wang, Jinxing Wu, Yanzi Miao, Hesheng Wang

Promising complementarity exists between the texture features of color images and the geometric information of LiDAR point clouds.

3D Object Detection object-detection

3D Scene Flow Estimation on Pseudo-LiDAR: Bridging the Gap on Estimating Point Motion

no code implementations27 Sep 2022 Chaokang Jiang, Guangming Wang, Yanzi Miao, Hesheng Wang

The proposed method of self-supervised learning of 3D scene flow on real-world images is compared with a variety of methods for learning on the synthesized dataset and learning on LiDAR point clouds.

Optical Flow Estimation Scene Flow Estimation +1

End-to-end 2D-3D Registration between Image and LiDAR Point Cloud for Vehicle Localization

no code implementations20 Jun 2023 Guangming Wang, Yu Zheng, Yanfeng Guo, Zhe Liu, Yixiang Zhu, Wolfram Burgard, Hesheng Wang

A popular approach to robot localization is based on image-to-point cloud registration, which combines illumination-invariant LiDAR-based mapping with economical image-based localization.

Image-Based Localization Image to Point Cloud Registration

Spherical Frustum Sparse Convolution Network for LiDAR Point Cloud Semantic Segmentation

no code implementations29 Nov 2023 Yu Zheng, Guangming Wang, Jiuming Liu, Marc Pollefeys, Hesheng Wang

Through the hash-based representation, we propose the Spherical Frustum sparse Convolution (SFC) and Frustum Fast Point Sampling (F2PS) to convolve and sample the points stored in spherical frustums respectively.

Position Segmentation +1

SemGauss-SLAM: Dense Semantic Gaussian Splatting SLAM

no code implementations12 Mar 2024 Siting Zhu, Renjie Qin, Guangming Wang, Jiuming Liu, Hesheng Wang

We propose SemGauss-SLAM, the first semantic SLAM system utilizing 3D Gaussian representation, that enables accurate 3D semantic mapping, robust camera tracking, and high-quality rendering in real-time.

Semantic SLAM

DVN-SLAM: Dynamic Visual Neural SLAM Based on Local-Global Encoding

no code implementations18 Mar 2024 Wenhua Wu, Guangming Wang, Ting Deng, Sebastian Aegidius, Stuart Shanks, Valerio Modugno, Dimitrios Kanoulas, Hesheng Wang

Recent research on Simultaneous Localization and Mapping (SLAM) based on implicit representation has shown promising results in indoor environments.

Simultaneous Localization and Mapping

Cannot find the paper you are looking for? You can Submit a new open access paper.