Search Results for author: Hesheng Wang

Found 44 papers, 22 papers with code

The RoboDepth Challenge: Methods and Advancements Towards Robust Depth Estimation

1 code implementation • 27 Jul 2023 • Lingdong Kong, Yaru Niu, Shaoyuan Xie, Hanjiang Hu, Lai Xing Ng, Benoit R. Cottereau, Ding Zhao, Liangjun Zhang, Hesheng Wang, Wei Tsang Ooi, Ruijie Zhu, Ziyang Song, Li Liu, Tianzhu Zhang, Jun Yu, Mohan Jing, Pengwei Li, Xiaohua Qi, Cheng Jin, Yingfeng Chen, Jie Hou, Jie Zhang, Zhen Kan, Qiang Ling, Liang Peng, Minglei Li, Di Xu, Changpeng Yang, Yuanqi Yao, Gang Wu, Jian Kuai, Xianming Liu, Junjun Jiang, Jiamian Huang, Baojun Li, Jiale Chen, Shuang Zhang, Sun Ao, Zhenyu Li, Runze Chen, Haiyong Luo, Fang Zhao, Jingze Yu

In this paper, we summarize the winning solutions from the RoboDepth Challenge -- an academic competition designed to facilitate and advance robust OoD depth estimation.

Depth Estimation Image Restoration +1

245

Paper
Code

Cam4DOcc: Benchmark for Camera-Only 4D Occupancy Forecasting in Autonomous Driving Applications

1 code implementation • 29 Nov 2023 • Junyi Ma, Xieyuanli Chen, Jiawei Huang, Jingyi Xu, Zhen Luo, Jintao Xu, Weihao Gu, Rui Ai, Hesheng Wang

Furthermore, the standardized evaluation protocol for preset multiple tasks is also provided to compare the performance of all the proposed baselines on present and future occupancy estimation with respect to objects of interest in autonomous driving scenarios.

Autonomous Driving

135

Paper
Code

LPD-Net: 3D Point Cloud Learning for Large-Scale Place Recognition and Environment Analysis

2 code implementations • ICCV 2019 • Zhe Liu, Shunbo Zhou, Chuanzhe Suo, Yingtian Liu, Peng Yin, Hesheng Wang, Yun-hui Liu

Point cloud based place recognition is still an open issue due to the difficulty in extracting local features from the raw 3D point cloud and generating the global descriptor, and it's even harder in the large-scale dynamic environments.

Ranked #5 on 3D Place Recognition on CS-Campus3D

3D Place Recognition Point Cloud Retrieval +1

Paper
Code

Unsupervised Learning of Depth, Optical Flow and Pose with Occlusion from 3D Geometry

1 code implementation • arXiv 2020 • Guangming Wang, Chi Zhang, Hesheng Wang, Jingchuan Wang, Yong Wang, Xinlei Wang

In the occluded region, as depth and camera motion can provide more reliable motion estimation, they can be used to instruct unsupervised learning of optical flow.

Autonomous Driving Depth And Camera Motion +3

Paper
Code

Unsupervised Learning of Monocular Depth and Ego-Motion Using Multiple Masks

1 code implementation • 1 Apr 2021 • Guangming Wang, Hesheng Wang, Yiling Liu, Weidong Chen

A new unsupervised learning method of depth and ego-motion using multiple masks from monocular video is proposed in this paper.

Depth Estimation Motion Estimation

Paper
Code

RegFormer: An Efficient Projection-Aware Transformer Network for Large-Scale Point Cloud Registration

1 code implementation • ICCV 2023 • Jiuming Liu, Guangming Wang, Zhe Liu, Chaokang Jiang, Marc Pollefeys, Hesheng Wang

Specifically, a projection-aware hierarchical transformer is proposed to capture long-range dependencies and filter outliers by extracting point features globally.

Point Cloud Registration

Paper
Code

Point Mamba: A Novel Point Cloud Backbone Based on State Space Model with Octree-Based Ordering Strategy

1 code implementation • 11 Mar 2024 • Jiuming Liu, Ruiji Yu, Yian Wang, Yu Zheng, Tianchen Deng, Weicai Ye, Hesheng Wang

In this paper, we propose a novel SSM-based point cloud processing backbone, named Point Mamba, with a causality-aware ordering mechanism.

Semantic Segmentation

Paper
Code

Efficient 3D Deep LiDAR Odometry

1 code implementation • 3 Nov 2021 • Guangming Wang, Xinrui Wu, Shuyang Jiang, Zhe Liu, Hesheng Wang

An efficient 3D point cloud learning architecture, named EfficientLO-Net, for LiDAR odometry is first proposed in this paper.

Pose Estimation

Paper
Code

PWCLO-Net: Deep LiDAR Odometry in 3D Point Clouds Using Hierarchical Embedding Mask Optimization

1 code implementation • CVPR 2021 • Guangming Wang, Xinrui Wu, Zhe Liu, Hesheng Wang

A novel 3D point cloud learning model for deep LiDAR odometry, named PWCLO-Net, using hierarchical embedding mask optimization is proposed in this paper.

Pose Estimation

Paper
Code

SeasonDepth: Cross-Season Monocular Depth Prediction Dataset and Benchmark under Multiple Environments

1 code implementation • 9 Nov 2020 • Hanjiang Hu, Baoquan Yang, Zhijian Qiao, Shiqi Liu, Jiacheng Zhu, Zuxin Liu, Wenhao Ding, Ding Zhao, Hesheng Wang

Different environments pose a great challenge to the outdoor robust visual perception for long-term autonomous driving, and the generalization of learning-based algorithms on different environments is still an open problem.

Autonomous Driving Depth Estimation +4

Paper
Code

What Matters for 3D Scene Flow Network

1 code implementation • 19 Jul 2022 • Guangming Wang, Yunzhe Hu, Zhe Liu, Yiyang Zhou, Masayoshi Tomizuka, Wei Zhan, Hesheng Wang

Our proposed model surpasses all existing methods by at least 38. 2% on FlyingThings3D dataset and 24. 7% on KITTI Scene Flow dataset for EPE3D metric.

Scene Flow Estimation

Paper
Code

Retrieval-based Localization Based on Domain-invariant Feature Learning under Changing Environments

1 code implementation • 23 Sep 2019 • Hanjiang Hu, Hesheng Wang, Zhe Liu, Chenguang Yang, Weidong Chen, Le Xie

To retrieve a target image from the database, the query image is first encoded using the encoder belonging to the query domain to obtain a domain-invariant feature vector.

Autonomous Driving Retrieval +2

Paper
Code

Domain-invariant Similarity Activation Map Contrastive Learning for Retrieval-based Long-term Visual Localization

1 code implementation • 16 Sep 2020 • Hanjiang Hu, Hesheng Wang, Zhe Liu, Weidong Chen

Visual localization is a crucial component in the application of mobile robot and autonomous driving.

Autonomous Driving Contrastive Learning +5

Paper
Code

3DSFLabelling: Boosting 3D Scene Flow Estimation by Pseudo Auto-labelling

1 code implementation • 28 Feb 2024 • Chaokang Jiang, Guangming Wang, Jiuming Liu, Hesheng Wang, Zhuang Ma, Zhenqiang Liu, Zhujin Liang, Yi Shan, Dalong Du

We present a novel approach from the perspective of auto-labelling, aiming to generate a large number of 3D scene flow pseudo labels for real-world LiDAR point clouds.

Autonomous Driving Data Augmentation +1

Paper
Code

DASGIL: Domain Adaptation for Semantic and Geometric-aware Image-based Localization

1 code implementation • 1 Oct 2020 • Hanjiang Hu, Zhijian Qiao, Ming Cheng, Zhe Liu, Hesheng Wang

Long-Term visual localization under changing environments is a challenging problem in autonomous driving and mobile robotics due to season, illumination variance, etc.

Autonomous Driving Domain Adaptation +5

Paper
Code

DifFlow3D: Toward Robust Uncertainty-Aware Scene Flow Estimation with Iterative Diffusion-Based Refinement

1 code implementation • 29 Nov 2023 • Jiuming Liu, Guangming Wang, Weicai Ye, Chaokang Jiang, Jinru Han, Zhe Liu, Guofeng Zhang, Dalong Du, Hesheng Wang

Furthermore, we also develop an uncertainty estimation module within diffusion to evaluate the reliability of estimated scene flow.

Scene Flow Estimation

Paper
Code

End-to-End 3D Point Cloud Learning for Registration Task Using Virtual Correspondences

1 code implementation • 30 Nov 2020 • Zhijian Qiao, Huanshu Wei, Zhe Liu, Chuanzhe Suo, Hesheng Wang

3D Point cloud registration is still a very challenging topic due to the difficulty in finding the rigid transformation between two point clouds with partial correspondences, and it's even harder in the absence of any initial estimation information.

Point Cloud Registration

Paper
Code

RLSAC: Reinforcement Learning enhanced Sample Consensus for End-to-End Robust Estimation

1 code implementation • ICCV 2023 • Chang Nie, Guangming Wang, Zhe Liu, Luca Cavalli, Marc Pollefeys, Hesheng Wang

Therefore, RLSAC can avoid differentiating to learn the features and the feedback of downstream tasks for end-to-end robust estimation.

reinforcement-learning

Paper
Code

A Registration-aided Domain Adaptation Network for 3D Point Cloud Based Place Recognition

1 code implementation • 9 Dec 2020 • Zhijian Qiao, Hanjiang Hu, Weiang Shi, Siyuan Chen, Zhe Liu, Hesheng Wang

In the field of large-scale SLAM for autonomous driving and mobile robotics, 3D point cloud based place recognition has aroused significant research interest due to its robustness to changing environments with drastic daytime and weather variance.

3D Place Recognition Autonomous Driving +2

Paper
Code

DELFlow: Dense Efficient Learning of Scene Flow for Large-Scale Point Clouds

1 code implementation • ICCV 2023 • Chensheng Peng, Guangming Wang, Xian Wan Lo, Xinrui Wu, Chenfeng Xu, Masayoshi Tomizuka, Wei Zhan, Hesheng Wang

Previous methods rarely predict scene flow from the entire point clouds of the scene with one-time inference due to the memory inefficiency and heavy overhead from distance calculation and sorting involved in commonly used farthest point sampling, KNN, and ball query algorithms for local feature aggregation.

Scene Flow Estimation

Paper
Code

Unsupervised Learning of 3D Scene Flow from Monocular Camera

1 code implementation • 8 Jun 2022 • Guangming Wang, Xiaoyu Tian, Ruiqi Ding, Hesheng Wang

Unsupervised learning of scene flow in this paper mainly consists of two parts: (i) depth estimation and camera pose estimation, and (ii) scene flow estimation based on four different loss functions.

Depth Estimation Optical Flow Estimation +2

Paper
Code

NeSLAM: Neural Implicit Mapping and Self-Supervised Feature Tracking With Depth Completion and Denoising

1 code implementation • 29 Mar 2024 • Tianchen Deng, Yanbo Wang, Hongle Xie, Hesheng Wang, Jingchuan Wang, Danwei Wang, Weidong Chen

Second, the occupancy scene representation is replaced with Signed Distance Field (SDF) hierarchical scene representation for high-quality reconstruction and view synthesis.

3D Reconstruction Denoising +3

Paper
Code

SeqLPD: Sequence Matching Enhanced Loop-Closure Detection Based on Large-Scale Point Cloud Description for Self-Driving Vehicles

no code implementations • 30 Apr 2019 • Zhe Liu, Chuanzhe Suo, Shunbo Zhou, Huanshu Wei, Yingtian Liu, Hesheng Wang, Yun-hui Liu

Place recognition and loop-closure detection are main challenges in the localization, mapping and navigation tasks of self-driving vehicles.

Loop Closure Detection

Paper
Add Code

Learning Actions from Human Demonstration Video for Robotic Manipulation

no code implementations • 10 Sep 2019 • Shuo Yang, Wei zhang, Weizhi Lu, Hesheng Wang, Yibin Li

However, the general video captioning methods focus more on the understanding of the full frame, lacking of consideration on the specific object of interests in robotic manipulations.

Video Captioning

Paper
Add Code

Hierarchical Attention Learning of Scene Flow in 3D Point Clouds

no code implementations • 12 Oct 2020 • Guangming Wang, Xinrui Wu, Zhe Liu, Hesheng Wang

In this paper, a novel hierarchical neural network with double attention is proposed for learning the correlation of point features in adjacent frames and refining scene flow from coarse to fine layer by layer.

Autonomous Driving Optical Flow Estimation +1

Paper
Add Code

Learning of Long-Horizon Sparse-Reward Robotic Manipulator Tasks with Base Controllers

no code implementations • 24 Nov 2020 • Guangming Wang, Minjian Xin, Wenhua Wu, Zhe Liu, Hesheng Wang

Deep Reinforcement Learning (DRL) enables robots to perform some intelligent tasks end-to-end.

Robot Manipulation

Paper
Add Code

Spherical Interpolated Convolutional Network with Distance-Feature Density for 3D Semantic Segmentation of Point Clouds

no code implementations • 27 Nov 2020 • Guangming Wang, Yehui Yang, Huixin Zhang, Zhe Liu, Hesheng Wang

In this paper, a spherical interpolated convolution operator is proposed to replace the traditional grid-shaped 3D convolution operator.

3D Semantic Segmentation Feature Correlation

Paper
Add Code

Anchor-Based Spatio-Temporal Attention 3D Convolutional Networks for Dynamic 3D Point Cloud Sequences

no code implementations • 20 Dec 2020 • Guangming Wang, Muyao Chen, Hanwen Liu, Yehui Yang, Zhe Liu, Hesheng Wang

Then, anchor-based 3D convolution is adopted to aggregate these anchors' features to the core points.

Action Recognition Autonomous Driving +1

Paper
Add Code

Motion Projection Consistency Based 3D Human Pose Estimation with Virtual Bones from Monocular Videos

no code implementations • 28 Jun 2021 • Guangming Wang, Honghao Zeng, Ziliang Wang, Zhe Liu, Hesheng Wang

Ablation studies demonstrate the effectiveness of the proposed inter-frame projection consistency constraints and intra-frame loop constraints.

Ranked #45 on 3D Human Pose Estimation on Human3.6M

3D Human Pose Estimation

Paper
Add Code

NccFlow: Unsupervised Learning of Optical Flow With Non-occlusion from Geometry

no code implementations • 8 Jul 2021 • Guangming Wang, Shuaiqi Ren, Hesheng Wang

Then, two novel loss functions are proposed for the unsupervised learning of optical flow based on the geometric laws of non-occlusion.

Autonomous Driving Blocking +1

Paper
Add Code

Residual 3D Scene Flow Learning with Context-Aware Feature Extraction

no code implementations • 10 Sep 2021 • Guangming Wang, Yunzhe Hu, Xinrui Wu, Hesheng Wang

To solve the first problem, a novel context-aware set convolution layer is proposed in this paper to exploit contextual structure information of Euclidean space and learn soft aggregation weights for local point features.

Autonomous Driving Scene Flow Estimation +1

Paper
Add Code

3D Hierarchical Refinement and Augmentation for Unsupervised Learning of Depth and Pose from Monocular Video

no code implementations • 6 Dec 2021 • Guangming Wang, Jiquan Zhong, Shijie Zhao, Wenhua Wu, Zhe Liu, Hesheng Wang

In this framework, the depth and pose estimations are hierarchically and mutually coupled to refine the estimated pose layer by layer.

Autonomous Driving Depth Estimation +3

Paper
Add Code

DetFlowTrack: 3D Multi-object Tracking based on Simultaneous Optimization of Object Detection and Scene Flow Estimation

no code implementations • 4 Mar 2022 • Yueling Shen, Guangming Wang, Hesheng Wang

we proposed a 3D MOT framework based on simultaneous optimization of object detection and scene flow estimation.

3D Multi-Object Tracking Object +3

Paper
Add Code

Interactive Multi-scale Fusion of 2D and 3D Features for Multi-object Tracking

no code implementations • 30 Mar 2022 • Guangming Wang, Chensheng Peng, Jinpeng Zhang, Hesheng Wang

Specifically, through multi-scale interactive query and fusion between pixel-level and point-level features, our method, can obtain more distinguishing features to improve the performance of multiple object tracking.

Autonomous Driving Multi-Object Tracking +1

Paper
Add Code

Pseudo-LiDAR for Visual Odometry

no code implementations • 4 Sep 2022 • Huiying Deng, Guangming Wang, Zhiheng Feng, Chaokang Jiang, Xinrui Wu, Yanzi Miao, Hesheng Wang

In order to make full use of the rich point cloud information provided by the pseudo-LiDAR, a projection-aware dense odometry pipeline is adopted.

Stereo Matching Visual Odometry

Paper
Add Code

Unsupervised Learning of 3D Scene Flow with 3D Odometry Assistance

no code implementations • 11 Sep 2022 • Guangming Wang, Zhiheng Feng, Chaokang Jiang, Hesheng Wang

Unlike the previous unsupervised learning of scene flow in point clouds, we propose to use odometry information to assist the unsupervised learning of scene flow and use real-world LiDAR data to train our network.

Activity Recognition Autonomous Driving +1

Paper
Add Code

FFPA-Net: Efficient Feature Fusion with Projection Awareness for 3D Object Detection

no code implementations • 15 Sep 2022 • Chaokang Jiang, Guangming Wang, Jinxing Wu, Yanzi Miao, Hesheng Wang

Promising complementarity exists between the texture features of color images and the geometric information of LiDAR point clouds.

3D Object Detection object-detection

Paper
Add Code

3D Scene Flow Estimation on Pseudo-LiDAR: Bridging the Gap on Estimating Point Motion

no code implementations • 27 Sep 2022 • Chaokang Jiang, Guangming Wang, Yanzi Miao, Hesheng Wang

The proposed method of self-supervised learning of 3D scene flow on real-world images is compared with a variety of methods for learning on the synthesized dataset and learning on LiDAR point clouds.

Optical Flow Estimation Scene Flow Estimation +1

Paper
Add Code

End-to-end 2D-3D Registration between Image and LiDAR Point Cloud for Vehicle Localization

no code implementations • 20 Jun 2023 • Guangming Wang, Yu Zheng, Yanfeng Guo, Zhe Liu, Yixiang Zhu, Wolfram Burgard, Hesheng Wang

A popular approach to robot localization is based on image-to-point cloud registration, which combines illumination-invariant LiDAR-based mapping with economical image-based localization.

Image-Based Localization Image to Point Cloud Registration

Paper
Add Code

Spherical Frustum Sparse Convolution Network for LiDAR Point Cloud Semantic Segmentation

no code implementations • 29 Nov 2023 • Yu Zheng, Guangming Wang, Jiuming Liu, Marc Pollefeys, Hesheng Wang

Through the hash-based representation, we propose the Spherical Frustum sparse Convolution (SFC) and Frustum Fast Point Sampling (F2PS) to convolve and sample the points stored in spherical frustums respectively.

Position Segmentation +1

Paper
Add Code

SemGauss-SLAM: Dense Semantic Gaussian Splatting SLAM

no code implementations • 12 Mar 2024 • Siting Zhu, Renjie Qin, Guangming Wang, Jiuming Liu, Hesheng Wang

We propose SemGauss-SLAM, the first semantic SLAM system utilizing 3D Gaussian representation, that enables accurate 3D semantic mapping, robust camera tracking, and high-quality rendering in real-time.

Semantic SLAM

Paper
Add Code

DVN-SLAM: Dynamic Visual Neural SLAM Based on Local-Global Encoding

no code implementations • 18 Mar 2024 • Wenhua Wu, Guangming Wang, Ting Deng, Sebastian Aegidius, Stuart Shanks, Valerio Modugno, Dimitrios Kanoulas, Hesheng Wang

Recent research on Simultaneous Localization and Mapping (SLAM) based on implicit representation has shown promising results in indoor environments.

Simultaneous Localization and Mapping

Paper
Add Code

EMIE-MAP: Large-Scale Road Surface Reconstruction Based on Explicit Mesh and Implicit Encoding

no code implementations • 18 Mar 2024 • Wenhua Wu, Qi Wang, Guangming Wang, JunPing Wang, Tiankun Zhao, Yang Liu, Dongchao Gao, Zhe Liu, Hesheng Wang

To address this, we propose EMIE-MAP, a novel method for large-scale road surface reconstruction based on explicit mesh and implicit encoding.

Autonomous Driving Surface Reconstruction

Paper
Add Code

DVLO: Deep Visual-LiDAR Odometry with Local-to-Global Feature Fusion and Bi-Directional Structure Alignment

no code implementations • 27 Mar 2024 • Jiuming Liu, Dong Zhuo, Zhiheng Feng, Siting Zhu, Chensheng Peng, Zhe Liu, Hesheng Wang

Image pixels are pre-organized as pseudo points for image-to-point structure alignment.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.