Search Results for author: Zhuoling Li

Found 12 papers, 3 papers with code

UniMODE: Unified Monocular 3D Object Detection

no code implementations • 28 Feb 2024 • Zhuoling Li, Xiaogang Xu, SerNam Lim, Hengshuang Zhao

To address these challenges, we build a detector based on the bird's-eye-view (BEV) detection paradigm, where the explicit feature projection is beneficial to addressing the geometry learning ambiguity when employing multiple scenarios of data to train detectors.

Monocular 3D Object Detection Object +2

Paper
Add Code

GroupLane: End-to-End 3D Lane Detection with Channel-wise Grouping

no code implementations • 18 Jul 2023 • Zhuoling Li, Chunrui Han, Zheng Ge, Jinrong Yang, En Yu, Haoqian Wang, Hengshuang Zhao, Xiangyu Zhang

Besides, GroupLane with ResNet18 still surpasses PersFormer by 4. 9% F1 score, while the inference speed is nearly 7x faster and the FLOPs is only 13. 3% of it.

3D Lane Detection

Paper
Add Code

The 1st-place Solution for CVPR 2023 OpenLane Topology in Autonomous Driving Challenge

1 code implementation • 16 Jun 2023 • Dongming Wu, Fan Jia, Jiahao Chang, Zhuoling Li, Jianjian Sun, Chunrui Han, Shuailin Li, Yingfei Liu, Zheng Ge, Tiancai Wang

We present the 1st-place solution of OpenLane Topology in Autonomous Driving Challenge.

Autonomous Driving

118

Paper
Code

MOTRv3: Release-Fetch Supervision for End-to-End Multi-Object Tracking

no code implementations • 23 May 2023 • En Yu, Tiancai Wang, Zhuoling Li, Yuang Zhang, Xiangyu Zhang, Wenbing Tao

Although end-to-end multi-object trackers like MOTR enjoy the merits of simplicity, they suffer from the conflict between detection and association seriously, resulting in unsatisfactory convergence dynamics.

Denoising Multi-Object Tracking +1

Paper
Add Code

VoxelFormer: Bird's-Eye-View Feature Generation based on Dual-view Attention for Multi-view 3D Object Detection

1 code implementation • 3 Apr 2023 • Zhuoling Li, Chuanrui Zhang, Wei-Chiu Ma, Yipin Zhou, Linyan Huang, Haoqian Wang, SerNam Lim, Hengshuang Zhao

In recent years, transformer-based detectors have demonstrated remarkable performance in 2D visual perception tasks.

3D Object Detection object-detection

Paper
Code

Generalizing Multiple Object Tracking to Unseen Domains by Introducing Natural Language Representation

no code implementations • 3 Dec 2022 • En Yu, Songtao Liu, Zhuoling Li, Jinrong Yang, Zeming Li, Shoudong Han, Wenbing Tao

VLM joints the information in the generated visual prompts and the textual prompts from a pre-defined Trackbook to obtain instance-level pseudo textual description, which is domain invariant to different tracking scenes.

Domain Generalization Multi-Object Tracking +1

Paper
Add Code

Delving into the Pre-training Paradigm of Monocular 3D Object Detection

no code implementations • 8 Jun 2022 • Zhuoling Li, Chuanrui Zhang, En Yu, Haoqian Wang

(2) Combining depth estimation and 2D object detection is a promising M3OD pre-training baseline.

Depth Estimation Monocular 3D Object Detection +3

Paper
Add Code

Diversity Matters: Fully Exploiting Depth Clues for Reliable Monocular 3D Object Detection

no code implementations • CVPR 2022 • Zhuoling Li, Zhan Qu, Yang Zhou, Jianzhuang Liu, Haoqian Wang, Lihui Jiang

To tackle this problem, we propose a depth solving system that fully explores the visual clues from the subtasks in M3OD and generates multiple estimations for the depth of each target.

Depth Estimation Monocular 3D Object Detection +2