no code implementations • 14 Jan 2025 • Ke wu, ZiCheng Zhang, Muer Tie, Ziqing Ai, Zhongxue Gan, Wenchao Ding
The framework comprises four main components: VIO Front End, 2D Gaussian Map, NVS Loop Closure, and Dynamic Eraser.
no code implementations • 28 Oct 2024 • Kunyun Wang, Jieru Zhao, Shuo Yang, Wenchao Ding, Minyi Guo
To address these issues, we propose a memory-efficient scheduling method to eliminate memory overhead and an online adjustment mechanism to minimize accuracy degradation.
no code implementations • 11 Sep 2024 • Zhenyu Ning, Jieru Zhao, Qihao Jin, Wenchao Ding, Minyi Guo
In this paper, we introduce Inf-MLLM, an efficient inference framework for MLLMs, which enable streaming inference of MLLM on a single GPU with infinite context.
no code implementations • 5 Sep 2024 • Julong Wei, Shanshuai Yuan, Pengfei Li, Qingda Hu, Zhongxue Gan, Wenchao Ding
Then, we build a unified multi-modal vocabulary for vision, language and action.
no code implementations • 21 Jul 2024 • Mingzhe Gao, Jieru Zhao, Zhe Lin, Wenchao Ding, Xiaofeng Hou, Yu Feng, Chao Li, Minyi Guo
Recently, the use of large language models (LLMs) for software code generation, e. g., C/C++ and Python, has proven a great success.
no code implementations • 10 Apr 2024 • Muer Tie, Julong Wei, Zhengjun Wang, Ke wu, Shansuai Yuan, Kaizhao Zhang, Jie Jia, Jieru Zhao, Zhongxue Gan, Wenchao Ding
Online construction of open-ended language scenes is crucial for robotic applications, where open-vocabulary interactive scene understanding is required.
no code implementations • 29 Mar 2024 • Ke wu, Kaizhao Zhang, Zhiwei Zhang, Shanshuai Yuan, Muer Tie, Julong Wei, Zijun Xu, Jieru Zhao, Zhongxue Gan, Wenchao Ding
However, integrating 3DGS into a street-view dense mapping framework still faces two challenges, including incomplete reconstruction due to the absence of geometric information beyond the LiDAR coverage area and extensive computation for reconstruction in large urban scenes.
no code implementations • 5 Dec 2023 • Xiaze Zhang, Ziheng Ding, Qi Jing, Yuejie Zhang, Wenchao Ding, Rui Feng
Point clouds have shown significant potential in various domains, including Simultaneous Localization and Mapping (SLAM).
1 code implementation • 20 Oct 2023 • Yijie Zhou, Likun Cai, Xianhui Cheng, Zhongxue Gan, xiangyang xue, Wenchao Ding
In the era of big data and large models, automatic annotating functions for multi-modal data are of great significance for real-world AI-driven applications, such as autonomous driving and embodied AI.
no code implementations • 23 Jul 2023 • Guan Shen, Jieru Zhao, Zeke Wang, Zhe Lin, Wenchao Ding, Chentao Wu, Quan Chen, Minyi Guo
Along with the fast evolution of deep neural networks, the hardware system is also developing rapidly.
1 code implementation • 21 Jun 2023 • Xu Zhao, Wenchao Ding, Yongqi An, Yinglong Du, Tao Yu, Min Li, Ming Tang, Jinqiao Wang
In this paper, we propose a speed-up alternative method for this fundamental task with comparable performance.
Ranked #4 on
Zero-Shot Instance Segmentation
on LVIS v1.0 val
no code implementations • 2 May 2023 • Wenchao Ding, Jieru Zhao, Yubin Chu, Haihui Huang, Tong Qin, Chunjing Xu, Yuxiang Guan, Zhongxue Gan
However, how to cognize the ``road'' for automated vehicles where there is no well-defined ``roads'' remains an open problem.
no code implementations • 6 Mar 2021 • Haoran Song, Di Luan, Wenchao Ding, Michael Yu Wang, Qifeng Chen
Predicting the future trajectories of on-road vehicles is critical for autonomous driving.
no code implementations • 5 Jun 2020 • Jieru Zhao, Tingyuan Liang, Liang Feng, Wenchao Ding, Sharad Sinha, Wei zhang, Shaojie Shen
To reduce the design effort and achieve the right balance, we propose FP-Stereo for building high-performance stereo matching pipelines on FPGAs automatically.
1 code implementation • ECCV 2020 • Haoran Song, Wenchao Ding, Yuxuan Chen, Shaojie Shen, Michael Yu Wang, Qifeng Chen
Moreover, our approach enables a novel pipeline which couples the prediction and planning, by conditioning PiP on multiple candidate trajectories of the ego vehicle, which is highly beneficial for autonomous driving in interactive scenarios.
2 code implementations • 5 Mar 2020 • Lu Zhang, Wenchao Ding, Jing Chen, Shaojie Shen
Decision-making in dense traffic scenarios is challenging for automated vehicles (AVs) due to potentially stochastic behaviors of other traffic participants and perception uncertainties (e. g., tracking noise and prediction errors, etc.).
Robotics
2 code implementations • 24 Jun 2019 • Wenchao Ding, Lu Zhang, Jing Chen, Shaojie Shen
Planning safe trajectories for autonomous vehicles in complex urban environments is challenging since there are numerous semantic elements (such as dynamic agents, traffic lights and speed limits) to consider.
no code implementations • 24 Jun 2019 • Wenchao Ding, Wenliang Gao, Kaixuan Wang, Shaojie Shen
Our framework starts with an efficient B-spline-based kinodynamic (EBK) search algorithm which finds a feasible trajectory with minimum control effort and time.
no code implementations • 3 Mar 2019 • Wenchao Ding, Jing Chen, Shaojie Shen
In this paper, we uncover that clues to vehicle behaviors over an extended horizon can be found in vehicle interaction, which makes it possible to anticipate the likelihood of a certain behavior, even in the absence of any clear maneuver pattern.
no code implementations • 3 Mar 2019 • Wenchao Ding, Shaojie Shen
In this paper, we present an online two-level vehicle trajectory prediction framework for urban autonomous driving where there are complex contextual factors, such as lane geometries, road constructions, traffic regulations and moving agents.