Search Results for author: Pengxiang Li

Found 3 papers, 0 papers with code

Automated Evaluation of Large Vision-Language Models on Self-driving Corner Cases

no code implementations • 16 Apr 2024 • Yanze Li, Wenhua Zhang, Kai Chen, Yanxin Liu, Pengxiang Li, Ruiyuan Gao, Lanqing Hong, Meng Tian, Xinhai Zhao, Zhenguo Li, Dit-yan Yeung, Huchuan Lu, Xu Jia

Large Vision-Language Models (LVLMs) have received widespread attention in the autonomous driving domain owing to their remarkable visual reasoning ability to understand images and videos, which significantly advances the development of interpretable end-to-end autonomous driving.

Autonomous Driving, Visual Reasoning

TrackDiffusion: Tracklet-Conditioned Video Generation via Diffusion Models

no code implementations • 1 Dec 2023 • Pengxiang Li, Kai Chen, Zhili Liu, Ruiyuan Gao, Lanqing Hong, Guo Zhou, Hua Yao, Dit-yan Yeung, Huchuan Lu, Xu Jia

Despite remarkable achievements in video synthesis, achieving granular control over complex dynamics, such as nuanced movement among multiple interacting objects, remains a significant hurdle for dynamic world modeling, compounded by the need to manage appearance and disappearance, handle drastic scale changes, and ensure consistency for instances across frames.

Image Classification, Multi-Object Tracking, +4

A Decomposition Model for Stereo Matching

no code implementations • CVPR 2021 • Chengtang Yao, Yunde Jia, Huijun Di, Pengxiang Li, Yuwei Wu

In this paper, we present a decomposition model for stereo matching to solve the problem of excessive growth in computational cost (time and memory cost) as the resolution increases.

Disparity Estimation, Stereo Matching
