Search Results for author: Dingchen Yang

Found 3 papers, 2 papers with code

Beyond Intermediate States: Explaining Visual Redundancy through Language

1 code implementation26 Mar 2025 Dingchen Yang, Bowen Cao, Anran Zhang, Weibo Gu, Winston Hu, Guang Chen

Multi-modal Large Langue Models (MLLMs) often process thousands of visual tokens, which consume a significant portion of the context window and impose a substantial computational burden.

Pensieve: Retrospect-then-Compare Mitigates Visual Hallucination

1 code implementation21 Mar 2024 Dingchen Yang, Bowen Cao, Guang Chen, Changjun Jiang

Multi-modal Large Language Models (MLLMs) demonstrate remarkable success across various vision-language tasks.

Hallucination MME +1

3D Data Augmentation for Driving Scenes on Camera

no code implementations18 Mar 2023 Wenwen Tong, Jiangwei Xie, Tianyu Li, Hanming Deng, Xiangwei Geng, Ruoyi Zhou, Dingchen Yang, Bo Dai, Lewei Lu, Hongyang Li

The proposed data augmentation approach contributes to a gain of 1. 7% and 1. 4% in terms of detection accuracy, on Waymo and nuScences respectively.

Autonomous Driving Data Augmentation +2

Cannot find the paper you are looking for? You can Submit a new open access paper.