1 code implementation • CVPR 2025 • Yiheng Li, Yang Yang, Zichang Tan, Huan Liu, Weihua Chen, Xu Zhou, Zhen Lei
To tackle the threat of fake news, the task of detecting and grounding multi-modal media manipulation DGM4 has received increasing attention.
no code implementations • 28 May 2025 • Yiheng Li, Francisco Carrillo-Perez, Mohammed Alawad, Olivier Gevaert
Lung cancer is the leading cause of cancer mortality worldwide, and non-invasive methods for detecting key mutations and staging are essential for improving patient outcomes.
1 code implementation • 24 May 2025 • Yiheng Li, Feng Liang, Dan Kondratyuk, Masayoshi Tomizuka, Kurt Keutzer, Chenfeng Xu
The substantial training cost of diffusion models hinders their deployment.
no code implementations • 21 May 2025 • Qinmei Xu, Yiheng Li, Xianghao Zhan, Ahmet Gorkem Er, Brittany Dashevsky, Chuanjun Xu, Mohammed Alawad, Mengya Yang, Liu Ya, Changsheng Zhou, Xiao Li, Haruka Itakura, Olivier Gevaert
MAVL, a model incorporating knowledge-enhanced prompts and structured supervision, achieved the highest performance on public (mean AUROC: 0. 82; AUPRC: 0. 32) and private (mean AUROC: 0. 95; AUPRC: 0. 89) datasets, ranking first in 14 of 37 public and 3 of 4 private tasks.
1 code implementation • 11 Jan 2025 • Yiheng Li, Yang Yang, Zhen Lei
It uses the dual-branch structure which adopts class-specific query and Bbox-specific query to corresponding sub-tasks.
1 code implementation • 17 Dec 2024 • Yiheng Li, Yang Yang, Zhen Lei
In radar-camera 3D object detection, the radar point clouds are sparse and noisy, which causes difficulties in fusing camera and radar modalities.
1 code implementation • CVPR 2025 • Yiheng Li, Ruibing Hou, Hong Chang, Shiguang Shan, Xilin Chen
Human pose plays a crucial role in the digital age.
1 code implementation • 21 Oct 2024 • Yun Xing, Yiheng Li, Ivan Laptev, Shijian Lu
Due to the long-term decay in RoPE, LVLMs tend to hallucinate more when relevant visual cues are distant from instruction tokens in the multimodal input sequence.
no code implementations • 9 Oct 2024 • Yuxin Li, Yiheng Li, Xulei Yang, Mengying Yu, Zihang Huang, XiaoJun Wu, Chai Kiat Yeo
In the landscape of autonomous driving, Bird's-Eye-View (BEV) representation has recently garnered substantial academic attention, serving as a transformative framework for the fusion of multi-modal sensor inputs.
no code implementations • 9 Oct 2024 • Yuxin Li, Yiheng Li, Xulei Yang, Mengying Yu, Zihang Huang, XiaoJun Wu, Chai Kiat Yeo
Bird's-Eye-View (BEV) perception has become a vital component of autonomous driving systems due to its ability to integrate multiple sensor inputs into a unified representation, enhancing performance in various downstream tasks.
2 code implementations • 18 Jun 2024 • Yiheng Li, Heyang Jiang, Akio Kodaira, Masayoshi Tomizuka, Kurt Keutzer, Chenfeng Xu
Drawing inspiration from the immiscibility phenomenon in physics, we propose Immiscible Diffusion, a simple and effective method to improve the random mixture of noise-data mapping.
no code implementations • 15 Mar 2024 • Yiheng Li, Hongyang Li, Zehao Huang, Hong Chang, Naiyan Wang
The versatility of SparseFusion is also validated in the temporal object detection task and 3D lane detection task.
no code implementations • 1 Dec 2023 • Yuxin Li, Qiang Han, Mengying Yu, Yuxin Jiang, Chaikiat Yeo, Yiheng Li, Zihang Huang, Nini Liu, Hsuanhan Chen, XiaoJun Wu
3D object detection in Bird's-Eye-View (BEV) space has recently emerged as a prevalent approach in the field of autonomous driving.
1 code implementation • 18 Sep 2023 • Yiheng Li, Seth Z. Zhao, Chenfeng Xu, Chen Tang, Chenran Li, Mingyu Ding, Masayoshi Tomizuka, Wei Zhan
Accumulating substantial volumes of real-world driving data proves pivotal in the realm of trajectory forecasting for autonomous driving.
1 code implementation • 5 May 2023 • Canhui Tang, Yiheng Li, Shaoyi Du, Guofa Wang, Zhiqiang Tian
Feature Descriptors and Detectors are two main components of feature-based point cloud registration.
1 code implementation • 29 Mar 2023 • Yiheng Li, Canhui Tang, Runzhao Yao, Aixue Ye, Feng Wen, Shaoyi Du
Firstly, we propose to use salient points with prominent local features as nodes to increase patch repeatability, and introduce some uniformly distributed points to complete the point cloud, thus constituting hybrid points.
no code implementations • 27 Jul 2022 • Yiheng Li, Connelly Barnes, Kun Huang, Fang-Lue Zhang
Optical flow computation is essential in the early stages of the video processing pipeline.
no code implementations • 7 Aug 2021 • Xianghao Zhan, Yiheng Li, Yuzhe Liu, Nicholas J. Cecchi, Olivier Gevaert, Michael M. Zeineh, Gerald A. Grant, David B. Camarillo
However, due to different kinematic characteristics, many brain injury risk estimation models are not generalizable across the variety of impacts that humans may sustain.
no code implementations • 19 Apr 2021 • Xianghao Zhan, Yiheng Li, Yuzhe Liu, Nicholas J. Cecchi, Samuel J. Raymond, Zhou Zhou, Hossein Vahid Alizadeh, Jesse Ruan, Saeed Barbat, Stephen Tiernan, Olivier Gevaert, Michael M. Zeineh, Gerald A. Grant, David B. Camarillo
A random forest classifier with spectral densities of linear acceleration and angular velocity was built to classify head impact types (e. g., football, car crash, mixed martial arts).
no code implementations • 9 Feb 2021 • Xianghao Zhan, Yiheng Li, Yuzhe Liu, August G. Domel, Hossein Vahid Alizadeh, Zhou Zhou, Nicholas J. Cecchi, Samuel J. Raymond, Stephen Tiernan, Jesse Ruan, Saeed Barbat, Olivier Gevaert, Michael M. Zeineh, Gerald A. Grant, David B. Camarillo
To better design brain injury criteria, the predictive power of rotational kinematics factors, which are different in 1) the derivative order (angular velocity, angular acceleration, angular jerk), 2) the direction and 3) the power (e. g., square-rooted, squared, cubic) of the angular velocity, were analyzed based on different datasets including laboratory impacts, American football, mixed martial arts (MMA), NHTSA automobile crashworthiness tests and NASCAR crash events.
no code implementations • 18 Dec 2020 • Xianghao Zhan, Yiheng Li, Yuzhe Liu, August G. Domel, Hossein Vahid Alizadeh, Samuel J. Raymond, Jesse Ruan, Saeed Barbat, Stephen Tiernan, Olivier Gevaert, Michael Zeineh, Gerald Grant, David B. Camarillo
The results show a significant difference in the relationship between BIC and brain strain across datasets, indicating the same BIC value may suggest different brain strain in different head impact types.