Search Results for author: Yiheng Li

Found 21 papers, 10 papers with code

Unleashing the Potential of Consistency Learning for Detecting and Grounding Multi-Modal Media Manipulation

1 code implementation CVPR 2025 Yiheng Li, Yang Yang, Zichang Tan, Huan Liu, Weihua Chen, Xu Zhou, Zhen Lei

To tackle the threat of fake news, the task of detecting and grounding multi-modal media manipulation DGM4 has received increasing attention.

Decoder

Comparative Analysis of Machine Learning Models for Lung Cancer Mutation Detection and Staging Using 3D CT Scans

no code implementations28 May 2025 Yiheng Li, Francisco Carrillo-Perez, Mohammed Alawad, Olivier Gevaert

Lung cancer is the leading cause of cancer mortality worldwide, and non-invasive methods for detecting key mutations and staging are essential for improving patient outcomes.

Multiple Instance Learning

Benchmarking Chest X-ray Diagnosis Models Across Multinational Datasets

no code implementations21 May 2025 Qinmei Xu, Yiheng Li, Xianghao Zhan, Ahmet Gorkem Er, Brittany Dashevsky, Chuanjun Xu, Mohammed Alawad, Mengya Yang, Liu Ya, Changsheng Zhou, Xiao Li, Haruka Itakura, Olivier Gevaert

MAVL, a model incorporating knowledge-enhanced prompts and structured supervision, achieved the highest performance on public (mean AUROC: 0. 82; AUPRC: 0. 32) and private (mean AUROC: 0. 95; AUPRC: 0. 89) datasets, ranking first in 14 of 37 public and 3 of 4 private tasks.

Benchmarking Diagnostic

RCTrans: Radar-Camera Transformer via Radar Densifier and Sequential Decoder for 3D Object Detection

1 code implementation17 Dec 2024 Yiheng Li, Yang Yang, Zhen Lei

In radar-camera 3D object detection, the radar point clouds are sparse and noisy, which causes difficulties in fusing camera and radar modalities.

3D Object Detection Decoder +2

Mitigating Object Hallucination via Concentric Causal Attention

1 code implementation21 Oct 2024 Yun Xing, Yiheng Li, Ivan Laptev, Shijian Lu

Due to the long-term decay in RoPE, LVLMs tend to hallucinate more when relevant visual cues are distant from instruction tokens in the multimodal input sequence.

Hallucination Object +1

Learning Content-Aware Multi-Modal Joint Input Pruning via Bird's-Eye-View Representation

no code implementations9 Oct 2024 Yuxin Li, Yiheng Li, Xulei Yang, Mengying Yu, Zihang Huang, XiaoJun Wu, Chai Kiat Yeo

In the landscape of autonomous driving, Bird's-Eye-View (BEV) representation has recently garnered substantial academic attention, serving as a transformative framework for the fusion of multi-modal sensor inputs.

Autonomous Driving Computational Efficiency +1

QuadBEV: An Efficient Quadruple-Task Perception Framework via Bird's-Eye-View Representation

no code implementations9 Oct 2024 Yuxin Li, Yiheng Li, Xulei Yang, Mengying Yu, Zihang Huang, XiaoJun Wu, Chai Kiat Yeo

Bird's-Eye-View (BEV) perception has become a vital component of autonomous driving systems due to its ability to integrate multiple sensor inputs into a unified representation, enhancing performance in various downstream tasks.

3D Object Detection Autonomous Driving +2

Immiscible Diffusion: Accelerating Diffusion Training with Noise Assignment

2 code implementations18 Jun 2024 Yiheng Li, Heyang Jiang, Akio Kodaira, Masayoshi Tomizuka, Kurt Keutzer, Chenfeng Xu

Drawing inspiration from the immiscibility phenomenon in physics, we propose Immiscible Diffusion, a simple and effective method to improve the random mixture of noise-data mapping.

Denoising

Pre-training on Synthetic Driving Data for Trajectory Prediction

1 code implementation18 Sep 2023 Yiheng Li, Seth Z. Zhao, Chenfeng Xu, Chen Tang, Chenran Li, Mingyu Ding, Masayoshi Tomizuka, Wei Zhan

Accumulating substantial volumes of real-world driving data proves pivotal in the realm of trajectory forecasting for autonomous driving.

Autonomous Driving Prediction +1

HybridPoint: Point Cloud Registration Based on Hybrid Point Sampling and Matching

1 code implementation29 Mar 2023 Yiheng Li, Canhui Tang, Runzhao Yao, Aixue Ye, Feng Wen, Shaoyi Du

Firstly, we propose to use salient points with prominent local features as nodes to increase patch repeatability, and introduce some uniformly distributed points to complete the point cloud, thus constituting hybrid points.

Patch Matching Point Cloud Registration

Kinematics clustering enables head impact subtyping for better traumatic brain injury prediction

no code implementations7 Aug 2021 Xianghao Zhan, Yiheng Li, Yuzhe Liu, Nicholas J. Cecchi, Olivier Gevaert, Michael M. Zeineh, Gerald A. Grant, David B. Camarillo

However, due to different kinematic characteristics, many brain injury risk estimation models are not generalizable across the variety of impacts that humans may sustain.

Car Racing Clustering +2

Predictive Factors of Kinematics in Traumatic Brain Injury from Head Impacts Based on Statistical Interpretation

no code implementations9 Feb 2021 Xianghao Zhan, Yiheng Li, Yuzhe Liu, August G. Domel, Hossein Vahid Alizadeh, Zhou Zhou, Nicholas J. Cecchi, Samuel J. Raymond, Stephen Tiernan, Jesse Ruan, Saeed Barbat, Olivier Gevaert, Michael M. Zeineh, Gerald A. Grant, David B. Camarillo

To better design brain injury criteria, the predictive power of rotational kinematics factors, which are different in 1) the derivative order (angular velocity, angular acceleration, angular jerk), 2) the direction and 3) the power (e. g., square-rooted, squared, cubic) of the angular velocity, were analyzed based on different datasets including laboratory impacts, American football, mixed martial arts (MMA), NHTSA automobile crashworthiness tests and NASCAR crash events.

Relationship between brain injury criteria and brain strain across different types of head impacts can be different

no code implementations18 Dec 2020 Xianghao Zhan, Yiheng Li, Yuzhe Liu, August G. Domel, Hossein Vahid Alizadeh, Samuel J. Raymond, Jesse Ruan, Saeed Barbat, Stephen Tiernan, Olivier Gevaert, Michael Zeineh, Gerald Grant, David B. Camarillo

The results show a significant difference in the relationship between BIC and brain strain across datasets, indicating the same BIC value may suggest different brain strain in different head impact types.

regression

Cannot find the paper you are looking for? You can Submit a new open access paper.