1 code implementation • 30 May 2025 • Zefan Cai, Wen Xiao, Hanshi Sun, Cheng Luo, Yikai Zhang, Ke Wan, Yucheng Li, Yeyang Zhou, Li-Wen Chang, Jiuxiang Gu, Zhen Dong, Anima Anandkumar, Abedelkadir Asi, Junjie Hu
To address this, we propose Redundancy-aware KV Cache Compression for Reasoning models (R-KV), a novel method specifically targeting redundant tokens in reasoning models.
no code implementations • 24 Apr 2025 • Ke Wan, Kensuke Tanioka, Toshio Shimokawa
By bridging the gap between accuracy and interpretability, our study contributes a valuable tool for multi-arm HTE estimation, supporting precision medicine.
1 code implementation • 7 Jul 2024 • Ke Wan, Yi Liang, Susik Yoon
Continuous learning from immense volumes of streaming data has become critical in the internet era.
no code implementations • 2 May 2023 • Ke Wan, Alain Kornhauser
This paper investigates Gaussian copula mixture models (GCMM), which are an extension of Gaussian mixture models (GMM) that incorporate copula concepts.