no code implementations • 23 May 2025 • Xiaohao Liu, Xiaobo Xia, Weixiang Zhao, Manyi Zhang, Xianzhi Yu, Xiu Su, Shuo Yang, See-Kiong Ng, Tat-Seng Chua
To overcome these challenges, we propose leap multi-token prediction~(L-MTP), an innovative token prediction method that extends the capabilities of multi-token prediction (MTP) by introducing a leap-based mechanism.
1 code implementation • 7 Apr 2025 • Ruikang Liu, Yuxuan Sun, Manyi Zhang, Haoli Bai, Xianzhi Yu, Tiezheng Yu, Chun Yuan, Lu Hou
In addition, strategically scaling the model sizes or reasoning steps can effectively enhance the performance.
no code implementations • ICCV 2023 • Manyi Zhang, Xuyang Zhao, Jun Yao, Chun Yuan, Weiran Huang
In this paper, to handle the problem and address the limitations of prior works, we propose a representation calibration method RCAL.
no code implementations • 11 Oct 2022 • Manyi Zhang, Yuxin Ren, ZiHao Wang, Chun Yuan
In this paper, to address the distribution shift in learning with instance-dependent label noise, a dynamic distribution-calibration strategy is adopted.