1 code implementation • 31 Mar 2025 • Qiyuan Zhang, Fuyuan Lyu, Zexu Sun, Lei Wang, Weixu Zhang, Wenyue Hua, Haolun Wu, Zhihan Guo, YuFei Wang, Niklas Muennighoff, Irwin King, Xue Liu, Chen Ma
As enthusiasm for scaling computation (data and parameters) in the pretraining era gradually diminished, test-time scaling (TTS), also referred to as ``test-time computing'' has emerged as a prominent research focus.
no code implementations • 13 Mar 2025 • Qiyuan Zhang, Chenyu Wu, Wenzhang Sun, Huaize Liu, Donglin Di, Wei Chen, Changqing Zou
Second, in the Reasoning step, long-term modeling and efficient reasoning are performed in this latent space to generate motion sequences.
no code implementations • 26 Dec 2024 • Wenzhang Sun, Xiang Li, Donglin Di, Zhuding Liang, Qiyuan Zhang, Hao Li, Wei Chen, Jianxun Cui
Recently, animating portrait images using audio input is a popular task.
no code implementations • 21 Dec 2024 • Minda Hu, Qiyuan Zhang, YuFei Wang, Bowei He, Hongru Wang, Jingyan Zhou, Liangyou Li, Yasheng Wang, Chen Ma, Irwin King
However, existing IFT datasets often contain knowledge that is inconsistent with LLMs' internal knowledge learned from the pre-training phase, which can greatly affect the efficacy of IFT.
no code implementations • 7 Oct 2024 • Qiyuan Zhang, YuFei Wang, Tiezheng Yu, Yuxin Jiang, Chuhan Wu, Liangyou Li, Yasheng Wang, Xin Jiang, Lifeng Shang, Ruiming Tang, Fuyuan Lyu, Chen Ma
With significant efforts in recent studies, LLM-as-a-Judge has become a cost-effective alternative to human evaluation for assessing the text generation quality in a wide range of tasks.
1 code implementation • 1 Jul 2024 • Qiyuan Zhang, Fuyuan Lyu, Xue Liu, Chen Ma
The pioneering scaling law on downstream works demonstrated intrinsic similarities within model families and utilized such similarities for performance prediction.
no code implementations • 17 Jul 2022 • Christopher D. Wallbridge, Qiyuan Zhang
This extended abstract introduces the initial steps taken to develop a system for Rapid Internal Simulation of Knowledge (RISK).
1 code implementation • Findings (EMNLP) 2021 • Qiyuan Zhang, Lei Wang, Sicheng Yu, Shuohang Wang, Yang Wang, Jing Jiang, Ee-Peng Lim
While diverse question answering (QA) datasets have been proposed and contributed significantly to the development of deep learning models for QA tasks, the existing datasets fall short in two aspects.
1 code implementation • 2 Sep 2021 • Yihuai Lan, Lei Wang, Qiyuan Zhang, Yunshi Lan, Bing Tian Dai, Yan Wang, Dongxiang Zhang, Ee-Peng Lim
Over the last few years, there are a growing number of datasets and deep learning-based methods proposed for effectively solving MWPs.
Ranked #9 on
Math Word Problem Solving
on Math23K
1 code implementation • NeurIPS 2021 • Yiqin Yang, Xiaoteng Ma, Chenghao Li, Zewu Zheng, Qiyuan Zhang, Gao Huang, Jun Yang, Qianchuan Zhao
Moreover, we extend ICQ to multi-agent tasks by decomposing the joint-policy under the implicit constraint.