Search Results for author: Qiyuan Zhang

Found 10 papers, 5 papers with code

A Survey on Test-Time Scaling in Large Language Models: What, How, Where, and How Well?

1 code implementation31 Mar 2025 Qiyuan Zhang, Fuyuan Lyu, Zexu Sun, Lei Wang, Weixu Zhang, Wenyue Hua, Haolun Wu, Zhihan Guo, YuFei Wang, Niklas Muennighoff, Irwin King, Xue Liu, Chen Ma

As enthusiasm for scaling computation (data and parameters) in the pretraining era gradually diminished, test-time scaling (TTS), also referred to as ``test-time computing'' has emerged as a prominent research focus.

Semantic Latent Motion for Portrait Video Generation

no code implementations13 Mar 2025 Qiyuan Zhang, Chenyu Wu, Wenzhang Sun, Huaize Liu, Donglin Di, Wei Chen, Changqing Zou

Second, in the Reasoning step, long-term modeling and efficient reasoning are performed in this latent space to generate motion sequences.

Descriptive Video Generation

NILE: Internal Consistency Alignment in Large Language Models

no code implementations21 Dec 2024 Minda Hu, Qiyuan Zhang, YuFei Wang, Bowei He, Hongru Wang, Jingyan Zhou, Liangyou Li, Yasheng Wang, Chen Ma, Irwin King

However, existing IFT datasets often contain knowledge that is inconsistent with LLMs' internal knowledge learned from the pre-training phase, which can greatly affect the efficacy of IFT.

RevisEval: Improving LLM-as-a-Judge via Response-Adapted References

no code implementations7 Oct 2024 Qiyuan Zhang, YuFei Wang, Tiezheng Yu, Yuxin Jiang, Chuhan Wu, Liangyou Li, Yasheng Wang, Xin Jiang, Lifeng Shang, Ruiming Tang, Fuyuan Lyu, Chen Ma

With significant efforts in recent studies, LLM-as-a-Judge has become a cost-effective alternative to human evaluation for assessing the text generation quality in a wide range of tasks.

Instruction Following Text Generation

Collaborative Performance Prediction for Large Language Models

1 code implementation1 Jul 2024 Qiyuan Zhang, Fuyuan Lyu, Xue Liu, Chen Ma

The pioneering scaling law on downstream works demonstrated intrinsic similarities within model families and utilized such similarities for performance prediction.

Prediction

Introducing RISK

no code implementations17 Jul 2022 Christopher D. Wallbridge, Qiyuan Zhang

This extended abstract introduces the initial steps taken to develop a system for Rapid Internal Simulation of Knowledge (RISK).

NOAHQA: Numerical Reasoning with Interpretable Graph Question Answering Dataset

1 code implementation Findings (EMNLP) 2021 Qiyuan Zhang, Lei Wang, Sicheng Yu, Shuohang Wang, Yang Wang, Jing Jiang, Ee-Peng Lim

While diverse question answering (QA) datasets have been proposed and contributed significantly to the development of deep learning models for QA tasks, the existing datasets fall short in two aspects.

Graph Question Answering Question Answering

Cannot find the paper you are looking for? You can Submit a new open access paper.