Search Results for author: Che Zhang

Found 3 papers, 3 papers with code

Learning to Check: Unleashing Potentials for Self-Correction in Large Language Models

1 code implementation20 Feb 2024 Che Zhang, Zhenyang Xiao, Chengcheng Han, Yixin Lian, Yuejian Fang

After integrating the original CoT data and checking-correction data for training, we observe that models could improve their self-checking capabilities, thereby enhancing their self-correction capacity and eliminating the need for external feedback or ground truth labels to ascertain the endpoint of correction.

Mathematical Reasoning

DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models

1 code implementation8 Oct 2023 Chengcheng Han, Xiaowei Du, Che Zhang, Yixin Lian, Xiang Li, Ming Gao, Baoyuan Wang

Chain-of-Thought (CoT) prompting has proven to be effective in enhancing the reasoning capabilities of Large Language Models (LLMs) with at least 100 billion parameters.

Arithmetic Reasoning

Cannot find the paper you are looking for? You can Submit a new open access paper.