Search Results for author: Kairong Luo

Found 1 paper, 1 paper with code

A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules

1 code implementation • 17 Mar 2025 • Kairong Luo, Haodong Wen, Shengding Hu, Zhenbo Sun, Zhiyuan Liu, Maosong Sun, Kaifeng Lyu, WenGuang Chen

Training large models is both resource-intensive and time-consuming, making it crucial to understand the quantitative relationship between model performance and hyperparameters.
