Search Results for author: Zecheng Wang

Found 5 papers, 3 papers with code

Checkpoint Merging via Bayesian Optimization in LLM Pretraining

no code implementations28 Mar 2024 Deyuan Liu, Zecheng Wang, Bingning Wang, WeiPeng Chen, Chunshan Li, Zhiying Tu, Dianhui Chu, Bo Li, Dianbo Sui

The rapid proliferation of large language models (LLMs) such as GPT-4 and Gemini underscores the intense demand for resources during their training processes, posing significant challenges due to substantial computational and environmental costs.

Bayesian Optimization

Pre-training with Synthetic Data Helps Offline Reinforcement Learning

1 code implementation1 Oct 2023 Zecheng Wang, Che Wang, Zixuan Dong, Keith Ross

Recently, it has been shown that for offline deep reinforcement learning (DRL), pre-training Decision Transformer with a large language corpus can improve downstream performance (Reid et al., 2022).

D4RL Q-Learning +2

Suffix Retrieval-Augmented Language Modeling

1 code implementation6 Nov 2022 Zecheng Wang, Yik-Cheung Tam

SUREALM employs an embedding retriever to search for training sentences in a data store that share similar word history during sequence generation.

Causal Language Modeling Language Modelling +2

Cannot find the paper you are looking for? You can Submit a new open access paper.