Search Results for author: ChengWei Wu

Found 5 papers, 2 papers with code

Beyond IID: Optimizing Instruction Learning from the Perspective of Instruction Interaction and Dependency

no code implementations11 Sep 2024 Hanyu Zhao, Li Du, Yiming Ju, ChengWei Wu, Tengfei Pan

With the availability of various instruction datasets, a pivotal challenge is how to effectively select and integrate these instructions to fine-tune large language models (LLMs).

AquilaMoE: Efficient Training for MoE Models with Scale-Up and Scale-Out Strategies

1 code implementation13 Aug 2024 Bo-Wen Zhang, Liangdong Wang, Ye Yuan, Jijie Li, Shuhao Gu, Mengdi Zhao, Xinya Wu, Guang Liu, ChengWei Wu, Hanyu Zhao, Li Du, Yiming Ju, Quanyue Ma, Yulong Ao, Yingli Zhao, Songhe Zhu, Zhou Cao, Dong Liang, Yonghua Lin, Ming Zhang, Shunfei Wang, Yanxin Zhou, Min Ye, Xuekai Chen, Xinyang Yu, Xiangjun Huang, Jian Yang

In this paper, we present AquilaMoE, a cutting-edge bilingual 8*16B Mixture of Experts (MoE) language model that has 8 experts with 16 billion parameters each and is developed using an innovative training methodology called EfficientScale.

Language Modelling Transfer Learning

Lyapunov-Based Reinforcement Learning State Estimator

no code implementations26 Oct 2020 Liang Hu, ChengWei Wu, Wei Pan

An actor-critic reinforcement learning algorithm is proposed to learn the state estimator approximated by a deep neural network.

reinforcement-learning Reinforcement Learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.