Search Results for author: Zhijian Ma

Found 4 papers, 2 papers with code

Data Mixing Made Efficient: A Bivariate Scaling Law for Language Model Pretraining

no code implementations23 May 2024 Ce Ge, Zhijian Ma, Daoyuan Chen, Yaliang Li, Bolin Ding

Large language models exhibit exceptional generalization capabilities, primarily attributed to the utilization of diversely sourced data.

UniDM: A Unified Framework for Data Manipulation with Large Language Models

no code implementations10 May 2024 Yichen Qian, Yongyi He, Rong Zhu, Jintao Huang, Zhijian Ma, Haibin Wang, Yaohua Wang, Xiuyu Sun, Defu Lian, Bolin Ding, Jingren Zhou

In this paper, inspired by the cross-task generality of LLMs on NLP tasks, we pave the first step to design an automatic and general solution to tackle with data manipulation tasks.

Cannot find the paper you are looking for? You can Submit a new open access paper.