Search Results for author: Qianlong Du

Found 4 papers, 3 papers with code

A Survey on Data Selection for LLM Instruction Tuning

1 code implementation • 4 Feb 2024 • Jiahao Wang, Bolin Zhang, Qianlong Du, Jiajun Zhang, Dianhui Chu

Instruction tuning is a vital step in training large language models (LLMs), so how to enhance its effect has received increasing attention.

Instruction Following

MoDS: Model-oriented Data Selection for Instruction Tuning

1 code implementation • 27 Nov 2023 • Qianlong Du, Chengqing Zong, Jiajun Zhang

First, our approach uses a quality evaluation model to extract a high-quality subset from the original instruction dataset, and then designs an algorithm to select from that subset a seed instruction dataset with good coverage.
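
The two stages described above can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not name the coverage algorithm, so greedy farthest-point selection is used here as a stand-in, and `quality_score`, `embed`, `threshold`, and `k` are hypothetical names supplied by the caller, not part of the paper's code.

```python
import math


def select_seed(examples, quality_score, embed, threshold, k):
    """Two-stage selection: quality filter, then coverage-oriented picking.

    quality_score and embed are caller-supplied callables (hypothetical
    stand-ins for a quality evaluation model and a sentence encoder).
    """
    # Stage 1: keep only examples whose quality score meets the threshold.
    high_quality = [ex for ex in examples if quality_score(ex) >= threshold]
    if not high_quality or k <= 0:
        return []

    vectors = [embed(ex) for ex in high_quality]

    # Stage 2 (assumed algorithm): greedy farthest-point selection.
    # Start from the first example, then repeatedly add the example that
    # is farthest from everything chosen so far.
    chosen = [0]
    chosen_set = {0}
    min_dist = [math.dist(v, vectors[0]) for v in vectors]
    target = min(k, len(high_quality))
    while len(chosen) < target:
        candidates = [i for i in range(len(vectors)) if i not in chosen_set]
        nxt = max(candidates, key=lambda i: min_dist[i])
        chosen.append(nxt)
        chosen_set.add(nxt)
        # Update each example's distance to its nearest chosen example.
        min_dist = [
            min(d, math.dist(v, vectors[nxt]))
            for d, v in zip(min_dist, vectors)
        ]
    return [high_quality[i] for i in chosen]
```

Farthest-point selection is chosen here only because it spreads the picks across the embedding space; any other diversity-aware selector could take its place.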

Instruction Following

ChineseWebText: Large-scale High-quality Chinese Web Text Extracted with Effective Evaluation Model

1 code implementation • 2 Nov 2023 • Jianghao Chen, Pu Jian, Tengxiao Xi, Dongyi Yi, Qianlong Du, Chenglin Ding, Guibo Zhu, Chengqing Zong, Jinqiao Wang, Jiajun Zhang

Using our proposed approach, we release ChineseWebText, the largest and most recent large-scale high-quality Chinese web text dataset, which consists of 1.42 TB of data; each text is associated with a quality score, allowing LLM researchers to select data according to a desired quality threshold.
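
Choosing data by quality threshold could look like the minimal sketch below. The record layout is an assumption made for illustration: the field names `text` and `score` are hypothetical, and the listing does not describe the dataset's actual file format.

```python
def filter_by_quality(records, threshold, score_key="score"):
    """Yield only the records whose quality score meets the threshold.

    records is any iterable of dicts; score_key is a hypothetical field
    name, not taken from the dataset's documentation.
    """
    for record in records:
        if record[score_key] >= threshold:
            yield record


# Example with made-up records and scores.
sample = [
    {"text": "doc A", "score": 0.91},
    {"text": "doc B", "score": 0.42},
    {"text": "doc C", "score": 0.77},
]
kept = list(filter_by_quality(sample, threshold=0.75))
```

Writing the filter as a generator keeps memory use flat, which matters when streaming over a corpus of this size.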

Adopting the Word-Pair-Dependency-Triplets with Individual Comparison for Natural Language Inference

no code implementations • COLING 2018 • Qianlong Du, Cheng-qing Zong, Keh-Yih Su

This paper proposes to perform natural language inference with Word-Pair-Dependency-Triplets.

Decision Making • Machine Translation • +4 more
