Search Results for author: Zhiyu Li

Found 10 papers, 7 papers with code

Proxy-RLHF: Decoupling Generation and Alignment in Large Language Model with Proxy

no code implementations • 7 Mar 2024 • Yu Zhu, Chuxiong Sun, Wenfei Yang, Wenqiang Wei, Bo Tang, Tianzhu Zhang, Zhiyu Li, Shifeng Zhang, Feiyu Xiong, Jie Hu, MingChuan Yang

Reinforcement Learning from Human Feedback (RLHF) is the prevailing approach to ensure Large Language Models (LLMs) align with human values.

Language Modelling Large Language Model +2

Paper
Add Code

NewsBench: Systematic Evaluation of LLMs for Writing Proficiency and Safety Adherence in Chinese Journalistic Editorial Applications

no code implementations • 29 Feb 2024 • Miao Li, Ming-Bin Chen, Bo Tang, Shengbin Hou, Pengyu Wang, Haiying Deng, Zhiyu Li, Feiyu Xiong, Keming Mao, Peng Cheng, Yi Luo

This study presents NewsBench, a novel benchmark framework developed to evaluate the capability of Large Language Models (LLMs) in Chinese Journalistic Writing Proficiency (JWP) and their Safety Adherence (SA), addressing the gap between journalistic ethics and the risks associated with AI utilization.

Ethics

Paper
Add Code

Controlled Text Generation for Large Language Model with Dynamic Attribute Graphs

1 code implementation • 17 Feb 2024 • Xun Liang, Hanyu Wang, Shichao Song, Mengting Hu, Xunzhi Wang, Zhiyu Li, Feiyu Xiong, Bo Tang

In this study, we introduce a pluggable CTG framework for Large Language Models (LLMs) named Dynamic Attribute Graphs-based controlled text generation (DATG).

Attribute Language Modelling +2

Paper
Code

CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models

1 code implementation • 30 Jan 2024 • Yuanjie Lyu, Zhiyu Li, Simin Niu, Feiyu Xiong, Bo Tang, Wenjin Wang, Hao Wu, Huanyong Liu, Tong Xu, Enhong Chen, Yi Luo, Peng Cheng, Haiying Deng, Zhonghao Wang, Zijia Lu

For each of these CRUD categories, we have developed comprehensive datasets to evaluate the performance of RAG systems.

Question Answering Retrieval

114

Paper
Code

CAT-LLM: Prompting Large Language Models with Text Style Definition for Chinese Article-style Transfer

1 code implementation • 11 Jan 2024 • Zhen Tao, Dinghao Xi, Zhiyu Li, Liumin Tang, Wei Xu

Text style transfer is increasingly prominent in online entertainment and social media.

Style Transfer Text Style Transfer

Paper
Code

Grimoire is All You Need for Enhancing Large Language Models

1 code implementation • 7 Jan 2024 • Ding Chen, Shichao Song, Qingchen Yu, Zhiyu Li, Wenjin Wang, Feiyu Xiong, Bo Tang

In this paper, we propose a method SLEICL that involves learning from examples using strong language models and then summarizing and transferring these learned skills to weak language models for inference and application.

In-Context Learning

107

Paper
Code

UHGEval: Benchmarking the Hallucination of Chinese Large Language Models via Unconstrained Generation

1 code implementation • 26 Nov 2023 • Xun Liang, Shichao Song, Simin Niu, Zhiyu Li, Feiyu Xiong, Bo Tang, Zhaohui Wy, Dawei He, Peng Cheng, Zhonghao Wang, Haiying Deng

These techniques encompass the use of directed hallucination induction and strategies that deliberately alter authentic text to produce hallucinations.

Benchmarking Hallucination +2

161

Paper
Code

Controllable Multi-Objective Re-ranking with Policy Hypernetworks

1 code implementation • 8 Jun 2023 • Sirui Chen, YuAn Wang, Zijing Wen, Zhiyu Li, Changshuo Zhang, Xiao Zhang, Quan Lin, Cheng Zhu, Jun Xu

In this paper, we propose a framework called controllable multi-objective re-ranking (CMR) which incorporates a hypernetwork to generate parameters for a re-ranking model according to different preference weights.

Recommendation Systems Re-Ranking

Paper
Code

Reinforcement Re-ranking with 2D Grid-based Recommendation Panels

no code implementations • 11 Apr 2022 • Sirui Chen, Xiao Zhang, Xu Chen, Zhiyu Li, YuAn Wang, Quan Lin, Jun Xu

Then, it defines \emph{the MDP discrete time steps as the ranks in the initial ranking list, and the actions as the prediction of the user-item preference and the selection of the slots}.

Recommendation Systems Re-Ranking

Paper
Add Code

Automating Code Review Activities by Large-Scale Pre-training

2 code implementations • 17 Mar 2022 • Zhiyu Li, Shuai Lu, Daya Guo, Nan Duan, Shailesh Jannu, Grant Jenks, Deep Majumder, Jared Green, Alexey Svyatkovskiy, Shengyu Fu, Neel Sundaresan

In this research, we focus on utilizing pre-training techniques for the tasks in the code review scenario.

Comment Generation

1,981

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.