Search Results for author: Duanyu Feng

Found 7 papers, 5 papers with code

Towards Understanding the Influence of Reward Margin on Preference Model Performance

no code implementations • 7 Apr 2024 • Bowen Qin, Duanyu Feng, Xi Yang

Reinforcement Learning from Human Feedback (RLHF) is a widely used framework for the training of language models.

Towards Analyzing and Understanding the Limitations of DPO: A Theoretical Perspective

no code implementations • 6 Apr 2024 • Duanyu Feng, Bowen Qin, Chen Huang, Zheng Zhang, Wenqiang Lei

Direct Preference Optimization (DPO), which derives reward signals directly from pairwise preference data, has shown its effectiveness in aligning Large Language Models (LLMs) with human preferences.

The FinBen: An Holistic Financial Benchmark for Large Language Models

2 code implementations • 20 Feb 2024 • Qianqian Xie, Weiguang Han, Zhengyu Chen, Ruoyu Xiang, Xiao Zhang, Yueru He, Mengxi Xiao, Dong Li, Yongfu Dai, Duanyu Feng, Yijing Xu, Haoqiang Kang, Ziyan Kuang, Chenhan Yuan, Kailai Yang, Zheheng Luo, Tianlin Zhang, Zhiwei Liu, Guojun Xiong, Zhiyang Deng, Yuechen Jiang, Zhiyuan Yao, Haohang Li, Yangyang Yu, Gang Hu, Jiajia Huang, Xiao-Yang Liu, Alejandro Lopez-Lira, Benyou Wang, Yanzhao Lai, Hao Wang, Min Peng, Sophia Ananiadou, Jimin Huang

This, along with the rapid development of LLMs, highlights the urgent need for a systematic financial evaluation benchmark for LLMs.

Dólares or Dollars? Unraveling the Bilingual Prowess of Financial LLMs Between Spanish and English

2 code implementations • 12 Feb 2024 • Xiao Zhang, Ruoyu Xiang, Chenhan Yuan, Duanyu Feng, Weiguang Han, Alejandro Lopez-Lira, Xiao-Yang Liu, Sophia Ananiadou, Min Peng, Jimin Huang, Qianqian Xie

We evaluate our model and existing LLMs using FLARE-ES, the first comprehensive bilingual evaluation benchmark with 21 datasets covering 9 tasks.

DREditor: An Time-efficient Approach for Building a Domain-specific Dense Retrieval Model

1 code implementation • 23 Jan 2024 • Chen Huang, Duanyu Feng, Wenqiang Lei, Jiancheng Lv

Motivated by this, we develop a time-efficient approach called DREditor to edit the matching rule of an off-the-shelf dense retrieval model to suit a specific domain.

Retrieval

LAiW: A Chinese Legal Large Language Models Benchmark

1 code implementation • 9 Oct 2023 • Yongfu Dai, Duanyu Feng, Jimin Huang, Haochen Jia, Qianqian Xie, Yifang Zhang, Weiguang Han, Wei Tian, Hao Wang

Through automated evaluation of current general and legal domain LLMs on our benchmark, we indicate that these LLMs may not align with the logic of legal practice.

Information Retrieval

Empowering Many, Biasing a Few: Generalist Credit Scoring through Large Language Models

1 code implementation • 1 Oct 2023 • Duanyu Feng, Yongfu Dai, Jimin Huang, Yifang Zhang, Qianqian Xie, Weiguang Han, Zhengyu Chen, Alejandro Lopez-Lira, Hao Wang

We then propose the first Credit and Risk Assessment Large Language Model (CALM) by instruction tuning, tailored to the nuanced demands of various financial risk assessment tasks.

Decision Making, Language Modelling, +1
