Search Results for author: Duanyu Feng

Found 7 papers, 5 papers with code

Towards Understanding the Influence of Reward Margin on Preference Model Performance

no code implementations7 Apr 2024 Bowen Qin, Duanyu Feng, Xi Yang

Reinforcement Learning from Human Feedback (RLHF) is a widely used framework for the training of language models.

Language Modelling

Towards Analyzing and Understanding the Limitations of DPO: A Theoretical Perspective

no code implementations6 Apr 2024 Duanyu Feng, Bowen Qin, Chen Huang, Zheng Zhang, Wenqiang Lei

Direct Preference Optimization (DPO), which derives reward signals directly from pairwise preference data, has shown its effectiveness on aligning Large Language Models (LLMs) with human preferences.

Dólares or Dollars? Unraveling the Bilingual Prowess of Financial LLMs Between Spanish and English

2 code implementations12 Feb 2024 Xiao Zhang, Ruoyu Xiang, Chenhan Yuan, Duanyu Feng, Weiguang Han, Alejandro Lopez-Lira, Xiao-Yang Liu, Sophia Ananiadou, Min Peng, Jimin Huang, Qianqian Xie

We evaluate our model and existing LLMs using FLARE-ES, the first comprehensive bilingual evaluation benchmark with 21 datasets covering 9 tasks.

DREditor: An Time-efficient Approach for Building a Domain-specific Dense Retrieval Model

1 code implementation23 Jan 2024 Chen Huang, Duanyu Feng, Wenqiang Lei, Jiancheng Lv

Motivated by this, we develop a time-efficient approach called DREditor to edit the matching rule of an off-the-shelf dense retrieval model to suit a specific domain.

Retrieval

LAiW: A Chinese Legal Large Language Models Benchmark

1 code implementation9 Oct 2023 Yongfu Dai, Duanyu Feng, Jimin Huang, Haochen Jia, Qianqian Xie, Yifang Zhang, Weiguang Han, Wei Tian, Hao Wang

Through automated evaluation of current general and legal domain LLMs on our benchmark, we indicate that these LLMs may not align with the logic of legal practice.

Information Retrieval

Empowering Many, Biasing a Few: Generalist Credit Scoring through Large Language Models

1 code implementation1 Oct 2023 Duanyu Feng, Yongfu Dai, Jimin Huang, Yifang Zhang, Qianqian Xie, Weiguang Han, Zhengyu Chen, Alejandro Lopez-Lira, Hao Wang

We then propose the first Credit and Risk Assessment Large Language Model (CALM) by instruction tuning, tailored to the nuanced demands of various financial risk assessment tasks.

Decision Making Language Modelling +1

Cannot find the paper you are looking for? You can Submit a new open access paper.