Search Results for author: Haitao Li

Found 28 papers, 13 papers with code

PoAct: Policy and Action Dual-Control Agent for Generalized Applications

no code implementations13 Jan 2025 Guozhi Yuan, Youfeng Liu, Jingli Yang, Wei Jia, Kai Lin, Yansong Gao, Shan He, Zilin Ding, Haitao Li

Code Action addresses these issues while also introducing the challenges of a more complex action space and more difficult action organization.

Large Language Model

LegalAgentBench: Evaluating LLM Agents in Legal Domain

1 code implementation23 Dec 2024 Haitao Li, Junjie Chen, Jingli Yang, Qingyao Ai, Wei Jia, Youfeng Liu, Kai Lin, Yueyue Wu, Guozhi Yuan, Yiran Hu, Wuyue Wang, Yiqun Liu, Minlie Huang

Therefore, we propose LegalAgentBench, a comprehensive benchmark specifically designed to evaluate LLM Agents in the Chinese legal domain.

Decision Making

LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods

1 code implementation7 Dec 2024 Haitao Li, Qian Dong, Junjie Chen, Huixue Su, Yujia Zhou, Qingyao Ai, Ziyi Ye, Yiqun Liu

Finally, we provide a detailed analysis of the limitations of LLM judges and discuss potential future directions.

De-biased Multimodal Electrocardiogram Analysis

no code implementations22 Nov 2024 Haitao Li, Ziyu Li, Yiheng Mao, Ziyi Liu, Zhoujian Sun, Zhengxing Huang

We analyzed this phenomenon from a causal perspective in the context of ECG MLLMs and discovered that the confounder, severity of illness, introduces a spurious correlation between the question and answer, leading the model to rely on this spurious correlation and ignore the ECG input.

CalibraEval: Calibrating Prediction Distribution to Mitigate Selection Bias in LLMs-as-Judges

1 code implementation20 Oct 2024 Haitao Li, Junjie Chen, Qingyao Ai, Zhumin Chu, Yujia Zhou, Qian Dong, Yiqun Liu

The use of large language models (LLMs) as automated evaluation tools to assess the quality of generated natural language, known as LLMs-as-Judges, has demonstrated promising capabilities and is rapidly gaining widespread attention.

Fairness Prediction +1

An Automatic and Cost-Efficient Peer-Review Framework for Language Generation Evaluation

no code implementations16 Oct 2024 Junjie Chen, Weihang Su, Zhumin Chu, Haitao Li, Qinyao Ai, Yiqun Liu, Min Zhang, Shaoping Ma

Moreover, our study highlights the impact of prompt strategies and evaluation formats on evaluation performance, offering guidance for method optimization in the future.

Dialogue Generation Question Answering

LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language Models

1 code implementation30 Sep 2024 Haitao Li, You Chen, Qingyao Ai, Yueyue Wu, Ruizhe Zhang, Yiqun Liu

Applying existing LLMs to legal systems without careful evaluation of their potential and limitations could pose significant risks in legal practice.

Fairness

Towards an In-Depth Comprehension of Case Relevance for Better Legal Retrieval

no code implementations1 Apr 2024 Haitao Li, You Chen, Zhekai Ge, Qingyao Ai, Yiqun Liu, Quan Zhou, Shuai Huo

Legal retrieval techniques play an important role in preserving the fairness and equality of the judicial system.

Fairness Learning-To-Rank +2

BLADE: Enhancing Black-box Large Language Models with Small Domain-Specific Models

no code implementations27 Mar 2024 Haitao Li, Qingyao Ai, Jia Chen, Qian Dong, Zhijing Wu, Yiqun Liu, Chong Chen, Qi Tian

However, general LLMs, which are developed on open-domain data, may lack the domain-specific knowledge essential for tasks in vertical domains, such as legal, medical, etc.

Bayesian Optimization

DELTA: Pre-train a Discriminative Encoder for Legal Case Retrieval via Structural Word Alignment

no code implementations27 Mar 2024 Haitao Li, Qingyao Ai, Xinyan Han, Jia Chen, Qian Dong, Yiqun Liu, Chong Chen, Qi Tian

Most of the existing works focus on improving the representation ability for the contextualized embedding of the [CLS] token and calculate relevance using textual semantic similarity.

Retrieval Semantic Similarity +2

Evaluation Ethics of LLMs in Legal Domain

no code implementations17 Mar 2024 Ruizhe Zhang, Haitao Li, Yueyue Wu, Qingyao Ai, Yiqun Liu, Min Zhang, Shaoping Ma

In recent years, the utilization of large language models for natural language dialogue has gained momentum, leading to their widespread adoption across various domains.

Ethics

PRE: A Peer Review Based Large Language Model Evaluator

no code implementations28 Jan 2024 Zhumin Chu, Qingyao Ai, Yiteng Tu, Haitao Li, Yiqun Liu

Existing paradigms rely on either human annotators or model-based evaluators to evaluate the performance of LLMs on different tasks.

Language Modeling Language Modelling +2

Caseformer: Pre-training for Legal Case Retrieval Based on Inter-Case Distinctions

1 code implementation1 Nov 2023 Weihang Su, Qingyao Ai, Yueyue Wu, Yixiao Ma, Haitao Li, Yiqun Liu, Zhijing Wu, Min Zhang

Legal case retrieval aims to help legal workers find relevant cases related to their cases at hand, which is important for the guarantee of fairness and justice in legal judgments.

Fairness Retrieval

LeCaRDv2: A Large-Scale Chinese Legal Case Retrieval Dataset

no code implementations26 Oct 2023 Haitao Li, Yunqiu Shao, Yueyue Wu, Qingyao Ai, Yixiao Ma, Yiqun Liu

However, the development of legal case retrieval technologies in the Chinese legal system is restricted by three problems in existing datasets: limited data size, narrow definitions of legal relevance, and naive candidate pooling strategies used in data sampling.

Fairness Retrieval

Unsupervised Large Language Model Alignment for Information Retrieval via Contrastive Feedback

no code implementations29 Sep 2023 Qian Dong, Yiding Liu, Qingyao Ai, Zhijing Wu, Haitao Li, Yiqun Liu, Shuaiqiang Wang, Dawei Yin, Shaoping Ma

Large language models (LLMs) have demonstrated remarkable capabilities across various research domains, including the field of Information Retrieval (IR).

Data Augmentation Information Retrieval +5

An Intent Taxonomy of Legal Case Retrieval

no code implementations25 Jul 2023 Yunqiu Shao, Haitao Li, Yueyue Wu, Yiqun Liu, Qingyao Ai, Jiaxin Mao, Yixiao Ma, Shaoping Ma

Through a laboratory user study, we reveal significant differences in user behavior and satisfaction under different search intents in legal case retrieval.

Information Retrieval Retrieval +1

I^3 Retriever: Incorporating Implicit Interaction in Pre-trained Language Models for Passage Retrieval

1 code implementation4 Jun 2023 Qian Dong, Yiding Liu, Qingyao Ai, Haitao Li, Shuaiqiang Wang, Yiqun Liu, Dawei Yin, Shaoping Ma

Moreover, the proposed implicit interaction is compatible with special pre-training and knowledge distillation for passage retrieval, which brings a new state-of-the-art performance.

Knowledge Distillation Passage Retrieval +2

THUIR at WSDM Cup 2023 Task 1: Unbiased Learning to Rank

1 code implementation25 Apr 2023 Jia Chen, Haitao Li, Weihang Su, Qingyao Ai, Yiqun Liu

This paper introduces the approaches we have used to participate in the WSDM Cup 2023 Task 1: Unbiased Learning to Rank.

Learning-To-Rank

Constructing Tree-based Index for Efficient and Effective Dense Retrieval

1 code implementation24 Apr 2023 Haitao Li, Qingyao Ai, Jingtao Zhan, Jiaxin Mao, Yiqun Liu, Zheng Liu, Zhao Cao

Unfortunately, while ANN can improve the efficiency of DR models, it usually comes with a significant price on retrieval performance.

Contrastive Learning Retrieval

SAILER: Structure-aware Pre-trained Language Model for Legal Case Retrieval

1 code implementation22 Apr 2023 Haitao Li, Qingyao Ai, Jia Chen, Qian Dong, Yueyue Wu, Yiqun Liu, Chong Chen, Qi Tian

Moreover, in contrast to the general retrieval, the relevance in the legal domain is sensitive to key legal elements.

Language Modeling Language Modelling +1

Towards Better Web Search Performance: Pre-training, Fine-tuning and Learning to Rank

no code implementations28 Feb 2023 Haitao Li, Jia Chen, Weihang Su, Qingyao Ai, Yiqun Liu

This paper describes the approach of the THUIR team at the WSDM Cup 2023 Pre-training for Web Search task.

Learning-To-Rank

Reachability of Dimension-Bounded Linear Systems

no code implementations9 Aug 2021 Yiliang Li, Haitao Li, Jun-e Feng, Jinjin Li

In this paper, the reachability of dimension-bounded linear systems is investigated. Since state dimensions of dimension-bounded linear systems vary with time, the expression of state dimension at each time is provided. A method for judging the reachability of a given vector space is proposed.

Forecasting Popularity of Videos using Social Media

no code implementations22 Mar 2014 Jie Xu, Mihaela van der Schaar, Jiangchuan Liu, Haitao Li

This paper presents a systematic online prediction method (Social-Forecast) that is capable to accurately forecast the popularity of videos promoted by social media.

Prediction

Cannot find the paper you are looking for? You can Submit a new open access paper.