no code implementations • 13 Jan 2025 • Guozhi Yuan, Youfeng Liu, Jingli Yang, Wei Jia, Kai Lin, Yansong Gao, Shan He, Zilin Ding, Haitao Li
Code Action addresses these issues while also introducing the challenges of a more complex action space and more difficult action organization.
1 code implementation • 23 Dec 2024 • Haitao Li, Junjie Chen, Jingli Yang, Qingyao Ai, Wei Jia, Youfeng Liu, Kai Lin, Yueyue Wu, Guozhi Yuan, Yiran Hu, Wuyue Wang, Yiqun Liu, Minlie Huang
Therefore, we propose LegalAgentBench, a comprehensive benchmark specifically designed to evaluate LLM Agents in the Chinese legal domain.
1 code implementation • 7 Dec 2024 • Haitao Li, Qian Dong, Junjie Chen, Huixue Su, Yujia Zhou, Qingyao Ai, Ziyi Ye, Yiqun Liu
Finally, we provide a detailed analysis of the limitations of LLM judges and discuss potential future directions.
no code implementations • 22 Nov 2024 • Haitao Li, Ziyu Li, Yiheng Mao, Ziyi Liu, Zhoujian Sun, Zhengxing Huang
We analyzed this phenomenon from a causal perspective in the context of ECG MLLMs and discovered that the confounder, severity of illness, introduces a spurious correlation between the question and answer, leading the model to rely on this spurious correlation and ignore the ECG input.
1 code implementation • 19 Nov 2024 • Zhengyao Ding, Yujian Hu, Youyao Xu, Chengchen Zhao, Ziyu Li, Yiheng Mao, Haitao Li, Qian Li, Jing Wang, Yue Chen, Mengjia Chen, Longbo Wang, Xuesen Chu, Weichao Pan, Ziyi Liu, Fei Wu, HongKun Zhang, Ting Chen, Zhengxing Huang
Cardiovascular diseases (CVDs) present significant challenges for early and accurate diagnosis.
1 code implementation • 20 Oct 2024 • Haitao Li, Junjie Chen, Qingyao Ai, Zhumin Chu, Yujia Zhou, Qian Dong, Yiqun Liu
The use of large language models (LLMs) as automated evaluation tools to assess the quality of generated natural language, known as LLMs-as-Judges, has demonstrated promising capabilities and is rapidly gaining widespread attention.
no code implementations • 16 Oct 2024 • Junjie Chen, Weihang Su, Zhumin Chu, Haitao Li, Qinyao Ai, Yiqun Liu, Min Zhang, Shaoping Ma
Moreover, our study highlights the impact of prompt strategies and evaluation formats on evaluation performance, offering guidance for method optimization in the future.
1 code implementation • 30 Sep 2024 • Haitao Li, You Chen, Qingyao Ai, Yueyue Wu, Ruizhe Zhang, Yiqun Liu
Applying existing LLMs to legal systems without careful evaluation of their potential and limitations could pose significant risks in legal practice.
no code implementations • 1 Apr 2024 • Haitao Li, You Chen, Zhekai Ge, Qingyao Ai, Yiqun Liu, Quan Zhou, Shuai Huo
Legal retrieval techniques play an important role in preserving the fairness and equality of the judicial system.
no code implementations • 27 Mar 2024 • Haitao Li, Qingyao Ai, Jia Chen, Qian Dong, Zhijing Wu, Yiqun Liu, Chong Chen, Qi Tian
However, general LLMs, which are developed on open-domain data, may lack the domain-specific knowledge essential for tasks in vertical domains, such as legal, medical, etc.
no code implementations • 27 Mar 2024 • Haitao Li, Qingyao Ai, Xinyan Han, Jia Chen, Qian Dong, Yiqun Liu, Chong Chen, Qi Tian
Most of the existing works focus on improving the representation ability for the contextualized embedding of the [CLS] token and calculate relevance using textual semantic similarity.
no code implementations • 17 Mar 2024 • Ruizhe Zhang, Haitao Li, Yueyue Wu, Qingyao Ai, Yiqun Liu, Min Zhang, Shaoping Ma
In recent years, the utilization of large language models for natural language dialogue has gained momentum, leading to their widespread adoption across various domains.
no code implementations • 28 Jan 2024 • Zhumin Chu, Qingyao Ai, Yiteng Tu, Haitao Li, Yiqun Liu
Existing paradigms rely on either human annotators or model-based evaluators to evaluate the performance of LLMs on different tasks.
1 code implementation • 1 Nov 2023 • Weihang Su, Qingyao Ai, Yueyue Wu, Yixiao Ma, Haitao Li, Yiqun Liu, Zhijing Wu, Min Zhang
Legal case retrieval aims to help legal workers find relevant cases related to their cases at hand, which is important for the guarantee of fairness and justice in legal judgments.
no code implementations • 26 Oct 2023 • Haitao Li, Yunqiu Shao, Yueyue Wu, Qingyao Ai, Yixiao Ma, Yiqun Liu
However, the development of legal case retrieval technologies in the Chinese legal system is restricted by three problems in existing datasets: limited data size, narrow definitions of legal relevance, and naive candidate pooling strategies used in data sampling.
no code implementations • 16 Oct 2023 • Peng Wen, Junhu Zhang, Haitao Li
Our proposed method significantly enhances the detection capability of mask-wearing.
no code implementations • 29 Sep 2023 • Qian Dong, Yiding Liu, Qingyao Ai, Zhijing Wu, Haitao Li, Yiqun Liu, Shuaiqiang Wang, Dawei Yin, Shaoping Ma
Large language models (LLMs) have demonstrated remarkable capabilities across various research domains, including the field of Information Retrieval (IR).
no code implementations • 25 Jul 2023 • Yunqiu Shao, Haitao Li, Yueyue Wu, Yiqun Liu, Qingyao Ai, Jiaxin Mao, Yixiao Ma, Shaoping Ma
Through a laboratory user study, we reveal significant differences in user behavior and satisfaction under different search intents in legal case retrieval.
1 code implementation • 4 Jun 2023 • Qian Dong, Yiding Liu, Qingyao Ai, Haitao Li, Shuaiqiang Wang, Yiqun Liu, Dawei Yin, Shaoping Ma
Moreover, the proposed implicit interaction is compatible with special pre-training and knowledge distillation for passage retrieval, which brings a new state-of-the-art performance.
2 code implementations • 11 May 2023 • Haitao Li, Weihang Su, Changyue Wang, Yueyue Wu, Qingyao Ai, Yiqun Liu
Legal case retrieval techniques play an essential role in modern intelligent legal systems.
2 code implementations • 11 May 2023 • Haitao Li, Changyue Wang, Weihang Su, Yueyue Wu, Qingyao Ai, Yiqun Liu
This paper describes the approach of the THUIR team at the COLIEE 2023 Legal Case Entailment task.
1 code implementation • 25 Apr 2023 • Jia Chen, Haitao Li, Weihang Su, Qingyao Ai, Yiqun Liu
This paper introduces the approaches we have used to participate in the WSDM Cup 2023 Task 1: Unbiased Learning to Rank.
1 code implementation • 24 Apr 2023 • Haitao Li, Qingyao Ai, Jingtao Zhan, Jiaxin Mao, Yiqun Liu, Zheng Liu, Zhao Cao
Unfortunately, while ANN can improve the efficiency of DR models, it usually comes with a significant price on retrieval performance.
1 code implementation • 22 Apr 2023 • Haitao Li, Qingyao Ai, Jia Chen, Qian Dong, Yueyue Wu, Yiqun Liu, Chong Chen, Qi Tian
Moreover, in contrast to the general retrieval, the relevance in the legal domain is sensitive to key legal elements.
1 code implementation • 7 Apr 2023 • Xiaohui Xie, Qian Dong, Bingning Wang, Feiyang Lv, Ting Yao, Weinan Gan, Zhijing Wu, Xiangsheng Li, Haitao Li, Yiqun Liu, Jin Ma
T2Ranking comprises more than 300K queries and over 2M unique passages from real-world search engines.
no code implementations • 28 Feb 2023 • Haitao Li, Jia Chen, Weihang Su, Qingyao Ai, Yiqun Liu
This paper describes the approach of the THUIR team at the WSDM Cup 2023 Pre-training for Web Search task.
no code implementations • 9 Aug 2021 • Yiliang Li, Haitao Li, Jun-e Feng, Jinjin Li
In this paper, the reachability of dimension-bounded linear systems is investigated. Since state dimensions of dimension-bounded linear systems vary with time, the expression of state dimension at each time is provided. A method for judging the reachability of a given vector space is proposed.
no code implementations • 22 Mar 2014 • Jie Xu, Mihaela van der Schaar, Jiangchuan Liu, Haitao Li
This paper presents a systematic online prediction method (Social-Forecast) that is capable to accurately forecast the popularity of videos promoted by social media.