Search Results for author: Lang Cao

Found 17 papers, 4 papers with code

Bingo: Boosting Efficient Reasoning of LLMs via Dynamic and Significance-based Reinforcement Learning

no code implementations9 Jun 2025 Hanbing Liu, Lang Cao, Yuanyi Ren, Mengyu Zhou, Haoyu Dong, Xiaojun Ma, Shi Han, Dongmei Zhang

Large language models have demonstrated impressive reasoning capabilities, yet they often suffer from inefficiencies due to unnecessarily verbose or redundant outputs.

Reinforcement Learning (RL)

Fortune: Formula-Driven Reinforcement Learning for Symbolic Table Reasoning in Language Models

no code implementations29 May 2025 Lang Cao, Jingxian Xu, Hanbing Liu, Jinyu Wang, Mengyu Zhou, Haoyu Dong, Shi Han, Dongmei Zhang

In this paper, we propose Formula Tuning (Fortune), a reinforcement learning (RL) framework that trains LMs to generate executable spreadsheet formulas for question answering over general tabular data.

Question Answering Reinforcement Learning (RL)

Process Reward Modeling with Entropy-Driven Uncertainty

no code implementations28 Mar 2025 Lang Cao, Renhong Chen, Yingtian Zou, Chao Peng, Wu Ning, Huacong Xu, Qian Chen, Yuxian Wang, Peishuo Su, Mofan Peng, Zijie Chen, Yitong Li

This paper presents the Entropy-Driven Unified Process Reward Model (EDU-PRM), a novel framework that approximates state-of-the-art performance in process supervision while drastically reducing training costs.

TablePilot: Recommending Human-Preferred Tabular Data Analysis with Large Language Models

no code implementations17 Mar 2025 Deyin Yi, Yihao Liu, Lang Cao, Mengyu Zhou, Haoyu Dong, Shi Han, Dongmei Zhang

Tabular data analysis is crucial in many scenarios, yet efficiently identifying the most relevant data analysis queries and results for a new table remains a significant challenge.

DeepRetrieval: Hacking Real Search Engines and Retrievers with Large Language Models via Reinforcement Learning

1 code implementation28 Feb 2025 Pengcheng Jiang, Jiacheng Lin, Lang Cao, Runchu Tian, SeongKu Kang, Zifeng Wang, Jimeng Sun, Jiawei Han

We introduce DeepRetrieval, a reinforcement learning (RL) approach that trains LLMs for query generation through trial and error without supervised data (reference query).

Information Retrieval reinforcement-learning +3

RAS: Retrieval-And-Structuring for Knowledge-Intensive LLM Generation

2 code implementations16 Feb 2025 Pengcheng Jiang, Lang Cao, Ruike Zhu, Minhao Jiang, Yunyi Zhang, Jimeng Sun, Jiawei Han

Large language models (LLMs) have achieved impressive performance on knowledge-intensive tasks, yet they often struggle with multi-step reasoning due to the unstructured nature of retrieved context.

graph construction Knowledge Graphs +5

TableMaster: A Recipe to Advance Table Understanding with Language Models

no code implementations31 Jan 2025 Lang Cao, Hanbing Liu

While current language models (LMs) excel at many text-based tasks, they still face challenges in table understanding due to the complex characteristics of tabular data, such as their structured nature.

Accelerating Clinical Evidence Synthesis with Large Language Models

no code implementations25 Jun 2024 Zifeng Wang, Lang Cao, Benjamin Danek, Qiao Jin, Zhiyong Lu, Jimeng Sun

Here, we introduce TrialMind, a generative artificial intelligence (AI) pipeline for facilitating human-AI collaboration in three crucial tasks for evidence synthesis: study search, screening, and data extraction.

Language Modelling

KG-FIT: Knowledge Graph Fine-Tuning Upon Open-World Knowledge

1 code implementation26 May 2024 Pengcheng Jiang, Lang Cao, Cao Xiao, Parminder Bhatia, Jimeng Sun, Jiawei Han

Knowledge Graph Embedding (KGE) techniques are crucial in learning compact representations of entities and relations within a knowledge graph, facilitating efficient reasoning and knowledge discovery.

Informativeness Knowledge Graph Embedding +4

PILOT: Legal Case Outcome Prediction with Case Law

no code implementations28 Jan 2024 Lang Cao, Zifeng Wang, Cao Xiao, Jimeng Sun

We demonstrate the importance of accurately identifying precedent cases and mitigating the temporal shift when making predictions for case law, as our method shows a significant improvement over the prior methods that focus on civil law case outcome predictions.

Decision Making Prediction +1

Learn to Refuse: Making Large Language Models More Controllable and Reliable through Knowledge Scope Limitation and Refusal Mechanism

no code implementations2 Nov 2023 Lang Cao

To achieve this, we utilize a structured knowledge base to represent all the LLM's understanding of the world, enabling it to provide traceable gold knowledge.

Hallucination Misinformation +1

AutoAM: An End-To-End Neural Model for Automatic and Universal Argument Mining

no code implementations17 Sep 2023 Lang Cao

Argument mining is to analyze argument structure and extract important argument information from unstructured text.

Argument Mining

GraphReason: Enhancing Reasoning Capabilities of Large Language Models through A Graph-Based Verification Approach

no code implementations18 Aug 2023 Lang Cao

By evaluating these graphs, models can yield more accurate and reliable results. Our experimental results show that our graph-based verification method not only significantly enhances the reasoning abilities of LLMs but also outperforms existing verifier methods in terms of improving these models' reasoning performance.

Math

DiagGPT: An LLM-based and Multi-agent Dialogue System with Automatic Topic Management for Flexible Task-Oriented Dialogue

no code implementations15 Aug 2023 Lang Cao

A significant application of Large Language Models (LLMs), like ChatGPT, is their deployment as chat agents, which respond to human inquiries across a variety of domains.

Chatbot Diagnostic +2

Cannot find the paper you are looking for? You can Submit a new open access paper.