Search Results for author: Xiaozhong Liu

Found 73 papers, 28 papers with code

De-Biased Court's View Generation with Causality

no code implementations EMNLP 2020 Yiquan Wu, Kun Kuang, Yating Zhang, Xiaozhong Liu, Changlong Sun, Jun Xiao, Yueting Zhuang, Luo Si, Fei Wu

Court's view generation is a novel but essential task for legal AI, aiming at improving the interpretability of judgment prediction results and enabling automatic legal document generation.

counterfactual Text Generation

LeCoDe: A Benchmark Dataset for Interactive Legal Consultation Dialogue Evaluation

no code implementations 26 May 2025 Weikang Yuan, Kaisong Song, Zhuoren Jiang, Junjie Cao, Yujie Zhang, Jun Lin, Kun Kuang, Ji Zhang, Xiaozhong Liu

To address these challenges, we introduce LeCoDe, a real-world multi-turn benchmark dataset comprising 3,696 legal consultation dialogues with 110,008 dialogue turns, designed to evaluate and improve LLMs' legal consultation capability.

Dialogue Evaluation

CLaDMoP: Learning Transferrable Models from Successful Clinical Trials via LLMs

no code implementations 24 May 2025 Yiqing Zhang, Xiaozhong Liu, Fabricio Murai

To address this limitation, we introduce CLaDMoP, a new pre-training approach for clinical trial outcome prediction, alongside the Successful Clinical Trials dataset (SCT), specifically designed for this task.

Large Language Model parameter-efficient fine-tuning +1

AppealCase: A Dataset and Benchmark for Civil Case Appeal Scenarios

1 code implementation 22 May 2025 YuTing Huang, Meitong Guo, Yiquan Wu, Ang Li, Xiaozhong Liu, Keting Yin, Changlong Sun, Fei Wu, Kun Kuang

Recent advances in LegalAI have primarily focused on individual case judgment analysis, often overlooking the critical appellate process within the judicial system.

Decision Making Multi-class Classification +2

KEDRec-LM: A Knowledge-distilled Explainable Drug Recommendation Large Language Model

no code implementations 27 Feb 2025 Kai Zhang, Rui Zhu, Shutian Ma, Jingwei Xiong, Yejin Kim, Fabricio Murai, Xiaozhong Liu

Drug discovery is a critical task in biomedical natural language processing (NLP), yet explainable drug discovery remains underexplored.

Drug Discovery Knowledge Graphs +3

Elevating Legal LLM Responses: Harnessing Trainable Logical Structures and Semantic Knowledge with Legal Reasoning

1 code implementation 11 Feb 2025 Rujing Yao, Yang Wu, Chenghao Wang, Jingwei Xiong, Fang Wang, Xiaozhong Liu

Large Language Models (LLMs) have achieved impressive results across numerous domains, yet they experience notable deficiencies in legal question-answering tasks.

Hallucination In-Context Learning +7

Auto-Drafting Police Reports from Noisy ASR Outputs: A Trust-Centered LLM Approach

no code implementations 11 Feb 2025 Param Kulkarni, Yingchi Liu, Hao-Ming Fu, Shaohua Yang, Isuru Gunasekara, Matt Peloquin, Noah Spitzer-Williams, Xiaotian Zhou, Xiaozhong Liu, Zhengping Ji, Yasser Ibrahim

Achieving a delicate balance between fostering trust in law enforcement and protecting the rights of both officers and civilians continues to emerge as a pressing research and product challenge in the world today.

Fairness

Topic-FlipRAG: Topic-Orientated Adversarial Opinion Manipulation Attacks to Retrieval-Augmented Generation Models

no code implementations 3 Feb 2025 Yuyang Gong, Zhuo Chen, Miaokun Chen, Fengchang Yu, Wei Lu, XiaoFeng Wang, Xiaozhong Liu, Jiawei Liu

Retrieval-Augmented Generation (RAG) systems based on Large Language Models (LLMs) have become essential for tasks such as question answering and content generation.

Question Answering RAG +2

MEXA-CTP: Mode Experts Cross-Attention for Clinical Trial Outcome Prediction

1 code implementation 12 Jan 2025 Yiqing Zhang, Xiaozhong Liu, Fabricio Murai

Clinical trials are the gold standard for assessing the effectiveness and safety of drugs for treating diseases.

FlippedRAG: Black-Box Opinion Manipulation Adversarial Attacks to Retrieval-Augmented Generation Models

no code implementations 6 Jan 2025 Zhuo Chen, Jiawei Liu, Yuyang Gong, Miaokun Chen, Haotan Liu, Qikai Cheng, Fan Zhang, Wei Lu, Xiaozhong Liu, XiaoFeng Wang

In this paper, we investigate a more realistic and critical threat scenario: adversarial attacks intended for opinion manipulation against black-box RAG models, particularly on controversial topics.

Adversarial Attack Hallucination +4

Science Out of Its Ivory Tower: Improving Accessibility with Reinforcement Learning

1 code implementation 22 Oct 2024 Haining Wang, Jason Clark, Hannah McKelvey, Leila Sterman, Zheng Gao, Zuoyu Tian, Sandra Kübler, Xiaozhong Liu

A vast amount of scholarly work is published daily, yet much of it remains inaccessible to the general public due to dense jargon and complex language.

Language Modeling Language Modelling +1

A Speaker Turn-Aware Multi-Task Adversarial Network for Joint User Satisfaction Estimation and Sentiment Analysis

no code implementations 12 Oct 2024 Kaisong Song, Yangyang Kang, Jiawei Liu, Xurui Li, Changlong Sun, Xiaozhong Liu

It is observed that whether the user's needs are met often triggers various sentiments, which can be pertinent to the successful estimation of user satisfaction, and vice versa.

Goal-Oriented Dialogue Systems Sentiment Analysis

LLM Cascade with Multi-Objective Optimal Consideration

no code implementations 10 Oct 2024 Kai Zhang, Liqian Peng, Congchao Wang, Alec Go, Xiaozhong Liu

Large Language Models (LLMs) have demonstrated exceptional capabilities in understanding and generating natural language.

Black-Box Opinion Manipulation Attacks to Retrieval-Augmented Generation of Large Language Models

no code implementations 18 Jul 2024 Zhuo Chen, Jiawei Liu, Haotan Liu, Qikai Cheng, Fan Zhang, Wei Lu, Xiaozhong Liu

Retrieval-Augmented Generation (RAG) is applied to solve hallucination problems and real-time constraints of large language models, but it also induces vulnerabilities against retrieval corruption attacks.

Decision Making Hallucination +3

Knowledge-Infused Legal Wisdom: Navigating LLM Consultation through the Lens of Diagnostics and Positive-Unlabeled Reinforcement Learning

no code implementations 5 Jun 2024 Yang Wu, Chenghao Wang, Ece Gumusel, Xiaozhong Liu

The integration of generative Large Language Models (LLMs) into various applications, including the legal domain, has been accelerated by their expansive and versatile nature.

Diagnostic Language Modeling +2

Enhance Robustness of Language Models Against Variation Attack through Graph Integration

no code implementations 18 Apr 2024 Zi Xiong, Lizhi Qing, Yangyang Kang, Jiawei Liu, Hongsong Li, Changlong Sun, Xiaozhong Liu, Wei Lu

The widespread use of pre-trained language models (PLMs) in natural language processing (NLP) has greatly improved performance outcomes.

Diversity Language Modeling +1

From Model-centered to Human-Centered: Revision Distance as a Metric for Text Evaluation in LLMs-based Applications

no code implementations 10 Apr 2024 Yongqiang Ma, Lizhi Qing, Jiawei Liu, Yangyang Kang, Yue Zhang, Wei Lu, Xiaozhong Liu, Qikai Cheng

Therefore, our study shifts the focus from model-centered to human-centered evaluation in the context of AI-powered writing assistance applications.

Personalized LLM Response Generation with Parameterized Memory Injection

1 code implementation 4 Apr 2024 Kai Zhang, Yejin Kim, Xiaozhong Liu

Large Language Models (LLMs) have exhibited remarkable proficiency in comprehending and generating natural language.

Bayesian Optimisation parameter-efficient fine-tuning +1

Empowering Dual-Level Graph Self-Supervised Pretraining with Motif Discovery

1 code implementation 19 Dec 2023 Pengwei Yan, Kaisong Song, Zhuoren Jiang, Yangyang Kang, Tianqianjin Lin, Changlong Sun, Xiaozhong Liu

While self-supervised graph pretraining techniques have shown promising results in various domains, their application still experiences challenges of limited topology learning, human knowledge dependency, and incompetent multi-level interactions.

Representation Learning Transfer Learning

Precedent-Enhanced Legal Judgment Prediction with LLM and Domain-Model Collaboration

no code implementations 13 Oct 2023 Yiquan Wu, Siying Zhou, Yifei Liu, Weiming Lu, Xiaozhong Liu, Yating Zhang, Changlong Sun, Fei Wu, Kun Kuang

Precedents are the previous legal cases with similar facts, which are the basis for the judgment of the subsequent case in national legal systems.

Large Language Model Soft Ideologization via AI-Self-Consciousness

no code implementations 28 Sep 2023 Xiaotian Zhou, Qian Wang, XiaoFeng Wang, Haixu Tang, Xiaozhong Liu

Large language models (LLMs) have demonstrated human-level performance on a vast spectrum of natural language tasks.

Language Modeling Language Modelling +1

Community-Based Hierarchical Positive-Unlabeled (PU) Model Fusion for Chronic Disease Prediction

1 code implementation 6 Sep 2023 Yang Wu, Xurui Li, Xuhong Zhang, Yangyang Kang, Changlong Sun, Xiaozhong Liu

Positive-Unlabeled (PU) Learning is a challenge presented by binary classification problems where there is an abundance of unlabeled data along with a small number of positive data instances, which can be used to address chronic disease screening problem.

Binary Classification Data Augmentation +3

I3: Intent-Introspective Retrieval Conditioned on Instructions

no code implementations 19 Aug 2023 Kaihang Pan, Juncheng Li, Wenjie Wang, Hao Fei, Hongye Song, Wei Ji, Jun Lin, Xiaozhong Liu, Tat-Seng Chua, Siliang Tang

Recent studies indicate that dense retrieval models struggle to perform well on a wide variety of retrieval tasks that lack dedicated training data, as different retrieval tasks often entail distinct search intents.

Retrieval Text-to-Image Generation

Global Structure Knowledge-Guided Relation Extraction Method for Visually-Rich Document

1 code implementation 23 May 2023 Xiangnan Chen, Qian Xiao, Juncheng Li, Duo Dong, Jun Lin, Xiaozhong Liu, Siliang Tang

GOSE initiates by generating preliminary relation predictions on entity pairs extracted from a scanned image of the document.

Relation Relation Extraction

MAWSEO: Adversarial Wiki Search Poisoning for Illicit Online Promotion

no code implementations 22 Apr 2023 Zilong Lin, Zhengyi Li, Xiaojing Liao, XiaoFeng Wang, Xiaozhong Liu

As a prominent instance of vandalism edits, Wiki search poisoning for illicit promotion is a cybercrime in which the adversary aims at editing Wiki articles to promote illicit businesses through Wiki search results of relevant queries.

Multidimensional Perceptron for Efficient and Explainable Long Text Classification

no code implementations 4 Apr 2023 Yexiang Wang, Yating Zhang, Xiaozhong Liu, Changlong Sun

Because of the inevitable cost and complexity of transformer and pre-trained models, efficiency concerns are raised for long text classification.

text-classification Text Classification

AI vs. Human -- Differentiation Analysis of Scientific Content Generation

no code implementations 24 Jan 2023 Yongqiang Ma, Jiawei Liu, Fan Yi, Qikai Cheng, Yong Huang, Wei Lu, Xiaozhong Liu

We find that there exists a "writing style" gap between AI-generated scientific text and human-written scientific text.

Text Detection

Collaborative Intelligence Orchestration: Inconsistency-Based Fusion of Semi-Supervised Learning and Active Learning

no code implementations 7 Jun 2022 Jiannan Guo, Yangyang Kang, Yu Duan, Xiaozhong Liu, Siliang Tang, Wenqiao Zhang, Kun Kuang, Changlong Sun, Fei Wu

Motivated by the industry practice of labeling data, we propose an innovative Inconsistency-based virtual aDvErsarial Active Learning (IDEAL) algorithm to further investigate SSL-AL's potential superiority and achieve mutual enhancement of AL and SSL, i.e., SSL propagates label information to unlabeled samples and provides smoothed embeddings for AL, while AL excludes samples with inconsistent predictions and considerable uncertainty for SSL.

Active Learning

Dialogue Inspectional Summarization with Factual Inconsistency Awareness

no code implementations 5 Nov 2021 Leilei Gan, Yating Zhang, Kun Kuang, Lin Yuan, Shuo Li, Changlong Sun, Xiaozhong Liu, Fei Wu

Dialogue summarization has been extensively studied and applied, where the prior works mainly focused on exploring superior model structures to align the input dialogue and the output summary.

dialogue summary Medical Diagnosis

A Role-Selected Sharing Network for Joint Machine-Human Chatting Handoff and Service Satisfaction Analysis

1 code implementation EMNLP 2021 Jiawei Liu, Kaisong Song, Yangyang Kang, Guoxiu He, Zhuoren Jiang, Changlong Sun, Wei Lu, Xiaozhong Liu

Chatbot is increasingly thriving in different domains, however, because of unexpected discourse complexity and training data sparseness, its potential distrust hatches vital apprehension.

Chatbot Multi-Task Learning

A Neural Conversation Generation Model via Equivalent Shared Memory Investigation

1 code implementation 20 Aug 2021 Changzhen Ji, Yating Zhang, Xiaozhong Liu, Adam Jatowt, Changlong Sun, Conghui Zhu, Tiejun Zhao

Nevertheless, few works utilized the knowledge extracted from similar conversations for utterance generation.

Text Generation

RepSum: Unsupervised Dialogue Summarization based on Replacement Strategy

no code implementations ACL 2021 Xiyan Fu, Yating Zhang, Tianyi Wang, Xiaozhong Liu, Changlong Sun, Zhenglu Yang

In the field of dialogue summarization, due to the lack of training data, it is often difficult for supervised summary generation methods to learn vital information from dialogue context with limited data.

Dialogue Generation Sentence +1

Leveraging Online Shopping Behaviors as a Proxy for Personal Lifestyle Choices: New Insights into Chronic Disease Prevention Literacy

no code implementations 29 Apr 2021 Yongzhen Wang, Xiaozhong Liu, Katy Börner, Jun Lin, Yingnan Ju, Changlong Sun, Luo Si

Objective: Ubiquitous internet access is reshaping the way we live, but it is accompanied by unprecedented challenges in preventing chronic diseases that are usually planted by long exposure to unhealthy lifestyles.

Medical Diagnosis

Community-based Cyberreading for Information Understanding

no code implementations 27 Mar 2021 Zhuoren Jiang, Xiaozhong Liu, Liangcai Gao, Zhi Tang

Although the content in scientific publications is increasingly challenging, it is necessary to investigate another important problem, that of scientific information understanding.

Learning-To-Rank

Chronological Citation Recommendation with Time Preference

no code implementations 19 Jan 2021 Shutian Ma, Heng Zhang, Chengzhi Zhang, Xiaozhong Liu

Citation recommendation is an important task to assist scholars in finding candidate literature to cite.

Citation Recommendation

Topic-Oriented Spoken Dialogue Summarization for Customer Service with Saliency-Aware Topic Modeling

1 code implementation 14 Dec 2020 Yicheng Zou, Lujun Zhao, Yangyang Kang, Jun Lin, Minlong Peng, Zhuoren Jiang, Changlong Sun, Qi Zhang, Xuanjing Huang, Xiaozhong Liu

In a customer service system, dialogue summarization can boost service efficiency by automatically creating summaries for long spoken dialogues in which customers and agents try to address issues about specific topics.

Cross Copy Network for Dialogue Generation

1 code implementation EMNLP 2020 Changzhen Ji, Xin Zhou, Yating Zhang, Xiaozhong Liu, Changlong Sun, Conghui Zhu, Tiejun Zhao

In the past few years, audiences from different fields witness the achievements of sequence-to-sequence models (e.g., LSTM+attention, Pointer Generator Networks, and Transformer) to enhance dialogue content generation.

Dialogue Generation

Predicting Clinical Trial Results by Implicit Evidence Integration

1 code implementation EMNLP 2020 Qiao Jin, Chuanqi Tan, Mosha Chen, Xiaozhong Liu, Songfang Huang

In the CTRP framework, a model takes a PICO-formatted clinical trial proposal with its background as input and predicts the result, i.e. how the Intervention group compares with the Comparison group in terms of the measured Outcome in the studied Population.

PICO

Detecting User Community in Sparse Domain via Cross-Graph Pairwise Learning

no code implementations 6 Sep 2020 Zheng Gao, Hongsong Li, Zhuoren Jiang, Xiaozhong Liu

In this paper, our model, Pairwise Cross-graph Community Detection (PCCD), is proposed to cope with the sparse graph problem by involving external graph knowledge to learn user pairwise community closeness instead of detecting direct communities.

Community Detection

Efficient Personalized Community Detection via Genetic Evolution

no code implementations 6 Sep 2020 Zheng Gao, Chun Guo, Xiaozhong Liu

Personalized community detection aims to generate communities associated with user need on graphs, which benefits many downstream tasks such as node recommendation and link prediction for users, etc.

Community Detection Link Prediction

Camouflaged Chinese Spam Content Detection with Semi-supervised Generative Active Learning

no code implementations ACL 2020 Zhuoren Jiang, Zhe Gao, Yu Duan, Yangyang Kang, Changlong Sun, Qiong Zhang, Xiaozhong Liu

We propose a Semi-supervIsed GeNerative Active Learning (SIGNAL) model to address the imbalance, efficiency, and text camouflage problems of Chinese text spam detection task.

Active Learning Chinese Spam Detection +3

Cross-Lingual Low-Resource Set-to-Description Retrieval for Global E-Commerce

1 code implementation 17 May 2020 Juntao Li, Chang Liu, Jian Wang, Lidong Bing, Hongsong Li, Xiaozhong Liu, Dongyan Zhao, Rui Yan

We manually collect a new and high-quality paired dataset, where each pair contains an unordered product attribute set in the source language and an informative product description in the target language.

Attribute Cross-Lingual Information Retrieval +1

Masking Orchestration: Multi-task Pretraining for Multi-role Dialogue Representation Learning

2 code implementations 27 Feb 2020 Tianyi Wang, Yating Zhang, Xiaozhong Liu, Changlong Sun, Qiong Zhang

Multi-role dialogue understanding comprises a wide range of diverse tasks such as question answering, act classification, dialogue summarization etc.

Dialogue Understanding Question Answering +1

Read Beyond the Lines: Understanding the Implied Textual Meaning via a Skim and Intensive Reading Model

no code implementations 3 Jan 2020 Guoxiu He, Zhe Gao, Zhuoren Jiang, Yangyang Kang, Changlong Sun, Xiaozhong Liu, Wei Lu

The nonliteral interpretation of a text is hard to be understood by machine models due to its high context-sensitivity and heavy usage of figurative language.

Reading Comprehension Sentence

Using Customer Service Dialogues for Satisfaction Analysis with Context-Assisted Multiple Instance Learning

no code implementations IJCNLP 2019 Kaisong Song, Lidong Bing, Wei Gao, Jun Lin, Lujun Zhao, Jiancheng Wang, Changlong Sun, Xiaozhong Liu, Qiong Zhang

Customers ask questions and customer service staffs answer their questions, which is the basic service model via multi-turn customer service (CS) dialogues on E-commerce platforms.

Multiple Instance Learning

Uncover Sexual Harassment Patterns from Personal Stories by Joint Key Element Extraction and Categorization

no code implementations IJCNLP 2019 Yingchi Liu, Quanzhi Li, Marika Cifor, Xiaozhong Liu, Qiong Zhang, Luo Si

Sexual harassment occurred in a variety of situations, and categorization of the stories and extraction of their key elements will provide great help for the related parties to understand and address sexual harassment.

AMAD: Adversarial Multiscale Anomaly Detection on High-Dimensional and Time-Evolving Categorical Data

no code implementations 12 Jul 2019 Zheng Gao, Lin Guo, Chi Ma, Xiao Ma, Kai Sun, Hang Xiang, Xiaoqiang Zhu, Hongsong Li, Xiaozhong Liu

Anomaly detection is facing with emerging challenges in many important industry domains, such as cyber security and online recommendation and advertising.

Anomaly Detection

Aspect Sentiment Classification Towards Question-Answering with Reinforced Bidirectional Attention Network

no code implementations ACL 2019 Jingjing Wang, Changlong Sun, Shoushan Li, Xiaozhong Liu, Luo Si, Min Zhang, Guodong Zhou

This paper extends the research to interactive reviews and proposes a new research task, namely Aspect Sentiment Classification towards Question-Answering (ASC-QA), for real-world applications.

General Classification Question Answering +2

Neural Related Work Summarization with a Joint Context-driven Attention Mechanism

1 code implementation EMNLP 2018 Yongzhen Wang, Xiaozhong Liu, Zheng Gao

Conventional solutions to automatic related work summarization rely heavily on human-engineered features.

Cross-language Citation Recommendation via Hierarchical Representation Learning on Heterogeneous Graph

1 code implementation 31 Dec 2018 Zhuoren Jiang, Yue Yin, Liangcai Gao, Yao Lu, Xiaozhong Liu

While the volume of scholarly publications has increased at a frenetic pace, accessing and consuming the useful candidate papers, in very large digital libraries, is becoming an essential and challenging task for scholars.

Citation Recommendation Representation Learning

edge2vec: Representation learning using edge semantics for biomedical knowledge discovery

1 code implementation 7 Sep 2018 Zheng Gao, Gang Fu, Chunping Ouyang, Satoshi Tsutsui, Xiaozhong Liu, Jeremy Yang, Christopher Gessner, Brian Foote, David Wild, Qi Yu, Ying Ding

We propose this method for its added value relative to existing graph analytical methodology, and in the real world context of biomedical knowledge discovery applicability.

Biomedical Information Retrieval Information Retrieval +3
