Search Results for author: Zheheng Luo

Found 14 papers, 5 papers with code

Process-based Self-Rewarding Language Models

1 code implementation5 Mar 2025 Shimao Zhang, Xiao Liu, Xin Zhang, Junxiao Liu, Zheheng Luo, ShuJian Huang, Yeyun Gong

Human-annotated preference data is used for training to further improve LLMs' performance, which is constrained by the upper limit of human performance.

Mathematical Reasoning

Velocitune: A Velocity-based Dynamic Domain Reweighting Method for Continual Pre-training

no code implementations21 Nov 2024 Zheheng Luo, Xin Zhang, Xiao Liu, Haoling Li, Yeyun Gong, Chen Qi, Peng Cheng

To evaluate the effectiveness of Velocitune, we conduct experiments in a reasoning-focused dataset with CodeLlama, as well as in a corpus specialised for system command generation with Llama3 and Mistral.

Math

Are Large Language Models True Healthcare Jacks-of-All-Trades? Benchmarking Across Health Professions Beyond Physician Exams

1 code implementation17 Jun 2024 Zheheng Luo, Chenhan Yuan, Qianqian Xie, Sophia Ananiadou

To fill this research gap, we introduce the Examinations for Medical Personnel in Chinese (EMPEC), a pioneering large-scale healthcare knowledge benchmark in traditional Chinese.

All Benchmarking +1

Factual Consistency Evaluation of Summarisation in the Era of Large Language Models

no code implementations21 Feb 2024 Zheheng Luo, Qianqian Xie, Sophia Ananiadou

Experiments on TreatFact suggest that both previous methods and LLM-based evaluators are unable to capture factual inconsistencies in clinical summaries, posing a new challenge for FC evaluation.

Articles Misinformation

The Lay Person's Guide to Biomedicine: Orchestrating Large Language Models

no code implementations21 Feb 2024 Zheheng Luo, Qianqian Xie, Sophia Ananiadou

Moreover, automated methods that can effectively assess the `layness' of generated summaries are lacking.

Articles Text Simplification

FinBen: A Holistic Financial Benchmark for Large Language Models

2 code implementations20 Feb 2024 Qianqian Xie, Weiguang Han, Zhengyu Chen, Ruoyu Xiang, Xiao Zhang, Yueru He, Mengxi Xiao, Dong Li, Yongfu Dai, Duanyu Feng, Yijing Xu, Haoqiang Kang, Ziyan Kuang, Chenhan Yuan, Kailai Yang, Zheheng Luo, Tianlin Zhang, Zhiwei Liu, Guojun Xiong, Zhiyang Deng, Yuechen Jiang, Zhiyuan Yao, Haohang Li, Yangyang Yu, Gang Hu, Jiajia Huang, Xiao-Yang Liu, Alejandro Lopez-Lira, Benyou Wang, Yanzhao Lai, Hao Wang, Min Peng, Sophia Ananiadou, Jimin Huang

Our evaluation of 15 representative LLMs, including GPT-4, ChatGPT, and the latest Gemini, reveals several key findings: While LLMs excel in IE and textual analysis, they struggle with advanced reasoning and complex tasks like text generation and forecasting.

Question Answering RAG +3

Overview of the BioLaySumm 2023 Shared Task on Lay Summarization of Biomedical Research Articles

no code implementations29 Sep 2023 Tomas Goldsack, Zheheng Luo, Qianqian Xie, Carolina Scarton, Matthew Shardlow, Sophia Ananiadou, Chenghua Lin

This paper presents the results of the shared task on Lay Summarisation of Biomedical Research Articles (BioLaySumm), hosted at the BioNLP Workshop at ACL 2023.

Articles Lay Summarization

Graph Contrastive Topic Model

1 code implementation5 Jul 2023 Zheheng Luo, Lei Liu, Qianqian Xie, Sophia Ananiadou

Based on it, we propose the graph contrastive topic model (GCTM), which conducts graph contrastive learning (GCL) using informative positive and negative samples that are generated by the graph-based sampling strategy leveraging in-depth correlation and irrelevance among documents and words.

Contrastive Learning model +1

A Survey for Biomedical Text Summarization: From Pre-trained to Large Language Models

no code implementations18 Apr 2023 Qianqian Xie, Zheheng Luo, Benyou Wang, Sophia Ananiadou

In this paper, we present a systematic review of recent advancements in BTS, leveraging cutting-edge NLP techniques from PLMs to LLMs, to help understand the latest progress, challenges, and future directions.

Information Retrieval Language Modelling +4

ChatGPT as a Factual Inconsistency Evaluator for Text Summarization

no code implementations27 Mar 2023 Zheheng Luo, Qianqian Xie, Sophia Ananiadou

In this paper, we particularly explore ChatGPT's ability to evaluate factual inconsistency under a zero-shot setting by examining it on both coarse-grained and fine-grained evaluation tasks including binary entailment inference, summary ranking, and consistency rating.

Abstractive Text Summarization Natural Language Inference +3

CitationSum: Citation-aware Graph Contrastive Learning for Scientific Paper Summarization

no code implementations26 Jan 2023 Zheheng Luo, Qianqian Xie, Sophia Ananiadou

To fill that gap, we propose a novel citation-aware scientific paper summarization framework based on citation graphs, able to accurately locate and incorporate the salient contents from references, as well as capture varying relevance between source papers and their references.

Contrastive Learning Text Summarization

Readability Controllable Biomedical Document Summarization

no code implementations10 Oct 2022 Zheheng Luo, Qianqian Xie, Sophia Ananiadou

Different from general documents, it is recognised that the ease with which people can understand a biomedical text is eminently varied, owing to the highly technical nature of biomedical documents and the variance of readers' domain knowledge.

Document Summarization Extractive Summarization +1

Cannot find the paper you are looking for? You can Submit a new open access paper.