Search Results for author: Zhaowei Wang

Found 35 papers, 24 papers with code

VL-GenRM: Enhancing Vision-Language Verification via Vision Experts and Iterative Training

no code implementations16 Jun 2025 Jipeng Zhang, Kehao Miao, Renjie Pi, Zhaowei Wang, Runtao Liu, Rui Pan, Tong Zhang

Reinforcement Fine-Tuning (RFT) with verifiable rewards has advanced large language models but remains underexplored for Vision-Language (VL) models.

Hallucination Multimodal Reasoning

PIPE: Physics-Informed Position Encoding for Alignment of Satellite Images and Time Series

no code implementations27 May 2025 Haobo Li, Eunseo Jung, Zixin Chen, Zhaowei Wang, Yueya Wang, Huamin Qu, Alexis Kai Hon Lau

Multimodal time series forecasting is foundational in various fields, such as utilizing satellite imagery and numerical data for predicting typhoons in climate science.

Position Time Series +1

S2LPP: Small-to-Large Prompt Prediction across LLMs

no code implementations26 May 2025 Liang Cheng, Tianyi Li, Zhaowei Wang, Mark Steedman

The performance of pre-trained Large Language Models (LLMs) is often sensitive to nuances in prompt templates, requiring careful prompt engineering, adding costs in terms of computing and human effort.

Natural Language Inference Prediction +2

MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly

1 code implementation15 May 2025 Zhaowei Wang, Wenhao Yu, Xiyu Ren, Jipeng Zhang, Yu Zhao, Rohit Saxena, Liang Cheng, Ginny Wong, Simon See, Pasquale Minervini, Yangqiu Song, Mark Steedman

The rapid extension of context windows in large vision-language models has given rise to long-context vision-language models (LCVLMs), which are capable of handling hundreds of images with interleaved text tokens in a single forward pass.

8k Benchmarking +1

Rethinking Memory in AI: Taxonomy, Operations, Topics, and Future Directions

1 code implementation1 May 2025 Yiming Du, WenYu Huang, Danna Zheng, Zhaowei Wang, Sebastien Montella, Mirella Lapata, Kam-Fai Wong, Jeff Z. Pan

By reframing memory systems through the lens of atomic operations and representation types, this survey provides a structured and dynamic perspective on research, benchmark datasets, and tools related to memory in AI, clarifying the functional interplay in LLMs based agents while outlining promising directions for future research\footnote{The paper list, datasets, methods and tools are available at \href{https://github. com/Elvin-Yiming-Du/Survey_Memory_in_AI}{https://github. com/Elvin-Yiming-Du/Survey\_Memory\_in\_AI}.

Survey

Neutralizing Bias in LLM Reasoning using Entailment Graphs

1 code implementation14 Mar 2025 Liang Cheng, Tianyi Li, Zhaowei Wang, Tianyang Liu, Mark Steedman

Extensive evaluations show that our framework can significantly reduce hallucinations from attestation bias.

counterfactual Counterfactual Reasoning +1

ComparisonQA: Evaluating Factuality Robustness of LLMs Through Knowledge Frequency Control and Uncertainty

1 code implementation28 Dec 2024 Qing Zong, Zhaowei Wang, Tianshi Zheng, Xiyu Ren, Yangqiu Song

In addition, to avoid possible semantic shortcuts, which is a severe problem of current LLMs study, we design a two-round method for knowledge robustness measurement utilizing both correctness and uncertainty.

What Really is Commonsense Knowledge?

no code implementations6 Nov 2024 Quyet V. Do, Junze Li, Tung-Duong Vuong, Zhaowei Wang, Yangqiu Song, Xiaojuan Ma

Commonsense datasets have been well developed in Natural Language Processing, mainly through crowdsource human annotation.

Concept-Reversed Winograd Schema Challenge: Evaluating and Improving Robust Reasoning in Large Language Models via Abstraction

1 code implementation15 Oct 2024 Kaiqiao Han, Tianqing Fang, Zhaowei Wang, Yangqiu Song, Mark Steedman

While Large Language Models (LLMs) have showcased remarkable proficiency in reasoning, there is still a concern about hallucinations and unreliable reasoning issues due to semantic associations and superficial logical chains.

CLLMate: A Multimodal Benchmark for Weather and Climate Events Forecasting

no code implementations27 Sep 2024 Haobo Li, Zhaowei Wang, Jiachen Wang, Yueya Wang, Alexis Kai Hon Lau, Huamin Qu

Our experiments reveal the advantages and limitations of existing MLLMs and the value of CLLMate for the training and benchmarking of the WCEF task.

Articles Benchmarking +1

RoomDiffusion: A Specialized Diffusion Model in the Interior Design Industry

no code implementations5 Sep 2024 Zhaowei Wang, Ying Hao, Hao Wei, Qing Xiao, Lulu Chen, Yulong Li, Yue Yang, Tianyi Li

Recent advancements in text-to-image diffusion models have significantly transformed visual content generation, yet their application in specialized fields such as interior design remains underexplored.

Model Optimization

CodeGraph: Enhancing Graph Reasoning of LLMs with Code

1 code implementation25 Aug 2024 Qiaolong Cai, Zhaowei Wang, Shizhe Diao, James Kwok, Yangqiu Song

Compared to the existing methods, CodeGraph demonstrates strong performance on arithmetic problems in graph tasks and offers a more controllable and interpretable approach to the reasoning process.

ConstraintChecker: A Plugin for Large Language Models to Reason on Commonsense Knowledge Bases

1 code implementation25 Jan 2024 Quyet V. Do, Tianqing Fang, Shizhe Diao, Zhaowei Wang, Yangqiu Song

When considering a new knowledge instance, ConstraintChecker employs a rule-based module to produce a list of constraints, then it uses a zero-shot learning module to check whether this knowledge instance satisfies all constraints.

Prompt Engineering Zero-Shot Learning

CANDLE: Iterative Conceptualization and Instantiation Distillation from Large Language Models for Commonsense Reasoning

2 code implementations14 Jan 2024 Weiqi Wang, Tianqing Fang, Chunyang Li, Haochen Shi, Wenxuan Ding, Baixuan Xu, Zhaowei Wang, Jiaxin Bai, Xin Liu, Jiayang Cheng, Chunkit Chan, Yangqiu Song

The sequential process of conceptualization and instantiation is essential to generalizable commonsense reasoning as it allows the application of existing knowledge to unfamiliar scenarios.

Diversity

AbsPyramid: Benchmarking the Abstraction Ability of Language Models with a Unified Entailment Graph

1 code implementation15 Nov 2023 Zhaowei Wang, Haochen Shi, Weiqi Wang, Tianqing Fang, Hongming Zhang, Sehyun Choi, Xin Liu, Yangqiu Song

Cognitive research indicates that abstraction ability is essential in human intelligence, which remains under-explored in language models.

Benchmarking

Gold: A Global and Local-aware Denoising Framework for Commonsense Knowledge Graph Noise Detection

1 code implementation18 Oct 2023 Zheye Deng, Weiqi Wang, Zhaowei Wang, Xin Liu, Yangqiu Song

Commonsense Knowledge Graphs (CSKGs) are crucial for commonsense reasoning, yet constructing them through human annotations can be costly.

Denoising Knowledge Graphs +1

Label-free Deep Learning Driven Secure Access Selection in Space-Air-Ground Integrated Networks

no code implementations28 Aug 2023 Zhaowei Wang, Zhisheng Yin, Xiucheng Wang, Nan Cheng, Yuan Zhang, Tom H. Luan

Considering the inherent co-channel interference due to spectrum sharing among multi-tier access networks in SAGIN, it can be leveraged to assist the physical layer security among heterogeneous transmissions.

Getting Sick After Seeing a Doctor? Diagnosing and Mitigating Knowledge Conflicts in Event Temporal Reasoning

1 code implementation24 May 2023 Tianqing Fang, Zhaowei Wang, Wenxuan Zhou, Hongming Zhang, Yangqiu Song, Muhao Chen

However, knowledge conflicts arise when there is a mismatch between the actual temporal relations of events in the context and the prior knowledge or biases learned by the model.

counterfactual Data Augmentation +2

COLA: Contextualized Commonsense Causal Reasoning from the Causal Inference Perspective

1 code implementation9 May 2023 Zhaowei Wang, Quyet V. Do, Hongming Zhang, Jiayao Zhang, Weiqi Wang, Tianqing Fang, Yangqiu Song, Ginny Y. Wong, Simon See

This paper proposes a new task to detect commonsense causation between two events in an event sequence (i. e., context), called contextualized commonsense causal reasoning.

Causal Inference CoLA +1

CKBP v2: Better Annotation and Reasoning for Commonsense Knowledge Base Population

1 code implementation20 Apr 2023 Tianqing Fang, Quyet V. Do, Zihao Zheng, Weiqi Wang, Sehyun Choi, Zhaowei Wang, Yangqiu Song

We show that CKBP v2 serves as a challenging and representative evaluation dataset for the CSKB Population task, while its development set aids in selecting a population model that leads to improved knowledge acquisition for downstream commonsense reasoning.

Knowledge Base Population Question Answering

SubeventWriter: Iterative Sub-event Sequence Generation with Coherence Controller

1 code implementation13 Oct 2022 Zhaowei Wang, Hongming Zhang, Tianqing Fang, Yangqiu Song, Ginny Y. Wong, Simon See

In this paper, we propose a new task of sub-event generation for an unseen process to evaluate the understanding of the coherence of sub-event actions and objects.

Legal Element-oriented Modeling with Multi-view Contrastive Learning for Legal Case Retrieval

no code implementations11 Oct 2022 Zhaowei Wang

In addition to general topical relevance, the relevant cases also involve similar situations and legal elements, which can support the judgment of the current case.

Contrastive Learning Language Modelling +1

SeATrans: Learning Segmentation-Assisted diagnosis model via Transformer

no code implementations12 Jun 2022 Junde Wu, Huihui Fang, Fangxin Shang, Dalu Yang, Zhaowei Wang, Jing Gao, Yehui Yang, Yanwu Xu

To model the segmentation-diagnosis interaction, SeA-block first embeds the diagnosis feature based on the segmentation information via the encoder, and then transfers the embedding back to the diagnosis feature space by a decoder.

Decoder Melanoma Diagnosis +2

Learning self-calibrated optic disc and cup segmentation from multi-rater annotations

1 code implementation10 Jun 2022 Junde Wu, Huihui Fang, Fangxin Shang, Zhaowei Wang, Dalu Yang, Wenshuo Zhou, Yehui Yang, Yanwu Xu

In this paper, we propose a novel neural network framework to learn OD/OC segmentation from multi-rater annotations.

Segmentation

Opinions Vary? Diagnosis First!

1 code implementation14 Feb 2022 Junde Wu, Huihui Fang, Dalu Yang, Zhaowei Wang, Wenshuo Zhou, Fangxin Shang, Yehui Yang, Yanwu Xu

Motivated by the observation that OD/OC segmentation is often used for the glaucoma diagnosis clinically, in this paper, we propose a novel strategy to fuse the multi-rater OD/OC segmentation labels via the glaucoma diagnosis performance.

Medical Image Segmentation Segmentation +1

UltraGCN: Ultra Simplification of Graph Convolutional Networks for Recommendation

2 code implementations28 Oct 2021 Kelong Mao, Jieming Zhu, Xi Xiao, Biao Lu, Zhaowei Wang, Xiuqiang He

In this paper, we take one step further to propose an ultra-simplified formulation of GCNs (dubbed UltraGCN), which skips infinite layers of message passing for efficient recommendation.

Collaborative Filtering

SparTerm: Learning Term-based Sparse Representation for Fast Text Retrieval

no code implementations2 Oct 2020 Yang Bai, Xiaoguang Li, Gang Wang, Chaoliang Zhang, Lifeng Shang, Jun Xu, Zhaowei Wang, Fangshan Wang, Qun Liu

Term-based sparse representations dominate the first-stage text retrieval in industrial applications, due to its advantage in efficiency, interpretability, and exact term matching.

Language Modeling Language Modelling +1

Cannot find the paper you are looking for? You can Submit a new open access paper.