Search Results for author: Jinghan Zhang

Found 18 papers, 8 papers with code

Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging

1 code implementation8 May 2025 Shiqi Chen, Jinghan Zhang, Tongyao Zhu, Wei Liu, Siyang Gao, Miao Xiong, Manling Li, Junxian He

Vision-Language Models (VLMs) combine visual perception with the general capabilities, such as reasoning, of Large Language Models (LLMs).

Entropy-based Exploration Conduction for Multi-step Reasoning

no code implementations20 Mar 2025 Jinghan Zhang, Xiting Wang, Fengran Mo, Yeyang Zhou, Wanfu Gao, Kunpeng Liu

In large language model (LLM) reasoning, multi-step processes have proven effective for solving complex tasks.

Language Modeling Language Modelling +1

Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas

1 code implementation3 Mar 2025 Shiqi Chen, Tongyao Zhu, Ruochen Zhou, Jinghan Zhang, Siyang Gao, Juan Carlos Niebles, Mor Geva, Junxian He, Jiajun Wu, Manling Li

By tracing attention distribution over the image through out intermediate layers, we observe that successful spatial reasoning correlates strongly with the model's ability to align its attention distribution with actual object locations, particularly differing between familiar and unfamiliar spatial relationships.

Spatial Reasoning

Diversity-Oriented Data Augmentation with Large Language Models

no code implementations17 Feb 2025 Zaitian Wang, Jinghan Zhang, Xinhao Zhang, Kunpeng Liu, Pengfei Wang, Yuanchun Zhou

Data augmentation is an essential technique in natural language processing (NLP) for enriching training datasets by generating diverse samples.

Data Augmentation Diversity

LEKA:LLM-Enhanced Knowledge Augmentation

no code implementations29 Jan 2025 Xinhao Zhang, Jinghan Zhang, Fengran Mo, Dongjie Wang, Yanjie Fu, Kunpeng Liu

Therefore, we design a knowledge augmentation method LEKA for knowledge transfer that actively searches for suitable knowledge sources that can enrich the target domain's knowledge.

Decision Making Transfer Learning

Scoring with Large Language Models: A Study on Measuring Empathy of Responses in Dialogues

1 code implementation28 Dec 2024 Henry J. Xie, Jinghan Zhang, Xinhao Zhang, Kunpeng Liu

We develop a novel and comprehensive framework for investigating how effective LLMs are at measuring and scoring empathy of responses in dialogues, and what methods can be employed to deepen our understanding of LLM scoring.

feature selection

Dynamic Weight Adjusting Deep Q-Networks for Real-Time Environmental Adaptation

no code implementations4 Nov 2024 Xinhao Zhang, Jinghan Zhang, Wujun Si, Kunpeng Liu

Deep Reinforcement Learning has shown excellent performance in generating efficient solutions for complex tasks.

Deep Reinforcement Learning

Enhancing Risk Assessment in Transformers with Loss-at-Risk Functions

no code implementations4 Nov 2024 Jinghan Zhang, Henry Xie, Xinhao Zhang, Kunpeng Liu

In the financial field, precise risk assessment tools are essential for decision-making.

Decision Making

Thought Space Explorer: Navigating and Expanding Thought Space for Large Language Model Reasoning

no code implementations31 Oct 2024 Jinghan Zhang, Fengran Mo, Xiting Wang, Kunpeng Liu

Recent advances in large language models (LLMs) have demonstrated their potential in handling complex reasoning tasks, which are usually achieved by constructing a thought chain to guide the model to solve the problem with multi-step thinking.

Language Modeling Language Modelling +1

Prototypical Reward Network for Data-Efficient RLHF

no code implementations6 Jun 2024 Jinghan Zhang, Xiting Wang, Yiqiao Jin, Changyu Chen, Xinhao Zhang, Kunpeng Liu

The reward model for Reinforcement Learning from Human Feedback (RLHF) has proven effective in fine-tuning Large Language Models (LLMs).

RATT: A Thought Structure for Coherent and Correct LLM Reasoning

1 code implementation4 Jun 2024 Jinghan Zhang, Xiting Wang, Weijieying Ren, Lu Jiang, Dongjie Wang, Kunpeng Liu

To address these limitations, we introduce the Retrieval Augmented Thought Tree (RATT), a novel thought structure that considers both overall logical soundness and factual correctness at each step of the thinking process.

Decision Making Fact Checking +3

Dynamic and Adaptive Feature Generation with LLM

no code implementations4 Jun 2024 Xinhao Zhang, Jinghan Zhang, Banafsheh Rekabdar, Yuanchun Zhou, Pengfei Wang, Kunpeng Liu

The representation of feature space is a crucial environment where data points get vectorized and embedded for upcoming modeling.

Automated Feature Engineering Feature Engineering

Compression Represents Intelligence Linearly

1 code implementation15 Apr 2024 Yuzhen Huang, Jinghan Zhang, Zifei Shan, Junxian He

We open-source our compression datasets as well as our data collection pipelines to facilitate future researchers to assess compression properly.

Language Modeling Language Modelling +1

Composing Parameter-Efficient Modules with Arithmetic Operations

2 code implementations26 Jun 2023 Jinghan Zhang, Shiqi Chen, Junteng Liu, Junxian He

In this paper, we propose to compose these parameter-efficient modules through linear arithmetic operations in the weight space, thereby integrating different module capabilities.

Language Modeling Language Modelling +2

C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models

1 code implementation NeurIPS 2023 Yuzhen Huang, Yuzhuo Bai, Zhihao Zhu, Junlei Zhang, Jinghan Zhang, Tangjun Su, Junteng Liu, Chuancheng Lv, Yikai Zhang, Jiayi Lei, Yao Fu, Maosong Sun, Junxian He

We present C-Eval, the first comprehensive Chinese evaluation suite designed to assess advanced knowledge and reasoning abilities of foundation models in a Chinese context.

Multiple-choice

Cannot find the paper you are looking for? You can Submit a new open access paper.