Search Results for author: Kai Xiong

Found 20 papers, 7 papers with code

Beyond Similarity: A Gradient-based Graph Method for Instruction Tuning Data Selection

no code implementations16 Feb 2025 Yang Zhao, Li Du, Xiao Ding, Yangou Ouyang, Hepeng Wang, Kai Xiong, Jinglong Gao, Zhouhao Sun, Dongliang Xu, Yang Qing, Dongchen Li, Bing Qin, Ting Liu

Large language models (LLMs) have shown great potential across various industries due to their remarkable ability to generalize through instruction tuning.

Domain Adaptation Transfer Learning

Efficient and Accurate Prompt Optimization: the Benefit of Memory in Exemplar-Guided Reflection

no code implementations12 Nov 2024 Cilin Yan, Jingyun Wang, Lin Zhang, Ruihui Zhao, Xiaopu Wu, Kai Xiong, Qingsong Liu, Guoliang Kang, Yangyang Kang

In this work, we propose an Exemplar-Guided Reflection with Memory mechanism (ERM) to realize more efficient and accurate prompt optimization.

Prompt Engineering

Supervised Fine-Tuning Achieve Rapid Task Adaption Via Alternating Attention Head Activation Patterns

no code implementations24 Sep 2024 Yang Zhao, Li Du, Xiao Ding, Kai Xiong, Ting Liu, Bing Qin

We find that: (1) LLMs selectively activate task-specific attention heads during SFT; (2) activation patterns for complex tasks are combinations of basic task patterns; and (3) changes in a few parameters can significantly impact activation patterns after SFT on a small number of samples. Based on these insights, experiments are conducted to actually enhance the efficiency and effectiveness of SFT.

Diagnosing and Remedying Knowledge Deficiencies in LLMs via Label-free Curricular Meaningful Learning

no code implementations21 Aug 2024 Kai Xiong, Xiao Ding, Li Du, Jiahao Ying, Ting Liu, Bing Qin, Yixin Cao

This makes it a challenge to diagnose and remedy the deficiencies of LLMs through rich label-free user queries.

Diagnostic

Meaningful Learning: Enhancing Abstract Reasoning in Large Language Models via Generic Fact Guidance

1 code implementation14 Mar 2024 Kai Xiong, Xiao Ding, Ting Liu, Bing Qin, Dongliang Xu, Qing Yang, Hongtao Liu, Yixin Cao

The results show that our approach not only boosts the general reasoning performance of LLMs but also makes considerable strides towards their capacity for abstract reasoning, moving beyond simple memorization or imitation to a more nuanced understanding and application of generic facts.

Memorization

RIS-empowered Topology Control for Distributed Learning in Urban Air Mobility

no code implementations8 Mar 2024 Kai Xiong, Rui Wang, Supeng Leng, Wenyang Che, Chongwen Huang, Chau Yuen

Urban Air Mobility (UAM) expands vehicles from the ground to the near-ground space, envisioned as a revolution for transportation systems.

Federated Learning MULTI-VIEW LEARNING

Deciphering the Impact of Pretraining Data on Large Language Models through Machine Unlearning

no code implementations18 Feb 2024 Yang Zhao, Li Du, Xiao Ding, Kai Xiong, Zhouhao Sun, Jun Shi, Ting Liu, Bing Qin

Through pretraining on a corpus with various sources, Large Language Models (LLMs) have gained impressive performance.

Machine Unlearning

PACE: A Large-Scale Dataset with Pose Annotations in Cluttered Environments

1 code implementation23 Dec 2023 Yang You, Kai Xiong, Zhening Yang, Zhengxiang Huang, Junwei Zhou, Ruoxi Shi, Zhou Fang, Adam W. Harley, Leonidas Guibas, Cewu Lu

We introduce PACE (Pose Annotations in Cluttered Environments), a large-scale benchmark designed to advance the development and evaluation of pose estimation methods in cluttered scenarios.

Pose Estimation Pose Tracking

Intuitive or Dependent? Investigating LLMs' Behavior Style to Conflicting Prompts

no code implementations29 Sep 2023 Jiahao Ying, Yixin Cao, Kai Xiong, Yidong He, Long Cui, Yongbin Liu

Drawing on cognitive theory, we target the first scenario of decision-making styles where there is no superiority in the conflict and categorize LLMs' preference into dependent, intuitive, and rational/irrational styles.

Benchmarking Decision Making +2

Examining Inter-Consistency of Large Language Models Collaboration: An In-depth Analysis via Debate

1 code implementation19 May 2023 Kai Xiong, Xiao Ding, Yixin Cao, Ting Liu, Bing Qin

Through extensive experiments on various datasets, LLMs can effectively collaborate to reach a consensus despite noticeable inter-inconsistencies, but imbalances in their abilities can lead to domination by superior LLMs.

Decision Making

Improving Cross-Task Generalization with Step-by-Step Instructions

no code implementations8 May 2023 Yang Wu, Yanyan Zhao, Zhongyang Li, Bing Qin, Kai Xiong

Instruction tuning has been shown to be able to improve cross-task generalization of language models.

ReCo: Reliable Causal Chain Reasoning via Structural Causal Recurrent Neural Networks

1 code implementation16 Dec 2022 Kai Xiong, Xiao Ding, Zhongyang Li, Li Du, Bing Qin, Yi Zheng, Baoxing Huai

Causal chain reasoning (CCR) is an essential ability for many decision-making AI systems, which requires the model to build reliable causal chains by connecting causal pairs.

Decision Making

One-Shot General Object Localization

1 code implementation24 Nov 2022 Yang You, Zhuochen Miao, Kai Xiong, Weiming Wang, Cewu Lu

In contrast, our proposed OneLoc algorithm efficiently finds the object center and bounding box size by a special voting scheme.

Object Object Localization

A Graph Enhanced BERT Model for Event Prediction

no code implementations Findings (ACL) 2022 Li Du, Xiao Ding, Yue Zhang, Kai Xiong, Ting Liu, Bing Qin

To this end, we incorporate an additional structured variable into BERT to learn to predict the event connections in the training process.

model Prediction

e-CARE: a New Dataset for Exploring Explainable Causal Reasoning

1 code implementation ACL 2022 Li Du, Xiao Ding, Kai Xiong, Ting Liu, Bing Qin

Understanding causality has vital importance for various Natural Language Processing (NLP) applications.

valid

ExCAR: Event Graph Knowledge Enhanced Explainable Causal Reasoning

1 code implementation ACL 2021 Li Du, Xiao Ding, Kai Xiong, Ting Liu, Bing Qin

ExCAR first acquires additional evidence information from a large-scale causal event graph as logical rules for causal reasoning.

Representation Learning

Multi-hop RIS-Empowered Terahertz Communications: A DRL-based Hybrid Beamforming Design

no code implementations22 Jan 2021 Chongwen Huang, Zhaohui Yang, George C. Alexandropoulos, Kai Xiong, Li Wei, Chau Yuen, Zhaoyang Zhang, Merouane Debbah

We investigate the joint design of digital beamforming matrix at the BS and analog beamforming matrices at the RISs, by leveraging the recent advances in deep reinforcement learning (DRL) to combat the propagation loss.

Deep Reinforcement Learning

Hybrid Beamforming for RIS-Empowered Multi-hop Terahertz Communications: A DRL-based Method

no code implementations20 Sep 2020 Chongwen Huang, Zhaohui Yang, George C. Alexandropoulos, Kai Xiong, Li Wei, Chau Yuen, Zhaoyang Zhang

Wireless communication in the TeraHertz band (0. 1--10 THz) is envisioned as one of the key enabling technologies for the future six generation (6G) wireless communication systems.

Deep Reinforcement Learning

Quda: Natural Language Queries for Visual Data Analytics

no code implementations7 May 2020 Siwei Fu, Kai Xiong, Xiaodong Ge, Siliang Tang, Wei Chen, Yingcai Wu

To address this challenge, we present a new dataset, called Quda, that aims to help V-NLIs recognize analytic tasks from free-form natural language by training and evaluating cutting-edge multi-label classification models.

Multi-Label Classification MUlTI-LABEL-ClASSIFICATION +2

Cannot find the paper you are looking for? You can Submit a new open access paper.