Search Results for author: Qi Zhang

Found 454 papers, 186 papers with code

Read Extensively, Focus Smartly: A Cross-document Semantic Enhancement Method for Visual Documents NER

no code implementations COLING 2022 Jun Zhao, Xin Zhao, WenYu Zhan, Tao Gui, Qi Zhang, Liang Qiao, Zhanzhan Cheng, ShiLiang Pu

To deal with this problem, this work proposes a cross-document semantic enhancement method, which consists of two modules: 1) To prevent distractions from irrelevant regions in the current document, we design a learnable attention mask mechanism, which is used to adaptively filter redundant information in the current document.

NER

LFKQG: A Controlled Generation Framework with Local Fine-tuning for Question Generation over Knowledge Bases

no code implementations COLING 2022 Zichu Fei, Xin Zhou, Tao Gui, Qi Zhang, Xuanjing Huang

Existing KBQG models still face two main challenges: (1) Most models often focus on the most relevant part of the answer entity, while neglecting the rest of the subgraph.

Natural Questions Question Generation +1

Iterative GNN-based Decoder for Question Generation

1 code implementation EMNLP 2021 Zichu Fei, Qi Zhang, Yaqian Zhou

However, (1) they ignore the rich structure information that is hidden in the previously generated text.

Decoder Graph Neural Network +3

A Progressive Framework for Role-Aware Rumor Resolution

1 code implementation COLING 2022 Lei Chen, Guanying Li, Zhongyu Wei, Yang Yang, Baohua Zhou, Qi Zhang, Xuanjing Huang

Existing works on rumor resolution have shown great potential in recognizing word appearance and user participation.

PlugAT: A Plug and Play Module to Defend against Textual Adversarial Attack

no code implementations COLING 2022 Rui Zheng, Rong Bao, Qin Liu, Tao Gui, Qi Zhang, Xuanjing Huang, Rui Xie, Wei Wu

To reduce the potential side effects of using defense modules, we further propose a novel forgetting restricted adversarial training, which filters out bad adversarial examples that impair the performance of original ones.

Adversarial Attack Domain Adaptation +2

An Empirical Assessment of the Qualitative Aspects of Misinformation in Health News

no code implementations NAACL (NLP4IF) 2021 Chaoyuan Zuo, Qi Zhang, Ritwik Banerjee

We present a health news classification task to determine whether medical news articles satisfy a set of review criteria deemed important by medical experts and health care journalists.

Fact Checking Misinformation +1

CQG: A Simple and Effective Controlled Generation Framework for Multi-hop Question Generation

1 code implementation ACL 2022 Zichu Fei, Qi Zhang, Tao Gui, Di Liang, Sirui Wang, Wei Wu, Xuanjing Huang

CQG employs a simple method to generate the multi-hop questions that contain key entities in multi-hop reasoning chains, which ensure the complexity and quality of the questions.

Decoder Question Generation +1

Making Parameter-efficient Tuning More Efficient: A Unified Framework for Classification Tasks

1 code implementation COLING 2022 Xin Zhou, Ruotian Ma, Yicheng Zou, Xuanting Chen, Tao Gui, Qi Zhang, Xuanjing Huang, Rui Xie, Wei Wu

Specifically, we re-formulate both token and sentence classification tasks into a unified language modeling task, and map label spaces of different tasks into the same vocabulary space.

Language Modelling Sentence +2

A Structure-Aware Argument Encoder for Literature Discourse Analysis

1 code implementation COLING 2022 Yinzi Li, Wei Chen, Zhongyu Wei, Yujun Huang, Chujun Wang, Siyuan Wang, Qi Zhang, Xuanjing Huang, Libo Wu

Existing research for argument representation learning mainly treats tokens in the sentence equally and ignores the implied structure information of argumentative context.

Position Representation Learning +1

RMB: Comprehensively Benchmarking Reward Models in LLM Alignment

1 code implementation13 Oct 2024 Enyu Zhou, Guodong Zheng, Binghai Wang, Zhiheng Xi, Shihan Dou, Rong Bao, Wei Shen, Limao Xiong, Jessica Fan, Yurong Mou, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing Huang

However, the current evaluation of RMs may not directly correspond to their alignment performance due to the limited distribution of evaluation data and evaluation methods that are not closely related to alignment objectives.

Benchmarking

MTL-LoRA: Low-Rank Adaptation for Multi-Task Learning

no code implementations12 Oct 2024 Yaming Yang, Dilixat Muhtar, Yelong Shen, Yuefeng Zhan, Jianfeng Liu, Yujing Wang, Hao Sun, Denvy Deng, Feng Sun, Qi Zhang, Weizhu Chen, Yunhai Tong

Parameter-efficient fine-tuning (PEFT) has been widely employed for domain adaptation, with LoRA being one of the most prominent methods due to its simplicity and effectiveness.

Domain Adaptation Multi-Task Learning +2

When Graph meets Multimodal: Benchmarking on Multimodal Attributed Graphs Learning

1 code implementation11 Oct 2024 Hao Yan, Chaozhuo Li, Zhigang Yu, Jun Yin, Ruochen Liu, Peiyan Zhang, Weihao Han, Mingzheng Li, Zhengxin Zeng, Hao Sun, Weiwei Deng, Feng Sun, Qi Zhang, Senzhang Wang

However, the absence of meaningful benchmark datasets and standardized evaluation procedures for MAG representation learning has impeded progress in this field.

Attribute Benchmarking +1

DISCO: A Hierarchical Disentangled Cognitive Diagnosis Framework for Interpretable Job Recommendation

no code implementations10 Oct 2024 Xiaoshan Yu, Chuan Qin, Qi Zhang, Chen Zhu, Haiping Ma, Xingyi Zhang, HengShu Zhu

To this end, in this paper, we propose DISCO, a hierarchical Disentanglement based Cognitive diagnosis framework, aimed at flexibly accommodating the underlying representation learning model for effective and interpretable job recommendations.

cognitive diagnosis Contrastive Learning +3

Modeling Layout Reading Order as Ordering Relations for Visually-rich Document Understanding

1 code implementation29 Sep 2024 Chong Zhang, Yi Tu, Yixi Zhao, Chenshu Yuan, Huan Chen, Yue Zhang, Mingxu Chai, Ya Guo, Huijia Zhu, Qi Zhang, Tao Gui

However, we argue that this formulation does not adequately convey the complete reading order information in the layout, which may potentially lead to performance decline in downstream VrD tasks.

document understanding Entity Linking +4

Convergence-aware Clustered Federated Graph Learning Framework for Collaborative Inter-company Labor Market Forecasting

no code implementations29 Sep 2024 Zhuoning Guo, Hao liu, Le Zhang, Qi Zhang, HengShu Zhu, Hui Xiong

To this end, in this paper, we formulate the Federated Labor Market Forecasting (FedLMF) problem and propose a Meta-personalized Convergence-aware Clustered Federated Learning (MPCAC-FL) framework to provide accurate and timely collaborative talent demand and supply prediction in a privacy-preserving way.

Federated Learning Graph Learning +2

AI Delegates with a Dual Focus: Ensuring Privacy and Strategic Self-Disclosure

no code implementations26 Sep 2024 Xi Chen, Zhiyang Zhang, Fangkai Yang, Xiaoting Qin, Chao Du, Xi Cheng, Hangxin Liu, QIngwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang

Large language model (LLM)-based AI delegates are increasingly utilized to act on behalf of users, assisting them with a wide range of tasks through conversational interfaces.

Language Modelling Large Language Model

Neural Implicit Representation for Highly Dynamic LiDAR Mapping and Odometry

no code implementations26 Sep 2024 Qi Zhang, He Wang, Ru Li, Wenbin Li

By identifying and excluding dynamic elements from the mapping process, this segmentation enables the creation of a dense 3D map that accurately represents the static background only.

3D Scene Reconstruction Simultaneous Localization and Mapping

Empirical Insights on Fine-Tuning Large Language Models for Question-Answering

no code implementations24 Sep 2024 Junjie Ye, Yuming Yang, Qi Zhang, Tao Gui, Xuanjing Huang, Peng Wang, Zhongchao shi, Jianping Fan

Large language models (LLMs) encode extensive world knowledge through pre-training on massive datasets, which can then be fine-tuned for the question-answering (QA) task.

Question Answering World Knowledge

Online Learning via Memory: Retrieval-Augmented Detector Adaptation

no code implementations16 Sep 2024 Yanan Jian, Fuxun Yu, Qi Zhang, William LeVine, Brandon Dubbs, Nikolaos Karianakis

This paper presents a novel way of online adapting any off-the-shelf object detection model to a novel domain without retraining the detector model.

Memorization object-detection +2

Autonomous Goal Detection and Cessation in Reinforcement Learning: A Case Study on Source Term Estimation

no code implementations14 Sep 2024 Yiwei Shi, Muning Wen, Qi Zhang, Weinan Zhang, Cunjia Liu, Weiru Liu

Reinforcement Learning has revolutionized decision-making processes in dynamic environments, yet it often struggles with autonomously detecting and achieving goals without clear feedback signals.

Decision Making

Unleash LLMs Potential for Recommendation by Coordinating Twin-Tower Dynamic Semantic Token Generator

no code implementations14 Sep 2024 Jun Yin, Zhengxin Zeng, Mingzheng Li, Hao Yan, Chaozhuo Li, Weihao Han, Jianjin Zhang, Ruochen Liu, Allen Sun, Denvy Deng, Feng Sun, Qi Zhang, Shirui Pan, Senzhang Wang

Owing to the unprecedented capability in semantic understanding and logical reasoning, the pre-trained large language models (LLMs) have shown fantastic potential in developing the next-generation recommender systems (RSs).

Logical Reasoning Recommendation Systems

WeatherReal: A Benchmark Based on In-Situ Observations for Evaluating Weather Models

1 code implementation14 Sep 2024 Weixin Jin, Jonathan Weyn, Pengcheng Zhao, Siqi Xiang, Jiang Bian, Zuliang Fang, Haiyu Dong, Hongyu Sun, Kit Thambiratnam, Qi Zhang

Our work aims to advance the AI-based weather forecasting research towards a more application-focused and operation-ready approach.

Weather Forecasting

Mahalanobis Distance-based Multi-view Optimal Transport for Multi-view Crowd Localization

1 code implementation3 Sep 2024 Qi Zhang, Kaiyi Zhang, Antoni B. Chan, Hui Huang

Second, the object-to-camera distance in each view is used to adjust the optimal transport cost of each location further, where the wrong predictions far away from the camera are more heavily penalized.

Multiview Detection

Inverse-Q*: Token Level Reinforcement Learning for Aligning Large Language Models Without Preference Data

no code implementations27 Aug 2024 Han Xia, Songyang Gao, Qiming Ge, Zhiheng Xi, Qi Zhang, Xuanjing Huang

Reinforcement Learning from Human Feedback (RLHF) has proven effective in aligning large language models with human intentions, yet it often relies on complex methodologies like Proximal Policy Optimization (PPO) that require extensive hyper-parameter tuning and present challenges in sample efficiency and stability.

reinforcement-learning Reinforcement Learning

A Conflicts-free, Speed-lossless KAN-based Reinforcement Learning Decision System for Interactive Driving in Roundabouts

no code implementations15 Aug 2024 ZhiHao Lin, Zhen Tian, Qi Zhang, Ziyang Ye, Hanyang Zhuang, Jianglin Lan

This paper introduces a learning-based algorithm tailored to foster safe and efficient driving behaviors across varying levels of traffic flows in roundabouts.

Autonomous Driving Model Predictive Control +1

UNER: A Unified Prediction Head for Named Entity Recognition in Visually-rich Documents

no code implementations2 Aug 2024 Yi Tu, Chong Zhang, Ya Guo, Huan Chen, Jinyang Tang, Huijia Zhu, Qi Zhang

The recognition of named entities in visually-rich documents (VrD-NER) plays a critical role in various real-world scenarios and applications.

named-entity-recognition Named Entity Recognition +2

Head360: Learning a Parametric 3D Full-Head for Free-View Synthesis in 360°

no code implementations1 Aug 2024 Yuxiao He, Yiyu Zhuang, Yanwen Wang, Yao Yao, Siyu Zhu, Xiaoyu Li, Qi Zhang, Xun Cao, Hao Zhu

To the best of our knowledge, our model is the first parametric 3D full-head that achieves 360{\deg} free-view synthesis, image-based fitting, appearance editing, and animation within a single model.

RevGNN: Negative Sampling Enhanced Contrastive Graph Learning for Academic Reviewer Recommendation

no code implementations30 Jul 2024 Weibin Liao, Yifan Zhu, Yanyan Li, Qi Zhang, Zhonghong Ou, Xuesong Li

Therefore, investigating how to better comprehend the negative labeling of unobserved interactions in academic reviewer recommendations is a significant challenge.

Contrastive Learning Graph Learning

FINER++: Building a Family of Variable-periodic Functions for Activating Implicit Neural Representation

no code implementations28 Jul 2024 Hao Zhu, Zhen Liu, Qi Zhang, Jingde Fu, Weibing Deng, Zhan Ma, Yanwen Guo, Xun Cao

By initializing the bias of the neural network with different ranges, sub-functions with various frequencies in the variable-periodic function are selected for activation.

Graph Memory Learning: Imitating Lifelong Remembering and Forgetting of Brain Networks

no code implementations27 Jul 2024 Jiaxing Miao, Liang Hu, Qi Zhang, Longbing Cao

BGML incorporates a multi-granular hierarchical progressive learning mechanism rooted in feature graph grain learning to mitigate potential conflict between memorization and forgetting in graph memory learning.

Memorization Node Classification

The Vision of Autonomic Computing: Can LLMs Make It a Reality?

no code implementations19 Jul 2024 Zhiyang Zhang, Fangkai Yang, Xiaoting Qin, Jue Zhang, QIngwei Lin, Gong Cheng, Dongmei Zhang, Saravan Rajmohan, Qi Zhang

The Vision of Autonomic Computing (ACV), proposed over two decades ago, envisions computing systems that self-manage akin to biological organisms, adapting seamlessly to changing environments.

Management

Robust Multivariate Time Series Forecasting against Intra- and Inter-Series Transitional Shift

no code implementations18 Jul 2024 Hui He, Qi Zhang, Kun Yi, Xiaojun Xue, Shoujin Wang, Liang Hu, Longbing Cao

The non-stationary nature of real-world Multivariate Time Series (MTS) data presents forecasting models with a formidable challenge of the time-variant distribution of time series, referred to as distribution shift.

Multivariate Time Series Forecasting Time Series

E5-V: Universal Embeddings with Multimodal Large Language Models

1 code implementation17 Jul 2024 Ting Jiang, Minghui Song, Zihan Zhang, Haizhen Huang, Weiwei Deng, Feng Sun, Qi Zhang, Deqing Wang, Fuzhen Zhuang

We propose a single modality training approach for E5-V, where the model is trained exclusively on text pairs.

Power Optimization and Deep Learning for Channel Estimation of Active IRS-Aided IoT

no code implementations12 Jul 2024 Yan Wang, Feng Shu, Rongen Dong, Wei Gao, Qi Zhang, Jiajia Liu

In the second case, when the transmit power at the IoT devices is fixed, there exists an optimal reflective power at active IRS.

Neural Poisson Solver: A Universal and Continuous Framework for Natural Signal Blending

no code implementations11 Jul 2024 Delong Wu, Hao Zhu, Qi Zhang, You Li, Zhan Ma, Xun Cao

To tackle this issue, we introduce the Neural Poisson Solver, a plug-and-play and universally applicable framework across different signal dimensions for blending visual signals represented by INRs.

An Earth Rover dataset recorded at the ICRA@40 party

1 code implementation8 Jul 2024 Qi Zhang, ZhiHao Lin, Arnoud Visser

The ICRA conference is celebrating its $40^{th}$ anniversary in Rotterdam in September 2024, with as highlight the Happy Birthday ICRA Party at the iconic Holland America Line Cruise Terminal.

2k Autonomous Navigation

Look Ahead or Look Around? A Theoretical Comparison Between Autoregressive and Masked Pretraining

no code implementations1 Jul 2024 Qi Zhang, Tianqi Du, Haotian Huang, Yifei Wang, Yisen Wang

In recent years, the rise of generative self-supervised learning (SSL) paradigms has exhibited impressive performance across visual, language, and multi-modal domains.

Self-Supervised Learning

Deep Frequency Derivative Learning for Non-stationary Time Series Forecasting

no code implementations29 Jun 2024 Wei Fan, Kun Yi, Hangting Ye, Zhiyuan Ning, Qi Zhang, Ning An

We present a deep frequency derivative learning framework, DERITS, for non-stationary time series forecasting.

Time Series Time Series Forecasting

AutoRAG-HP: Automatic Online Hyper-Parameter Tuning for Retrieval-Augmented Generation

no code implementations27 Jun 2024 Jia Fu, Xiaoting Qin, Fangkai Yang, Lu Wang, Jue Zhang, QIngwei Lin, Yubo Chen, Dongmei Zhang, Saravan Rajmohan, Qi Zhang

Recent advancements in Large Language Models have transformed ML/AI development, necessitating a reevaluation of AutoML principles for the Retrieval-Augmented Generation (RAG) systems.

AutoML Efficient Exploration +3

SafeAligner: Safety Alignment against Jailbreak Attacks via Response Disparity Guidance

1 code implementation26 Jun 2024 Caishuang Huang, Wanxu Zhao, Rui Zheng, Huijie Lv, Shihan Dou, Sixian Li, Xiao Wang, Enyu Zhou, Junjie Ye, Yuming Yang, Tao Gui, Qi Zhang, Xuanjing Huang

As the development of large language models (LLMs) rapidly advances, securing these models effectively without compromising their utility has become a pivotal area of research.

Safety Alignment

Discovering Common Information in Multi-view Data

1 code implementation21 Jun 2024 Qi Zhang, Mingfei Lu, Shujian Yu, Jingmin Xin, Badong Chen

We introduce an innovative and mathematically rigorous definition for computing common information from multi-view data, drawing inspiration from G\'acs-K\"orner common information in information theory.

MULTI-VIEW LEARNING

Aligning Large Language Models from Self-Reference AI Feedback with one General Principle

1 code implementation17 Jun 2024 Rong Bao, Rui Zheng, Shihan Dou, Xiao Wang, Enyu Zhou, Bo wang, Qi Zhang, Liang Ding, DaCheng Tao

In aligning large language models (LLMs), utilizing feedback from existing advanced AI rather than humans is an important method to scale supervisory signals.

Position

Efficient Sequential Decision Making with Large Language Models

no code implementations17 Jun 2024 Dingyang Chen, Qi Zhang, Yinglun Zhu

In this paper, we propose a new approach that leverages online model selection algorithms to efficiently incorporate LLMs agents into sequential decision making.

Decision Making Model Selection

Toward Optimal LLM Alignments Using Two-Player Games

1 code implementation16 Jun 2024 Rui Zheng, Hongyi Guo, Zhihan Liu, Xiaoying Zhang, Yuanshun Yao, Xiaojun Xu, Zhaoran Wang, Zhiheng Xi, Tao Gui, Qi Zhang, Xuanjing Huang, Hang Li, Yang Liu

We theoretically demonstrate that this iterative reinforcement learning optimization converges to a Nash Equilibrium for the game induced by the agents.

reinforcement-learning Reinforcement Learning

Uncertainty Aware Learning for Language Model Alignment

no code implementations7 Jun 2024 Yikun Wang, Rui Zheng, Liang Ding, Qi Zhang, Dahua Lin, DaCheng Tao

As instruction-tuned large language models (LLMs) evolve, aligning pretrained foundation models presents increasing challenges.

GSM8K Language Modelling

P-MSDiff: Parallel Multi-Scale Diffusion for Remote Sensing Image Segmentation

1 code implementation30 May 2024 Qi Zhang, Guohua Geng, Longquan Yan, Pengbo Zhou, Zhaodi Li, Kang Li, Qinglin Liu

This adjustment enhances the structure for multi-head attention computation, leading to enhanced network performance and CBLA is a plug-and-play module.

Denoising Image Segmentation +2

MGDA Converges under Generalized Smoothness, Provably

no code implementations29 May 2024 Qi Zhang, Peiyao Xiao, Shaofeng Zou, Kaiyi Ji

We provide a comprehensive convergence analysis of these algorithms and show that they converge to an $\epsilon$-accurate Pareto stationary point with a guaranteed $\epsilon$-level average CA distance (i. e., the gap between the updating direction and the CA direction) over all iterations, where totally $\mathcal{O}(\epsilon^{-2})$ and $\mathcal{O}(\epsilon^{-4})$ samples are needed for deterministic and stochastic settings, respectively.

Multi-Task Learning

Mani-GS: Gaussian Splatting Manipulation with Triangular Mesh

no code implementations28 May 2024 Xiangjun Gao, Xiaoyu Li, Yiyu Zhuang, Qi Zhang, WenBo Hu, Chaopeng Zhang, Yao Yao, Ying Shan, Long Quan

This approach reduces the need to design various algorithms for different types of Gaussian manipulation.

Novel View Synthesis

BO4IO: A Bayesian optimization approach to inverse optimization with uncertainty quantification

1 code implementation28 May 2024 Yen-An Lu, Wei-Shou Hu, Joel A. Paulson, Qi Zhang

This work addresses data-driven inverse optimization (IO), where the goal is to estimate unknown parameters in an optimization model from observed decisions that can be assumed to be optimal or near-optimal solutions to the optimization problem.

Bayesian Optimization Uncertainty Quantification

ASI++: Towards Distributionally Balanced End-to-End Generative Retrieval

no code implementations23 May 2024 Yuxuan Liu, Tianchi Yang, Zihan Zhang, Minghui Song, Haizhen Huang, Weiwei Deng, Feng Sun, Qi Zhang

Generative retrieval, a promising new paradigm in information retrieval, employs a seq2seq model to encode document features into parameters and decode relevant document identifiers (IDs) based on search queries.

Information Retrieval Quantization +1

Exploring the Compositional Deficiency of Large Language Models in Mathematical Reasoning

1 code implementation5 May 2024 Jun Zhao, Jingqi Tong, Yurong Mou, Ming Zhang, Qi Zhang, Xuanjing Huang

In this work, we investigate the compositionality of large language models (LLMs) in mathematical reasoning.

GSM8K Math +1

CoSD: Collaborative Stance Detection with Contrastive Heterogeneous Topic Graph Learning

no code implementations26 Apr 2024 Yinghan Cheng, Qi Zhang, Chongyang Shi, Liang Xiao, Shufeng Hao, Liang Hu

To address these challenges, we present a novel collaborative stance detection framework called (CoSD) which leverages contrastive heterogeneous topic graph learning to learn topic-aware semantics and collaborative signals among texts, topics, and stance labels for enhancing stance detection.

Graph Learning Stance Detection

Dynamic fault detection and diagnosis for alkaline water electrolyzer with variational Bayesian Sparse principal component analysis

no code implementations24 Apr 2024 Qi Zhang, Weihua Xu, Lei Xie, Hongye Su

Electrolytic hydrogen production serves as not only a vital source of green hydrogen but also a key strategy for addressing renewable energy consumption challenges.

Fault Detection

Wills Aligner: A Robust Multi-Subject Brain Representation Learner

no code implementations20 Apr 2024 Guangyin Bao, Zixuan Gong, Qi Zhang, Jialei Zhou, Wei Fan, Kun Yi, Usman Naseem, Liang Hu, Duoqian Miao

We meticulously evaluate the performance of our approach across coarse-grained and fine-grained visual decoding tasks.

Representation Learning

MindTuner: Cross-Subject Visual Decoding with Visual Fingerprint and Semantic Correction

no code implementations19 Apr 2024 Zixuan Gong, Qi Zhang, Guangyin Bao, Lei Zhu, Ke Liu, Liang Hu, Duoqian Miao

Decoding natural visual scenes from brain activity has flourished, with extensive research in single-subject tasks and, however, less in cross-subject tasks.

Image Reconstruction Text Retrieval

Unveiling the Misuse Potential of Base Large Language Models via In-Context Learning

no code implementations16 Apr 2024 Xiao Wang, Tianze Chen, Xianjun Yang, Qi Zhang, Xun Zhao, Dahua Lin

The open-sourcing of large language models (LLMs) accelerates application development, innovation, and scientific progress.

In-Context Learning Instruction Following

Dynamic fault detection and diagnosis of industrial alkaline water electrolyzer process with variational Bayesian dictionary learning

no code implementations15 Apr 2024 Qi Zhang, Lei Xie, Weihua Xu, Hongye Su

A novel robust dynamic variational Bayesian dictionary learning (RDVDL) monitoring approach is proposed to improve the reliability and safety of AWE operation.

Dictionary Learning Fault Detection

Nonlinear sparse variational Bayesian learning based model predictive control with application to PEMFC temperature control

no code implementations15 Apr 2024 Qi Zhang, Lei Wang, Weihua Xu, Hongye Su, Lei Xie

Variational inference is used by NSVB-MPC to assess the predictive accuracy and make the necessary corrections to quantify system uncertainty.

Model Predictive Control Variational Inference

A Copula Graphical Model for Multi-Attribute Data using Optimal Transport

no code implementations10 Apr 2024 Qi Zhang, Bing Li, Lingzhou Xue

Motivated by modern data forms such as images and multi-view data, the multi-attribute graphical model aims to explore the conditional independence structure among vectors.

Attribute

Large-Scale Non-convex Stochastic Constrained Distributionally Robust Optimization

no code implementations1 Apr 2024 Qi Zhang, Yi Zhou, Ashley Prater-Bennette, Lixin Shen, Shaofeng Zou

We prove that our algorithm finds an $\epsilon$-stationary point with a computational complexity of $\mathcal O(\epsilon^{-3k_*-5})$, where $k_*$ is the parameter of the Cressie-Read divergence.

Convergence Guarantees for RMSProp and Adam in Generalized-smooth Non-convex Optimization with Affine Noise Variance

no code implementations1 Apr 2024 Qi Zhang, Yi Zhou, Shaofeng Zou

Specifically, to solve the challenges due to dependence among adaptive update, unbounded gradient estimate and Lipschitz constant, we demonstrate that the first-order term in the descent lemma converges and its denominator is upper bounded by a function of gradient norm.

LEMMA

Self-Demos: Eliciting Out-of-Demonstration Generalizability in Large Language Models

1 code implementation1 Apr 2024 wei he, Shichun Liu, Jun Zhao, Yiwen Ding, Yi Lu, Zhiheng Xi, Tao Gui, Qi Zhang, Xuanjing Huang

The generated demos strategically interpolate between existing demos and the given query, transforming the query from OOD to ID.

In-Context Learning Math

Information Cascade Prediction under Public Emergencies: A Survey

no code implementations28 Mar 2024 Qi Zhang, Guang Wang, Li Lin, Kaiwen Xia, Shuai Wang

With the advent of the era of big data, massive information, expert experience, and high-accuracy models bring great opportunities to the information cascade prediction of public emergencies.

Survey

Subspace Defense: Discarding Adversarial Perturbations by Learning a Subspace for Clean Signals

no code implementations24 Mar 2024 Rui Zheng, Yuhao Zhou, Zhiheng Xi, Tao Gui, Qi Zhang, Xuanjing Huang

We first empirically show that the features of either clean signals or adversarial perturbations are redundant and span in low-dimensional linear subspaces respectively with minimal overlap, and the classical low-dimensional subspace projection can suppress perturbation features out of the subspace of clean signals.

Adversarial Defense

Non-negative Contrastive Learning

1 code implementation19 Mar 2024 Yifei Wang, Qi Zhang, Yaoyu Guo, Yisen Wang

In this paper, we propose Non-negative Contrastive Learning (NCL), a renaissance of Non-negative Matrix Factorization (NMF) aimed at deriving interpretable features.

Contrastive Learning Disentanglement +1

UV Gaussians: Joint Learning of Mesh Deformation and Gaussian Textures for Human Avatar Modeling

no code implementations18 Mar 2024 Yujiao Jiang, Qingmin Liao, Xiaoyu Li, Li Ma, Qi Zhang, Chaopeng Zhang, Zongqing Lu, Ying Shan

Therefore, we propose UV Gaussians, which models the 3D human body by jointly learning mesh deformations and 2D UV-space Gaussian textures.

Analytic-Splatting: Anti-Aliased 3D Gaussian Splatting via Analytic Integration

no code implementations17 Mar 2024 Zhihao Liang, Qi Zhang, WenBo Hu, Ying Feng, Lei Zhu, Kui Jia

This is because 3DGS treats each pixel as an isolated, single point rather than as an area, causing insensitivity to changes in the footprints of pixels.

Call Me When Necessary: LLMs can Efficiently and Faithfully Reason over Structured Environments

no code implementations13 Mar 2024 Sitao Cheng, Ziyuan Zhuang, Yong Xu, Fangkai Yang, Chaoyun Zhang, Xiaoting Qin, Xiang Huang, Ling Chen, QIngwei Lin, Dongmei Zhang, Saravan Rajmohan, Qi Zhang

We propose Reasoning-Path-Editing (Readi), a novel framework where LLMs can efficiently and faithfully reason over structured environments.

ResLoRA: Identity Residual Mapping in Low-Rank Adaption

1 code implementation28 Feb 2024 Shuhua Shi, Shaohan Huang, Minghui Song, Zhoujun Li, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang

As one of the most popular parameter-efficient fine-tuning (PEFT) methods, low-rank adaptation (LoRA) is commonly applied to fine-tune large language models (LLMs).

parameter-efficient fine-tuning

RECOST: External Knowledge Guided Data-efficient Instruction Tuning

no code implementations27 Feb 2024 Qi Zhang, Yiming Zhang, Haobo Wang, Junbo Zhao

When it comes to datasets synthesized by LLMs, a common scenario in this field, dirty samples will even be selected with a higher probability than other samples.

Diversity Re-Ranking

RoCoIns: Enhancing Robustness of Large Language Models through Code-Style Instructions

no code implementations26 Feb 2024 Yuansen Zhang, Xiao Wang, Zhiheng Xi, Han Xia, Tao Gui, Qi Zhang, Xuanjing Huang

In this paper, drawing inspiration from recent works that LLMs are sensitive to the design of the instructions, we utilize instructions in code style, which are more structural and less ambiguous, to replace typically natural language instructions.

CodeChameleon: Personalized Encryption Framework for Jailbreaking Large Language Models

1 code implementation26 Feb 2024 Huijie Lv, Xiao Wang, Yuansen Zhang, Caishuang Huang, Shihan Dou, Junjie Ye, Tao Gui, Qi Zhang, Xuanjing Huang

Adversarial misuse, particularly through `jailbreaking' that circumvents a model's safety and ethical protocols, poses a significant challenge for Large Language Models (LLMs).

Code Completion Response Generation

Domain Generalization via Causal Adjustment for Cross-Domain Sentiment Analysis

no code implementations22 Feb 2024 Siyin Wang, Jie zhou, Qin Chen, Qi Zhang, Tao Gui, Xuanjing Huang

Domain adaption has been widely adapted for cross-domain sentiment analysis to transfer knowledge from the source domain to the target domain.

Domain Generalization Sentiment Analysis

LLM-DA: Data Augmentation via Large Language Models for Few-Shot Named Entity Recognition

no code implementations22 Feb 2024 Junjie Ye, Nuo Xu, Yikun Wang, Jie zhou, Qi Zhang, Tao Gui, Xuanjing Huang

To overcome the limitations of existing data augmentation methods that compromise semantic integrity and address the uncertainty inherent in LLM-generated text, we leverage the distinctive characteristics of the NER task by augmenting the original data at both the contextual and entity levels.

Data Augmentation few-shot-ner +5

Unveiling Linguistic Regions in Large Language Models

1 code implementation22 Feb 2024 Zhihao Zhang, Jun Zhao, Qi Zhang, Tao Gui, Xuanjing Huang

Furthermore, this core region exhibits significant dimensional dependence, perturbations to even a single parameter on specific dimensions leading to a loss of linguistic competence.

$Se^2$: Sequential Example Selection for In-Context Learning

1 code implementation21 Feb 2024 Haoyu Liu, Jianfeng Liu, Shaohan Huang, Yuefeng Zhan, Hao Sun, Weiwei Deng, Furu Wei, Qi Zhang

The remarkable capability of large language models (LLMs) for in-context learning (ICL) needs to be activated by demonstration examples.

Diversity In-Context Learning

Text Diffusion with Reinforced Conditioning

no code implementations19 Feb 2024 Yuxuan Liu, Tianchi Yang, Shaohan Huang, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang

Diffusion models have demonstrated exceptional capability in generating high-quality images, videos, and audio.

MSynFD: Multi-hop Syntax aware Fake News Detection

no code implementations18 Feb 2024 Liang Xiao, Qi Zhang, Chongyang Shi, Shoujin Wang, Usman Naseem, Liang Hu

These existing methods fail to handle the complex, subtle twists in news articles, such as syntax-semantics mismatches and prior biases, leading to lower performance and potential failure when modalities or social context are missing.

Fake News Detection

Advancing Translation Preference Modeling with RLHF: A Step Towards Cost-Effective Solution

no code implementations18 Feb 2024 Nuo Xu, Jun Zhao, Can Zu, Sixian Li, Lu Chen, Zhihao Zhang, Rui Zheng, Shihan Dou, Wenjuan Qin, Tao Gui, Qi Zhang, Xuanjing Huang

To address this issue, we propose a cost-effective preference learning strategy, optimizing reward models by distinguishing between human and machine translations.

Machine Translation Translation

LongAgent: Scaling Language Models to 128k Context through Multi-Agent Collaboration

1 code implementation18 Feb 2024 Jun Zhao, Can Zu, Hao Xu, Yi Lu, wei he, Yiwen Ding, Tao Gui, Qi Zhang, Xuanjing Huang

Large language models (LLMs) have demonstrated impressive performance in understanding language and executing complex reasoning tasks.

Multi-hop Question Answering Question Answering +1

LongHeads: Multi-Head Attention is Secretly a Long Context Processor

1 code implementation16 Feb 2024 Yi Lu, Xin Zhou, wei he, Jun Zhao, Tao Ji, Tao Gui, Qi Zhang, Xuanjing Huang

Instead of allowing each head to attend to the full sentence, which struggles with generalizing to longer sequences due to out-of-distribution (OOD) issues, we allow each head to process in-distribution length by selecting and attending to important context chunks.

Sentence

ToolSword: Unveiling Safety Issues of Large Language Models in Tool Learning Across Three Stages

1 code implementation16 Feb 2024 Junjie Ye, Sixian Li, Guanyu Li, Caishuang Huang, Songyang Gao, Yilong Wu, Qi Zhang, Tao Gui, Xuanjing Huang

Tool learning is widely acknowledged as a foundational approach or deploying large language models (LLMs) in real-world scenarios.

UFO: A UI-Focused Agent for Windows OS Interaction

1 code implementation8 Feb 2024 Chaoyun Zhang, Liqun Li, Shilin He, Xu Zhang, Bo Qiao, Si Qin, Minghua Ma, Yu Kang, QIngwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang

We introduce UFO, an innovative UI-Focused agent to fulfill user requests tailored to applications on Windows OS, harnessing the capabilities of GPT-Vision.

Navigate

Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning

1 code implementation8 Feb 2024 Zhiheng Xi, Wenxiang Chen, Boyang Hong, Senjie Jin, Rui Zheng, wei he, Yiwen Ding, Shichun Liu, Xin Guo, Junzhe Wang, Honglin Guo, Wei Shen, Xiaoran Fan, Yuhao Zhou, Shihan Dou, Xiao Wang, Xinbo Zhang, Peng Sun, Tao Gui, Qi Zhang, Xuanjing Huang

In this paper, we propose R$^3$: Learning Reasoning through Reverse Curriculum Reinforcement Learning (RL), a novel method that employs only outcome supervision to achieve the benefits of process supervision for large language models.

GSM8K reinforcement-learning +1

Rethinking the Evaluation of Pre-trained Text-and-Layout Models from an Entity-Centric Perspective

1 code implementation4 Feb 2024 Chong Zhang, Yixi Zhao, Chenshu Yuan, Yi Tu, Ya Guo, Qi Zhang

Therefore, we claim the necessary standards for an ideal benchmark to evaluate the information extraction ability of PTLMs.

Entity Linking Semantic entity labeling