Search Results for author: Yang Zhang

Found 408 papers, 183 papers with code

Dual Attention Model for Citation Recommendation with Analyses on Explainability of Attention Mechanisms and Qualitative Experiments

no code implementations CL (ACL) 2022 Yang Zhang, Qiang Ma

A neural network model is designed to maximize the similarity between the embedding of the three inputs (local context words, section headers, and structural contexts) and the target citation appearing in the context.

Citation Recommendation

Tracing Human Stress from Physiological Signals using UWB Radar

no code implementations14 Oct 2024 Jia Xu, Teng Xiao, Pin Lv, Zhe Chen, Chao Cai, Yang Zhang, Zehui Xiong

Experimental results show that the proposed DST method significantly outperforms all the baselines in terms of tracing human stress states.

Task-agnostic Pre-training and Task-guided Fine-tuning for Versatile Diffusion Planner

no code implementations30 Sep 2024 Chenyou Fan, Chenjia Bai, Zhao Shan, Haoran He, Yang Zhang, Zhen Wang

To address these challenges, we aim to develop a versatile diffusion planner that can leverage large-scale inferior data that contains task-agnostic sub-optimal trajectories, with the ability to fast adapt to specific tasks.

Reinforcement Learning (RL)

Offline Signature Verification Based on Feature Disentangling Aided Variational Autoencoder

no code implementations29 Sep 2024 Hansong Zhang, Jiangjian Guo, Kun Li, Yang Zhang, Yimei Zhao

First, genuine signatures and skilled forgeries are highly similar in their appearances, resulting in a small inter-class distance.

Investigating Layer Importance in Large Language Models

no code implementations22 Sep 2024 Yang Zhang, Yanfei Dong, Kenji Kawaguchi

In this study, we advance the understanding of LLM by investigating the significance of individual layers in LLMs.

Data Valuation

Axial Attention Transformer Networks: A New Frontier in Breast Cancer Detection

no code implementations18 Sep 2024 Weijie He, Runyuan Bao, Yiru Cang, Jianjun Wei, Yang Zhang, Jiacheng Hu

This paper delves into the challenges and advancements in the field of medical image segmentation, particularly focusing on breast cancer diagnosis.

Breast Cancer Detection Computational Efficiency +4

Efficient Fine-Tuning of Large Language Models for Automated Medical Documentation

no code implementations14 Sep 2024 Hui Yi Leong, Yi Fan Gao, Ji Shuai, Yang Zhang, Uktu Pamuksuz

Scientific research indicates that for every hour spent in direct patient care, physicians spend nearly two additional hours on administrative tasks, particularly on electronic health records (EHRs) and desk work.

Language Modelling Large Language Model

Generated Data with Fake Privacy: Hidden Dangers of Fine-tuning Large Language Models on Generated Data

no code implementations12 Sep 2024 Atilla Akkus, Mingjie Li, Junjie Chu, Michael Backes, Yang Zhang, Sinem Sav

Large language models (LLMs) have shown considerable success in a range of domain-specific tasks, especially after fine-tuning.

Q-value Regularized Decision ConvFormer for Offline Reinforcement Learning

no code implementations12 Sep 2024 Teng Yan, Zhendong Ruan, Yaobang Cai, Yu Han, Wenxian Li, Yang Zhang

However, due to the inconsistency between the sampled returns within a single trajectory and the optimal returns across multiple trajectories, it is challenging to set an expected return to output the optimal action and stitch together suboptimal trajectories.

D4RL Offline RL +2

Understanding Data Importance in Machine Learning Attacks: Does Valuable Data Pose Greater Harm?

no code implementations5 Sep 2024 Rui Wen, Michael Backes, Yang Zhang

By analyzing the linkage between membership inference vulnerability and data importance, we demonstrate that sample characteristics can be integrated into membership metrics by introducing sample-specific criteria, therefore enhancing the membership inference performance.

SPDiffusion: Semantic Protection Diffusion for Multi-concept Text-to-image Generation

no code implementations2 Sep 2024 Yang Zhang, Rui Zhang, Xuecheng Nie, Haochen Li, Jikun Chen, Yifan Hao, Xin Zhang, Luoqi Liu, Ling Li

We found that attribute confusion occurs when a certain region of the latent features attend to multiple or incorrect prompt tokens.

Attribute Text-to-Image Generation

Membership Inference Attacks Against In-Context Learning

no code implementations2 Sep 2024 Rui Wen, Zheng Li, Michael Backes, Yang Zhang

Adapting Large Language Models (LLMs) to specific tasks introduces concerns about computational efficiency, prompting an exploration of efficient methods such as In-Context Learning (ICL).

Computational Efficiency In-Context Learning +2

Image-Perfect Imperfections: Safety, Bias, and Authenticity in the Shadow of Text-To-Image Model Evolution

no code implementations30 Aug 2024 Yixin Wu, Yun Shen, Michael Backes, Yang Zhang

This study takes an initial step in investigating the evolution of text-to-image models from the perspectives of safety, bias, and authenticity.

EMHI: A Multimodal Egocentric Human Motion Dataset with HMD and Body-Worn IMUs

no code implementations30 Aug 2024 Zhen Fan, Peng Dai, Zhuo Su, Xu Gao, Zheng Lv, Jiarui Zhang, Tianyuan Du, Guidong Wang, Yang Zhang

Specifically, EMHI provides synchronized stereo images from downward-sloping cameras on the headset and IMU data from body-worn sensors, along with pose annotations in SMPL format.

Pose Estimation

Inside the Black Box: Detecting Data Leakage in Pre-trained Language Encoders

no code implementations20 Aug 2024 Yuan Xin, Zheng Li, Ning Yu, Dingfan Chen, Mario Fritz, Michael Backes, Yang Zhang

Despite being prevalent in the general field of Natural Language Processing (NLP), pre-trained language models inherently carry privacy and copyright concerns due to their nature of training on large-scale web-scraped data.

BadMerging: Backdoor Attacks Against Model Merging

1 code implementation14 Aug 2024 Jinghuai Zhang, Jianfeng Chi, Zheng Li, Kunlin Cai, Yang Zhang, Yuan Tian

Considering that a merged model may incorporate tasks from different domains, BadMerging can jointly compromise the tasks provided by the adversary (on-task attack) and other contributors (off-task attack) and solve the corresponding unique challenges with novel attack designs.

Backdoor Attack Transfer Learning

Membership Inference Attack Against Masked Image Modeling

no code implementations13 Aug 2024 Zheng Li, Xinlei He, Ning Yu, Yang Zhang

Masked Image Modeling (MIM) has achieved significant success in the realm of self-supervised learning (SSL) for visual recognition.

Inference Attack Membership Inference Attack +1

Model Hijacking Attack in Federated Learning

no code implementations4 Aug 2024 Zheng Li, Siyuan Wu, Ruichuan Chen, Paarijaat Aditya, Istemi Ekin Akkus, Manohar Vanga, Min Zhang, Hao Li, Yang Zhang

Machine learning (ML), driven by prominent paradigms such as centralized and federated learning, has made significant progress in various critical applications ranging from autonomous driving to face recognition.

Autonomous Driving Data Poisoning +2

Vera Verto: Multimodal Hijacking Attack

no code implementations31 Jul 2024 Minxing Zhang, Ahmed Salem, Michael Backes, Yang Zhang

A recent attack in this domain is the model hijacking attack, whereby an adversary hijacks a victim model to implement their own -- possibly malicious -- hijacking tasks.

Decoder Image Classification

Breaking Agents: Compromising Autonomous LLM Agents Through Malfunction Amplification

no code implementations30 Jul 2024 Boyang Zhang, Yicong Tan, Yun Shen, Ahmed Salem, Michael Backes, Savvas Zannettou, Yang Zhang

Through attacks on implemented and deployable agents in multi-agent scenarios, we accentuate the realistic risks associated with these vulnerabilities.

Language Modelling

GradCraft: Elevating Multi-task Recommendations through Holistic Gradient Crafting

1 code implementation29 Jul 2024 Yimeng Bai, Yang Zhang, Fuli Feng, Jing Lu, Xiaoxue Zang, Chenyi Lei, Yang song

GradCraft ensures the concurrent achievement of appropriate magnitude balance and global direction balance, aligning with the inherent characteristics of recommendation scenarios.

Multi-Task Learning Recommendation Systems

Revisiting Who's Harry Potter: Towards Targeted Unlearning from a Causal Intervention Perspective

1 code implementation24 Jul 2024 Yujian Liu, Yang Zhang, Tommi Jaakkola, Shiyu Chang

This paper investigates Who's Harry Potter (WHP), a pioneering yet insufficiently understood method for LLM unlearning.

SeqMIA: Sequential-Metric Based Membership Inference Attack

1 code implementation21 Jul 2024 Hao Li, Zheng Li, Siyuan Wu, Chengrui Hu, Yutong Ye, Min Zhang, Dengguo Feng, Yang Zhang

Building upon this signal, we introduce a novel attack method called Sequential-metric based Membership Inference Attack (SeqMIA).

Inference Attack Knowledge Distillation +1

Towards Understanding Unsafe Video Generation

1 code implementation17 Jul 2024 Yan Pang, Aiping Xiong, Yang Zhang, Tianhao Wang

With the labeled information and the corresponding prompts, we created the first dataset of unsafe videos generated by VGMs.

Image Generation Video Generation

ICLGuard: Controlling In-Context Learning Behavior for Applicability Authorization

no code implementations9 Jul 2024 Wai Man Si, Michael Backes, Yang Zhang

It is a fine-tuning framework designed to allow the model owner to regulate ICL behavior on different data.

In-Context Learning

SOS! Soft Prompt Attack Against Open-Source Large Language Models

no code implementations3 Jul 2024 Ziqing Yang, Michael Backes, Yang Zhang, Ahmed Salem

In this work, we present a new training time attack, SOS, which is designed to be low in computational demand and does not require clean data or modification of the model weights, thereby maintaining the model's utility intact.

Backdoor Attack

Movable Antenna-enabled RIS-aided Integrated Sensing and Communication

no code implementations3 Jul 2024 Haisu Wu, Hong Ren, Cunhua Pan, Yang Zhang

In this paper, we investigate a movable antenna (MA)-aided integrated sensing and communication (ISAC) system, where a reconfigurable intelligent surface (RIS) is employed to enhance wireless communication and sensing performance in dead zones.

VSP: Assessing the dual challenges of perception and reasoning in spatial planning tasks for VLMs

1 code implementation2 Jul 2024 Qiucheng Wu, Handong Zhao, Michael Saxon, Trung Bui, William Yang Wang, Yang Zhang, Shiyu Chang

One understudied capability in VLMs is visual spatial planning -- the ability to comprehend the spatial arrangements of objects and devise action plans to achieve desired outcomes in visual scenes.

ProgressGym: Alignment with a Millennium of Moral Progress

no code implementations28 Jun 2024 Tianyi Qiu, Yang Zhang, Xuchuan Huang, Jasmine Xinze Li, Jiaming Ji, Yaodong Yang

Frontier AI systems, including large language models (LLMs), hold increasing influence over the epistemology of human users.

Decentralized Transformers with Centralized Aggregation are Sample-Efficient Multi-Agent World Models

1 code implementation22 Jun 2024 Yang Zhang, Chenjia Bai, Bin Zhao, Junchi Yan, Xiu Li, Xuelong Li

We cast the dynamics learning as an auto-regressive sequence modeling problem over discrete tokens by leveraging the expressive Transformer architecture, in order to model complex local dynamics across different agents and provide accurate and consistent long-term imaginations.

Reinforcement Learning (RL) SMAC+ +1

Decoding Matters: Addressing Amplification Bias and Homogeneity Issue for LLM-based Recommendation

no code implementations21 Jun 2024 Keqin Bao, Jizhi Zhang, Yang Zhang, Xinyue Huo, Chong Chen, Fuli Feng

However, we find these methods encounter significant challenges: 1) amplification bias -- where standard length normalization inflates scores for items containing tokens with generation probabilities close to 1 (termed ghost tokens), and 2) homogeneity issue -- generating multiple similar or repetitive items for a user.

Diversity

Geometric Self-Supervised Pretraining on 3D Protein Structures using Subgraphs

no code implementations20 Jun 2024 Michail Chatzianastasis, Yang Zhang, George Dasoulas, Michalis Vazirgiannis

Protein representation learning aims to learn informative protein embeddings capable of addressing crucial biological questions, such as protein function prediction.

Protein Function Prediction Representation Learning

Retrieval Augmented Fact Verification by Synthesizing Contrastive Arguments

no code implementations14 Jun 2024 Zhenrui Yue, Huimin Zeng, Lanyu Shang, Yifan Liu, Yang Zhang, Dong Wang

Upon input claims, RAFTS starts with evidence retrieval, where we design a retrieval pipeline to collect and re-rank relevant documents from verifiable sources.

Decision Making Fact Verification +2

Multiple Prior Representation Learning for Self-Supervised Monocular Depth Estimation via Hybrid Transformer

1 code implementation13 Jun 2024 Guodong Sun, Junjie Liu, Mingxuan Liu, Moyun Liu, Yang Zhang

To address these challenges, we introduce a novel self-supervised monocular depth estimation model that leverages multiple priors to bolster representation capabilities across spatial, context, and semantic dimensions.

Decoder Monocular Depth Estimation +1

Towards Unsupervised Speech Recognition Without Pronunciation Models

no code implementations12 Jun 2024 Junrui Ni, Liming Wang, Yang Zhang, Kaizhi Qian, Heting Gao, Mark Hasegawa-Johnson, Chang D. Yoo

Recent advancements in supervised automatic speech recognition (ASR) have achieved remarkable performance, largely due to the growing availability of large transcribed speech corpora.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Reversing the Forget-Retain Objectives: An Efficient LLM Unlearning Framework from Logit Difference

1 code implementation12 Jun 2024 Jiabao Ji, Yujian Liu, Yang Zhang, Gaowen Liu, Ramana Rao Kompella, Sijia Liu, Shiyu Chang

To achieve both goals, a mainstream class of LLM unlearning methods introduces an optimization framework with a combination of two objectives - maximizing the prediction loss on the forget documents while minimizing that on the retain documents, which suffers from two challenges, degenerated output and catastrophic forgetting.

A Probabilistic Framework for LLM Hallucination Detection via Belief Tree Propagation

1 code implementation11 Jun 2024 Bairu Hou, Yang Zhang, Jacob Andreas, Shiyu Chang

To address this problem, a popular class of methods utilize the LLM's self-consistencies in its beliefs in a set of logically related augmented statements generated by the LLM, which does not require external knowledge databases and can work with both white-box and black-box LLMs.

Hallucination

Text-like Encoding of Collaborative Information in Large Language Models for Recommendation

1 code implementation5 Jun 2024 Yang Zhang, Keqin Bao, Ming Yan, Wenjie Wang, Fuli Feng, Xiangnan He

BinLLM converts collaborative embeddings from external models into binary sequences -- a specific text format that LLMs can understand and operate on directly, facilitating the direct usage of collaborative information in text-like format by LLMs.

Evaluating Mathematical Reasoning of Large Language Models: A Focus on Error Identification and Correction

1 code implementation2 Jun 2024 Xiaoyuan Li, Wenjie Wang, Moxin Li, Junrong Guo, Yang Zhang, Fuli Feng

From the examiner perspective, we define four evaluation tasks for error identification and correction along with a new dataset with annotated error types and steps.

Mathematical Reasoning

Voice Jailbreak Attacks Against GPT-4o

1 code implementation29 May 2024 Xinyue Shen, Yixin Wu, Michael Backes, Yang Zhang

This resistance is primarily due to GPT-4o's internal safeguards and the difficulty of adapting text jailbreak prompts to voice mode.

Language Modelling Large Language Model +1

FinerCut: Finer-grained Interpretable Layer Pruning for Large Language Models

no code implementations28 May 2024 Yang Zhang, Yawei Li, Xinpeng Wang, Qianli Shen, Barbara Plank, Bernd Bischl, Mina Rezaei, Kenji Kawaguchi

Overparametrized transformer networks are the state-of-the-art architecture for Large Language Models (LLMs).

Decoder

Efficient Visual Fault Detection for Freight Train via Neural Architecture Search with Data Volume Robustness

no code implementations27 May 2024 Yang Zhang, Mingying Li, Huilin Pan, Moyun Liu, Yang Zhou

In this work, we propose an efficient NAS-based framework for visual fault detection of freight trains to search for the task-specific detection head with capacities of multi-scale representation.

Fault Detection Neural Architecture Search

Towards Efficient LLM Grounding for Embodied Multi-Agent Collaboration

no code implementations23 May 2024 Yang Zhang, Shixin Yang, Chenjia Bai, Fei Wu, Xiu Li, Zhen Wang, Xuelong Li

In this paper, we propose a novel framework for multi-agent collaboration that introduces Reinforced Advantage feedback (ReAd) for efficient self-refinement of plans.

regression

Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning

1 code implementation10 May 2024 Xiaoyu Wen, Chenjia Bai, Kang Xu, Xudong Yu, Yang Zhang, Xuelong Li, Zhen Wang

In this paper, we propose a novel representation-based approach to measure the domain gap, where the representation is learned through a contrastive objective by sampling transitions from different domains.

reinforcement-learning

Link Stealing Attacks Against Inductive Graph Neural Networks

1 code implementation9 May 2024 Yixin Wu, Xinlei He, Pascal Berrang, Mathias Humbert, Michael Backes, Neil Zhenqiang Gong, Yang Zhang

This paper fills the gap by conducting a systematic privacy analysis of inductive GNNs through the lens of link stealing attacks, one of the most popular attacks that are specifically designed for GNNs.

Graph Neural Network

UnsafeBench: Benchmarking Image Safety Classifiers on Real-World and AI-Generated Images

no code implementations6 May 2024 Yiting Qu, Xinyue Shen, Yixin Wu, Michael Backes, Savvas Zannettou, Yang Zhang

With the advent of text-to-image models and concerns about their misuse, developers are increasingly relying on image safety classifiers to moderate their generated unsafe images.

Benchmarking

Fair Recommendations with Limited Sensitive Attributes: A Distributionally Robust Optimization Approach

1 code implementation2 May 2024 Tianhao Shi, Yang Zhang, Jizhi Zhang, Fuli Feng, Xiangnan He

To this end, we propose Distributionally Robust Fair Optimization (DRFO), which minimizes the worst-case unfairness over all potential probability distributions of missing sensitive attributes instead of the reconstructed one to account for the impact of the reconstruction errors.

Attribute Fairness +1

Generative AI for Low-Carbon Artificial Intelligence of Things with Large Language Models

no code implementations28 Apr 2024 Jinbo Wen, Ruichen Zhang, Dusit Niyato, Jiawen Kang, Hongyang Du, Yang Zhang, Zhu Han

In this article, we explore the potential of GAI for carbon emissions reduction and propose a novel GAI-enabled solution for low-carbon AIoT.

Language Modelling RAG

WheelPose: Data Synthesis Techniques to Improve Pose Estimation Performance on Wheelchair Users

1 code implementation25 Apr 2024 William Huang, Sam Ghahremani, Siyou Pei, Yang Zhang

We present a data synthesis pipeline to address this disparity in data collection and subsequently improve pose estimation performance for wheelchair users.

Diversity Human Detection +3

decoupleQ: Towards 2-bit Post-Training Uniform Quantization via decoupling Parameters into Integer and Floating Points

1 code implementation19 Apr 2024 Yi Guo, Fanliu Kong, Xiaoyang Li, Hui Li, Wei Chen, Xiaogang Tian, Jinping Cai, Yang Zhang, Shouda Liu

However, existing quantization schemes suffer from significant accuracy degradation at very low bits, or require some additional computational overhead when deployed, making it difficult to be applied to large-scale applications in industry.

Quantization

Advancing the Robustness of Large Language Models through Self-Denoised Smoothing

1 code implementation18 Apr 2024 Jiabao Ji, Bairu Hou, Zhen Zhang, Guanhua Zhang, Wenqi Fan, Qing Li, Yang Zhang, Gaowen Liu, Sijia Liu, Shiyu Chang

Although large language models (LLMs) have achieved significant success, their vulnerability to adversarial perturbations, including recent jailbreak attacks, has raised considerable concerns.

Exact and Efficient Unlearning for Large Language Model-based Recommendation

no code implementations16 Apr 2024 Zhiyu Hu, Yang Zhang, Minghao Xiao, Wenjie Wang, Fuli Feng, Xiangnan He

The evolving paradigm of Large Language Model-based Recom- mendation (LLMRec) customizes Large Language Models (LLMs) through parameter-efficient fine-tuning (PEFT) using recommenda- tion data.

Language Modelling Large Language Model +1

RULER: What's the Real Context Size of Your Long-Context Language Models?

3 code implementations9 Apr 2024 Cheng-Ping Hsieh, Simeng Sun, Samuel Kriman, Shantanu Acharya, Dima Rekesh, Fei Jia, Yang Zhang, Boris Ginsburg

Despite achieving nearly perfect accuracy in the vanilla NIAH test, almost all models exhibit large performance drops as the context length increases.

Long-Context Understanding

Beamforming Design for Double-Active-RIS-aided Communication Systems with Inter-Excitation

no code implementations17 Mar 2024 Boshi Wang, Cunhua Pan, Hong Ren, Zhiyuan Yu, Yang Zhang, Mengyu Liu, Gui Zhou

Furthermore, the results demonstrate that the proposed scheme can enhance the WSR by 30\% compared to scenarios that do not take this effect into account when the maximum amplification gain is 40 dB.

Enhancing Semantic Fidelity in Text-to-Image Synthesis: Attention Regulation in Diffusion Models

1 code implementation11 Mar 2024 Yang Zhang, Teoh Tze Tzun, Lim Wei Hern, Tiviatis Sim, Kenji Kawaguchi

Recent advancements in diffusion models have notably improved the perceptual quality of generated images in text-to-image synthesis tasks.

Image Generation

The 2nd Workshop on Recommendation with Generative Models

no code implementations7 Mar 2024 Wenjie Wang, Yang Zhang, Xinyu Lin, Fuli Feng, Weiwen Liu, Yong liu, Xiangyu Zhao, Wayne Xin Zhao, Yang song, Xiangnan He

The rise of generative models has driven significant advancements in recommender systems, leaving unique opportunities for enhancing users' personalized recommendations.

Recommendation Systems

HMD-Poser: On-Device Real-time Human Motion Tracking from Scalable Sparse Observations

no code implementations CVPR 2024 Peng Dai, Yang Zhang, Tao Liu, Zhen Fan, Tianyuan Du, Zhuo Su, Xiaozheng Zheng, Zeming Li

It is especially challenging to achieve real-time human motion tracking on a standalone VR Head-Mounted Display (HMD) such as Meta Quest and PICO.

PICO

DomainVerse: A Benchmark Towards Real-World Distribution Shifts For Tuning-Free Adaptive Domain Generalization

no code implementations5 Mar 2024 Feng Hou, Jin Yuan, Ying Yang, Yang Liu, Yang Zhang, Cheng Zhong, Zhongchao shi, Jianping Fan, Yong Rui, Zhiqiang He

With the recent advance of vision-language models (VLMs), viewed as natural source models, the cross-domain task changes to directly adapt the pre-trained source model to arbitrary target domains equipped with prior domain knowledge, and we name this task Adaptive Domain Generalization (ADG).

Domain Generalization

A Comprehensive Survey on Process-Oriented Automatic Text Summarization with Exploration of LLM-Based Methods

no code implementations5 Mar 2024 Hanlei Jin, Yang Zhang, Dan Meng, Jun Wang, Jinghua Tan

Automatic Text Summarization (ATS), utilizing Natural Language Processing (NLP) algorithms, aims to create concise and accurate summaries, thereby significantly reducing the human effort required in processing large volumes of text.

Survey Text Summarization

Large Language Models are Learnable Planners for Long-Term Recommendation

1 code implementation29 Feb 2024 Wentao Shi, Xiangnan He, Yang Zhang, Chongming Gao, Xinyue Li, Jizhi Zhang, Qifan Wang, Fuli Feng

To this end, we propose a Bi-level Learnable LLM Planner framework, which consists of a set of LLM instances and breaks down the learning process into macro-learning and micro-learning to learn macro-level guidance and micro-level personalized recommendation policies, respectively.

Decision Making Language Modelling +2

Lower-Left Partial AUC: An Effective and Efficient Optimization Metric for Recommendation

no code implementations29 Feb 2024 Wentao Shi, Chenxu Wang, Fuli Feng, Yang Zhang, Wenjie Wang, Junkang Wu, Xiangnan He

Compared to AUC, LLPAUC considers only the partial area under the ROC curve in the Lower-Left corner to push the optimization focus on Top-K. We provide theoretical validation of the correlation between LLPAUC and Top-K ranking metrics and demonstrate its robustness to noisy user feedback.

Recommendation Systems

Prospect Personalized Recommendation on Large Language Model-based Agent Platform

1 code implementation28 Feb 2024 Jizhi Zhang, Keqin Bao, Wenjie Wang, Yang Zhang, Wentao Shi, Wanhong Xu, Fuli Feng, Tat-Seng Chua

Additionally, we prospect the evolution of Rec4Agentverse and conceptualize it into three stages based on the enhancement of the interaction and information exchange among Agent Items, Agent Recommender, and the user.

Language Modelling Large Language Model +1

An Efficient MLP-based Point-guided Segmentation Network for Ore Images with Ambiguous Boundary

1 code implementation27 Feb 2024 Guodong Sun, Yuting Peng, Le Cheng, Mengya Xu, An Wang, Bo Wu, Hongliang Ren, Yang Zhang

The precise segmentation of ore images is critical to the successful execution of the beneficiation process.

Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing

1 code implementation25 Feb 2024 Jiabao Ji, Bairu Hou, Alexander Robey, George J. Pappas, Hamed Hassani, Yang Zhang, Eric Wong, Shiyu Chang

Aligned large language models (LLMs) are vulnerable to jailbreaking attacks, which bypass the safeguards of targeted LLMs and fool them into generating objectionable content.

Instruction Following

FlexHB: a More Efficient and Flexible Framework for Hyperparameter Optimization

no code implementations21 Feb 2024 Yang Zhang, Haiyang Wu, Yuekui Yang

Comprehensive study on FlexHB shows that (1) our fine-grained fidelity method considerably enhances the efficiency of searching optimal configurations, (2) our FlexBand framework (self-adaptive allocation of SH brackets, and global ranking of configurations in both current and past SH procedures) grants the algorithm with more flexibility and improves the anytime performance.

Bayesian Optimization Hyperparameter Optimization

A Framework of RIS-assisted ICSC User-centric Based Systems: Latency Optimization and Design

no code implementations21 Feb 2024 Jiahua Wan, Hong Ren, Zhiyuan Yu, Zhenkun Zhang, Yang Zhang, Cunhua Pan, Jiangzhou Wang

To address the formulated non-convex problem in the multi-UE scenario, we decouple the original problem into two subproblems, where the computational and beamforming settings are optimized alternately.

Edge-computing

Prompt Stealing Attacks Against Large Language Models

no code implementations20 Feb 2024 Zeyang Sha, Yang Zhang

Our proposed prompt stealing attack aims to steal these well-designed prompts based on the generated answers.

Prompt Engineering

VGMShield: Mitigating Misuse of Video Generative Models

1 code implementation20 Feb 2024 Yan Pang, Yang Zhang, Tianhao Wang

Together with fake video detection and tracing, our multi-faceted set of solutions can effectively mitigate misuse of video generative models.

Video Generation

Are LLM-based Evaluators Confusing NLG Quality Criteria?

2 code implementations19 Feb 2024 Xinyu Hu, Mingqi Gao, Sen Hu, Yang Zhang, Yicheng Chen, Teng Xu, Xiaojun Wan

Some prior work has shown that LLMs perform well in NLG evaluation for different tasks.

nlg evaluation

Instruction Backdoor Attacks Against Customized LLMs

1 code implementation14 Feb 2024 Rui Zhang, Hongwei Li, Rui Wen, Wenbo Jiang, Yuan Zhang, Michael Backes, Yun Shen, Yang Zhang

The increasing demand for customized Large Language Models (LLMs) has led to the development of solutions like GPTs.

Language Modelling Large Language Model +2

Synthesizing Knowledge-enhanced Features for Real-world Zero-shot Food Detection

1 code implementation14 Feb 2024 Pengfei Zhou, Weiqing Min, Jiajun Song, Yang Zhang, Shuqiang Jiang

The complexity of food semantic attributes further makes it more difficult for current ZSD methods to distinguish various food categories.

Attribute Generalized Zero-Shot Object Detection +2

Accurate LoRA-Finetuning Quantization of LLMs via Information Retention

1 code implementation8 Feb 2024 Haotong Qin, Xudong Ma, Xingyu Zheng, Xiaoyang Li, Yang Zhang, Shouda Liu, Jie Luo, Xianglong Liu, Michele Magno

This paper proposes a novel IR-QLoRA for pushing quantized LLMs with LoRA to be highly accurate through information retention.

MMLU Quantization

Comprehensive Assessment of Jailbreak Attacks Against LLMs

2 code implementations8 Feb 2024 Junjie Chu, Yugeng Liu, Ziqing Yang, Xinyue Shen, Michael Backes, Yang Zhang

Some jailbreak prompt datasets, available from the Internet, can also achieve high attack success rates on many LLMs, such as ChatGLM3, GPT-3. 5, and PaLM2.

Ethics

Reconstruct Your Previous Conversations! Comprehensively Investigating Privacy Leakage Risks in Conversations with GPT Models

no code implementations5 Feb 2024 Junjie Chu, Zeyang Sha, Michael Backes, Yang Zhang

This attack targets the contents of previous conversations between GPT models and benign users, i. e., the benign users' input contents during their interaction with GPT models.

Reconstruction Attack Semantic Similarity +1

GUARD: Role-playing to Generate Natural-language Jailbreakings to Test Guideline Adherence of Large Language Models

no code implementations5 Feb 2024 Haibo Jin, Ruoxi Chen, Andy Zhou, Yang Zhang, Haohan Wang

Our system of different roles will leverage this knowledge graph to generate new jailbreaks, which have proved effective in inducing LLMs to generate unethical or guideline-violating responses.

Sentence

Secure Wireless Communication in Active RIS-Assisted DFRC System

no code implementations3 Feb 2024 Yang Zhang, Hong Ren, Cunhua Pan, Boshi Wang, Zhiyuan Yu, Ruisong Weng, Tuo Wu, Yongchao He

This work considers a dual-functional radar and communication (DFRC) system with an active reconfigurable intelligent surface (RIS) and a potential eavesdropper.

Augment before You Try: Knowledge-Enhanced Table Question Answering via Table Expansion

1 code implementation28 Jan 2024 Yujian Liu, Jiabao Ji, Tong Yu, Ryan Rossi, Sungchul Kim, Handong Zhao, Ritwik Sinha, Yang Zhang, Shiyu Chang

Table question answering is a popular task that assesses a model's ability to understand and interact with structured data.

Question Answering

The Stronger the Diffusion Model, the Easier the Backdoor: Data Poisoning to Induce Copyright Breaches Without Adjusting Finetuning Pipeline

no code implementations7 Jan 2024 Haonan Wang, Qianli Shen, Yao Tong, Yang Zhang, Kenji Kawaguchi

Our method strategically embeds connections between pieces of copyrighted information and text references in poisoning data while carefully dispersing that information, making the poisoning data inconspicuous when integrated into a clean dataset.

Backdoor Attack Data Poisoning +1

Preliminary Study on Incremental Learning for Large Language Model-based Recommender Systems

no code implementations25 Dec 2023 Tianhao Shi, Yang Zhang, Zhijian Xu, Chong Chen, Fuli Feng, Xiangnan He, Qi Tian

Instead of dismissing the role of incremental learning, we attribute the lack of anticipated performance enhancement to a mismatch between the LLM4Rec architecture and incremental learning: LLM4Rec employs a single adaptation module for learning recommendations, limiting its ability to simultaneously capture long-term and short-term user preferences in the incremental learning context.

Attribute Incremental Learning +3

FAKEPCD: Fake Point Cloud Detection via Source Attribution

no code implementations18 Dec 2023 Yiting Qu, Zhikun Zhang, Yun Shen, Michael Backes, Yang Zhang

Take the open-world attribution as an example, FAKEPCD attributes point clouds to known sources with an accuracy of 0. 82-0. 98 and to unknown sources with an accuracy of 0. 73-1. 00.

Attribute Cloud Detection

LabelCraft: Empowering Short Video Recommendations with Automated Label Crafting

1 code implementation18 Dec 2023 Yimeng Bai, Yang Zhang, Jing Lu, Jianxin Chang, Xiaoxue Zang, Yanan Niu, Yang song, Fuli Feng

Through meta-learning techniques, LabelCraft effectively addresses the bi-level optimization hurdle posed by the recommender and labeling models, enabling the automatic acquisition of intricate label generation mechanisms. Extensive experiments on real-world datasets corroborate LabelCraft's excellence across varied operational metrics, encompassing usage time, user engagement, and retention.

Meta-Learning Model Optimization

Polyper: Boundary Sensitive Polyp Segmentation

1 code implementation14 Dec 2023 Hao Shao, Yang Zhang, Qibin Hou

We present a new boundary sensitive framework for polyp segmentation, called Polyper.

Segmentation

RdimKD: Generic Distillation Paradigm by Dimensionality Reduction

no code implementations14 Dec 2023 Yi Guo, Yiqian He, Xiaoyang Li, Haotong Qin, Van Tung Pham, Yang Zhang, Shouda Liu

Knowledge Distillation (KD) emerges as one of the most promising compression technologies to run advanced deep neural networks on resource-limited devices.

Dimensionality Reduction Knowledge Distillation

Spatial-wise Dynamic Distillation for MLP-like Efficient Visual Fault Detection of Freight Trains

1 code implementation10 Dec 2023 Yang Zhang, Huilin Pan, Mingying Li, An Wang, Yang Zhou, Hongliang Ren

Existing modeling shortcomings of spatial invariance and pooling layers in conventional CNNs often ignore the neglect of crucial global information, resulting in error localization for fault objection tasks of freight trains.

Fault Detection object-detection +1

Correcting Diffusion Generation through Resampling

1 code implementation CVPR 2024 Yujian Liu, Yang Zhang, Tommi Jaakkola, Shiyu Chang

Despite diffusion models' superior capabilities in modeling complex distributions, there are still non-trivial distributional discrepancies between generated and ground-truth images, which has resulted in several notable problems in image generation, including missing object errors in text-to-image generation and low image quality.

Object Text-to-Image Generation

From Beginner to Expert: Modeling Medical Knowledge into General LLMs

no code implementations2 Dec 2023 Qiang Li, Xiaoyan Yang, Haowen Wang, Qin Wang, Lei Liu, Junjie Wang, Yang Zhang, Mingyuan Chu, Sen Hu, Yicheng Chen, Yue Shen, Cong Fan, Wangshu Zhang, Teng Xu, Jinjie Gu, Jing Zheng, Guannan Zhang Ant Group

(3) Specifically for multi-choice questions in the medical domain, we propose a novel Verification-of-Choice approach for prompting engineering, which significantly enhances the reasoning ability of LLMs.

Language Modelling Large Language Model +3

VIoTGPT: Learning to Schedule Vision Tools towards Intelligent Video Internet of Things

no code implementations1 Dec 2023 Yaoyao Zhong, Mengshi Qi, Rui Wang, Yuhan Qiu, Yang Zhang, Huadong Ma

Video Internet of Things (VIoT) has shown full potential in collecting an unprecedented volume of video data.

Decomposing Uncertainty for Large Language Models through Input Clarification Ensembling

1 code implementation15 Nov 2023 Bairu Hou, Yujian Liu, Kaizhi Qian, Jacob Andreas, Shiyu Chang, Yang Zhang

We show that, when aleatoric uncertainty arises from ambiguity or under-specification in LLM inputs, this approach makes it possible to factor an (unclarified) LLM's predictions into separate aleatoric and epistemic terms, using a decomposition similar to the one employed by Bayesian neural networks.

Uncertainty Quantification

Comprehensive Assessment of Toxicity in ChatGPT

no code implementations3 Nov 2023 Boyang Zhang, Xinyue Shen, Wai Man Si, Zeyang Sha, Zeyuan Chen, Ahmed Salem, Yun Shen, Michael Backes, Yang Zhang

Moderating offensive, hateful, and toxic language has always been an important but challenging topic in the domain of safe use in NLP.

A New Fine-grained Alignment Method for Image-text Matching

no code implementations3 Nov 2023 Yang Zhang

For this purpose, we introduce the Cross-Modal Prominent Fragments Enhancement Aligning Network(CPFEAN), which achieves improved retrieval accuracy by diminishing the participation of irrelevant regions during alignment and relatively increasing the alignment similarity of prominent words.

Image-text matching Image-text Retrieval +2

CoLLM: Integrating Collaborative Embeddings into Large Language Models for Recommendation

1 code implementation30 Oct 2023 Yang Zhang, Fuli Feng, Jizhi Zhang, Keqin Bao, Qifan Wang, Xiangnan He

In pursuit of superior recommendations for both cold and warm start scenarios, we introduce CoLLM, an innovative LLMRec methodology that seamlessly incorporates collaborative information into LLMs for recommendation.

Revitalizing Legacy Video Content: Deinterlacing with Bidirectional Information Propagation

no code implementations30 Oct 2023 Zhaowei Gao, Mingyang Song, Christopher Schroers, Yang Zhang

Our proposed method supports bidirectional spatio-temporal information propagation across multiple scales to leverage information in both space and time.

Generated Distributions Are All You Need for Membership Inference Attacks Against Generative Models

1 code implementation30 Oct 2023 Minxing Zhang, Ning Yu, Rui Wen, Michael Backes, Yang Zhang

Several membership inference attacks (MIAs) have been proposed to exhibit the privacy vulnerability of generative models by classifying a query image as a training dataset member or nonmember.

Inference Attack Membership Inference Attack

From Generative AI to Generative Internet of Things: Fundamentals, Framework, and Outlooks

no code implementations27 Oct 2023 Jinbo Wen, Jiangtian Nie, Jiawen Kang, Dusit Niyato, Hongyang Du, Yang Zhang, Mohsen Guizani

Generative Artificial Intelligence (GAI) possesses the capabilities of generating realistic data and facilitating advanced decision-making.

Decision Making Management

DASA: Difficulty-Aware Semantic Augmentation for Speaker Verification

no code implementations18 Oct 2023 Yuanyuan Wang, Yang Zhang, Zhiyong Wu, Zhihan Yang, Tao Wei, Kun Zou, Helen Meng

Existing augmentation methods for speaker verification manipulate the raw signal, which are time-consuming and the augmented samples lack diversity.

Data Augmentation Diversity +1

Last One Standing: A Comparative Analysis of Security and Privacy of Soft Prompt Tuning, LoRA, and In-Context Learning

no code implementations17 Oct 2023 Rui Wen, Tianhao Wang, Michael Backes, Yang Zhang, Ahmed Salem

Large Language Models (LLMs) are powerful tools for natural language processing, enabling novel applications and user experiences.

In-Context Learning

A Comprehensive Study of Privacy Risks in Curriculum Learning

no code implementations16 Oct 2023 Joann Qiongna Chen, Xinlei He, Zheng Li, Yang Zhang, Zhou Li

Training a machine learning model with data following a meaningful order, i. e., from easy to hard, has been proven to be effective in accelerating the training process and achieving better model performance.

Attribute Inference Attack +3

LGL-BCI: A Lightweight Geometric Learning Framework for Motor Imagery-Based Brain-Computer Interfaces

no code implementations12 Oct 2023 Jianchao Lu, Yuzhe Tian, Yang Zhang, Jiaqi Ge, Quan Z. Sheng, Xi Zheng

The efficiency, assessed on two public EEG datasets and two real-world EEG devices, significantly outperforms the state-of-the-art solution in accuracy ($82. 54\%$ versus $62. 22\%$) with fewer parameters (64. 9M compared to 183. 7M).

EEG Motor Imagery

Composite Backdoor Attacks Against Large Language Models

1 code implementation11 Oct 2023 Hai Huang, Zhengyu Zhao, Michael Backes, Yun Shen, Yang Zhang

Such a Composite Backdoor Attack (CBA) is shown to be stealthier than implanting the same multiple trigger keys in only a single component.

Backdoor Attack

Prompt Backdoors in Visual Prompt Learning

no code implementations11 Oct 2023 Hai Huang, Zhengyu Zhao, Michael Backes, Yun Shen, Yang Zhang

Specifically, the VPPTaaS provider optimizes a visual prompt given downstream data, and downstream users can use this prompt together with the large pre-trained model for prediction.

Backdoor Attack

A Chat About Boring Problems: Studying GPT-based text normalization

no code implementations23 Sep 2023 Yang Zhang, Travis M. Bartley, Mariana Graterol-Fuenmayor, Vitaly Lavrukhin, Evelina Bakhturina, Boris Ginsburg

Through this new framework, we can identify strengths and weaknesses of GPT-based TN, opening opportunities for future work.

Prompt Engineering

On Copyright Risks of Text-to-Image Diffusion Models

no code implementations15 Sep 2023 Yang Zhang, Teoh Tze Tzun, Lim Wei Hern, Haonan Wang, Kenji Kawaguchi

Specifically, we introduce a data generation pipeline to systematically produce data for studying copyright in diffusion models.

Efficient Real-time Path Planning with Self-evolving Particle Swarm Optimization in Dynamic Scenarios

1 code implementation20 Aug 2023 Jinghao Xin, Zhi Li, Yang Zhang, Ning li

Particle Swarm Optimization (PSO) has demonstrated efficacy in addressing static path planning problems.

Computational Efficiency

A Dual-Perspective Approach to Evaluating Feature Attribution Methods

no code implementations17 Aug 2023 Yawei Li, Yang Zhang, Kenji Kawaguchi, Ashkan Khakzar, Bernd Bischl, Mina Rezaei

We apply these metrics to mainstream attribution methods, offering a novel lens through which to analyze and compare feature attribution methods.

A Bi-Step Grounding Paradigm for Large Language Models in Recommendation Systems

1 code implementation16 Aug 2023 Keqin Bao, Jizhi Zhang, Wenjie Wang, Yang Zhang, Zhengyi Yang, Yancheng Luo, Chong Chen, Fuli Feng, Qi Tian

As the focus on Large Language Models (LLMs) in the field of recommendation intensifies, the optimization of LLMs for recommendation purposes (referred to as LLM4Rec) assumes a crucial role in augmenting their effectiveness in providing recommendations.

Collaborative Filtering Recommendation Systems

SAM Meets Robotic Surgery: An Empirical Study on Generalization, Robustness and Adaptation

no code implementations14 Aug 2023 An Wang, Mobarakol Islam, Mengya Xu, Yang Zhang, Hongliang Ren

Our extensive evaluation results reveal that although SAM shows remarkable zero-shot generalization ability with bounding box prompts, it struggles to segment the whole instrument with point-based prompts and unprompted settings.

Semantic Segmentation Zero-shot Generalization

Spatial-information Guided Adaptive Context-aware Network for Efficient RGB-D Semantic Segmentation

1 code implementation11 Aug 2023 Yang Zhang, Chenyun Xiong, Junjie Liu, Xuhui Ye, Guodong Sun

Efficient RGB-D semantic segmentation has received considerable attention in mobile robots, which plays a vital role in analyzing and recognizing environmental information.

Decoder Segmentation +1

You Only Prompt Once: On the Capabilities of Prompt Learning on Large Language Models to Tackle Toxic Content

1 code implementation10 Aug 2023 Xinlei He, Savvas Zannettou, Yun Shen, Yang Zhang

We find that prompt learning achieves around 10\% improvement in the toxicity classification task compared to the baselines, while for the toxic span detection task we find better performance to the best baseline (0. 643 vs. 0. 640 in terms of $F_1$-score).

Conformer-based Target-Speaker Automatic Speech Recognition for Single-Channel Audio

2 code implementations9 Aug 2023 Yang Zhang, Krishna C. Puvvada, Vitaly Lavrukhin, Boris Ginsburg

We propose CONF-TSASR, a non-autoregressive end-to-end time-frequency domain architecture for single-channel target-speaker automatic speech recognition (TS-ASR).

Automatic Speech Recognition speech-recognition +1

"Do Anything Now": Characterizing and Evaluating In-The-Wild Jailbreak Prompts on Large Language Models

1 code implementation7 Aug 2023 Xinyue Shen, Zeyuan Chen, Michael Backes, Yun Shen, Yang Zhang

We hope that our study can facilitate the research community and LLM vendors in promoting safer and regulated LLMs.

Community Detection

Learning to Select the Relevant History Turns in Conversational Question Answering

no code implementations4 Aug 2023 Munazza Zaib, Wei Emma Zhang, Quan Z. Sheng, Subhash Sagar, Adnan Mahmood, Yang Zhang

In this paper, we propose a framework, DHS-ConvQA (Dynamic History Selection in Conversational Question Answering), that first generates the context and question entities for all the history turns, which are then pruned on the basis of similarity they share in common with the question at hand.

Binary Classification Conversational Question Answering +1

Blockchain-empowered Federated Learning for Healthcare Metaverses: User-centric Incentive Mechanism with Optimal Data Freshness

no code implementations29 Jul 2023 Jiawen Kang, Jinbo Wen, Dongdong Ye, Bingkun Lai, Tianhao Wu, Zehui Xiong, Jiangtian Nie, Dusit Niyato, Yang Zhang, Shengli Xie

Given the revolutionary role of metaverses, healthcare metaverses are emerging as a transformative force, creating intelligent healthcare systems that offer immersive and personalized services.

Decision Making Federated Learning +1

FATRER: Full-Attention Topic Regularizer for Accurate and Robust Conversational Emotion Recognition

1 code implementation23 Jul 2023 Yuzhao Mao, Di Lu, Xiaojie Wang, Yang Zhang

This paper concentrates on the understanding of interlocutors' emotions evoked in conversational utterances.