Search Results for author: Shaochen Zhong

Found 11 papers, 8 papers with code

LoRA-as-an-Attack! Piercing LLM Safety Under The Share-and-Play Scenario

no code implementations 29 Feb 2024 Hongyi Liu, Zirui Liu, Ruixiang Tang, Jiayi Yuan, Shaochen Zhong, Yu-Neng Chuang, Li Li, Rui Chen, Xia Hu

Our aim is to raise awareness of the potential risks under the emerging share-and-play scenario, so as to proactively prevent potential consequences caused by LoRA-as-an-Attack.

FaithLM: Towards Faithful Explanations for Large Language Models

no code implementations 7 Feb 2024 Yu-Neng Chuang, Guanchu Wang, Chia-Yuan Chang, Ruixiang Tang, Shaochen Zhong, Fan Yang, Mengnan Du, Xuanting Cai, Xia Hu

Large Language Models (LLMs) have become proficient in addressing complex tasks by leveraging their extensive internal knowledge and reasoning capabilities.

Decision Making

KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache

1 code implementation 5 Feb 2024 Zirui Liu, Jiayi Yuan, Hongye Jin, Shaochen Zhong, Zhaozhuo Xu, Vladimir Braverman, Beidi Chen, Xia Hu

However, there is a lack of in-depth studies that explore the element distribution of KV cache to understand the hardness and limitation of KV cache quantization.

Quantization

TVE: Learning Meta-attribution for Transferable Vision Explainer

1 code implementation 23 Dec 2023 Guanchu Wang, Yu-Neng Chuang, Fan Yang, Mengnan Du, Chia-Yuan Chang, Shaochen Zhong, Zirui Liu, Zhaozhuo Xu, Kaixiong Zhou, Xuanting Cai, Xia Hu

This meta-attribution leverages the versatility of generic backbone encoders to comprehensively encode the attribution knowledge for the input instance, which enables TVE to seamlessly transfer to explain various downstream tasks, without the need for training on task-specific data.

Editable Graph Neural Network for Node Classifications

no code implementations 24 May 2023 Zirui Liu, Zhimeng Jiang, Shaochen Zhong, Kaixiong Zhou, Li Li, Rui Chen, Soo-Hyun Choi, Xia Hu

However, model editing for graph neural networks (GNNs) is rarely explored, despite GNNs' widespread applicability.

Fake News Detection, Graph Neural Network, +1

Winner-Take-All Column Row Sampling for Memory Efficient Adaptation of Language Model

1 code implementation NeurIPS 2023 Zirui Liu, Guanchu Wang, Shaochen Zhong, Zhaozhuo Xu, Daochen Zha, Ruixiang Tang, Zhimeng Jiang, Kaixiong Zhou, Vipin Chaudhary, Shuai Xu, Xia Hu

While the model parameters do contribute to memory usage, the primary memory bottleneck during training arises from storing feature maps, also known as activations, as they are crucial for gradient calculation.

Language Modeling, Language Modelling, +1

Data-centric Artificial Intelligence: A Survey

10 code implementations 17 Mar 2023 Daochen Zha, Zaid Pervaiz Bhat, Kwei-Herng Lai, Fan Yang, Zhimeng Jiang, Shaochen Zhong, Xia Hu

Artificial Intelligence (AI) is making a profound impact in almost every domain.

Survey

Revisit Kernel Pruning with Lottery Regulated Grouped Convolutions

1 code implementation ICLR 2022 Shaochen Zhong, Guanqun Zhang, Ningjia Huang, Shuai Xu

In this paper, we revisit the idea of kernel pruning (pruning only one or several $k \times k$ kernels out of a 3D filter), an approach heavily overlooked in the context of structured pruning because it naturally introduces sparsity to filters within the same convolutional layer, making the remaining network no longer dense.

Clustering, Network Pruning
