Search Results for author: Zeyu Li

Found 20 papers, 9 papers with code

Can LLMs Maintain Fundamental Abilities under KV Cache Compression?

no code implementations4 Feb 2025 Xiang Liu, Zhenheng Tang, Hong Chen, Peijie Dong, Zeyu Li, Xiuze Zhou, Bo Li, Xuming Hu, Xiaowen Chu

We present a comprehensive empirical study evaluating prominent KV cache compression methods across diverse tasks, spanning world knowledge, commonsense reasoning, arithmetic reasoning, code generation, safety, and long-context understanding and generation. Our analysis reveals that KV cache compression methods exhibit task-specific performance degradation.

Arithmetic Reasoning Code Generation +2

ChunkKV: Semantic-Preserving KV Cache Compression for Efficient Long-Context LLM Inference

no code implementations1 Feb 2025 Xiang Liu, Zhenheng Tang, Peijie Dong, Zeyu Li, Bo Li, Xuming Hu, Xiaowen Chu

To reduce memory costs in long-context inference with Large Language Models (LLMs), many recent works focus on compressing the key-value (KV) cache of different tokens.

GSM8K In-Context Learning

Representational Transfer Learning for Matrix Completion

no code implementations9 Dec 2024 Yong He, Zeyu Li, Dong Liu, Kangxiang Qin, Jiahui Xie

We propose to transfer representational knowledge from multiple sources to a target noisy matrix completion task by aggregating singular subspaces information.

Matrix Completion Transfer Learning

AI-generated Image Detection: Passive or Watermark?

1 code implementation20 Nov 2024 Moyang Guo, Yuepeng Hu, Zhengyuan Jiang, Zeyu Li, Amir Sadovnik, Arka Daw, Neil Gong

Based on these insights, we provide recommendations for detecting AI-generated images, e. g., when both types of detectors are applicable, watermark-based detectors should be the preferred choice.

Should We Really Edit Language Models? On the Evaluation of Edited Language Models

1 code implementation24 Oct 2024 Qi Li, Xiang Liu, Zhenheng Tang, Peijie Dong, Zeyu Li, Xinglin Pan, Xiaowen Chu

Our findings indicate that current editing methods are only suitable for small-scale knowledge updates within language models, which motivates further research on more practical and reliable editing methods.

General Knowledge Model Editing

OpenAnimals: Revisiting Person Re-Identification for Animals Towards Better Generalization

no code implementations30 Sep 2024 Saihui Hou, Panjian Huang, Zengbin Wang, YuAn Liu, Zeyu Li, Man Zhang, Yongzhen Huang

This paper addresses the challenge of animal re-identification, an emerging field that shares similarities with person re-identification but presents unique complexities due to the diverse species, environments and poses.

Person Re-Identification

Physics-aligned Schrödinger bridge

no code implementations26 Sep 2024 Zeyu Li, Hongkun Dou, Shen Fang, Wang Han, Yue Deng, LiJun Yang

To overcome this limitation, we introduce a novel data-driven field reconstruction framework, termed the Physics-aligned Schr\"{o}dinger Bridge (PalSB).

MaterialSeg3D: Segmenting Dense Materials from 2D Priors for 3D Assets

no code implementations22 Apr 2024 Zeyu Li, Ruitong Gan, Chuanchen Luo, Yuxi Wang, Jiaheng Liu, Ziwei Zhu Man Zhang, Qing Li, XuCheng Yin, Zhaoxiang Zhang, Junran Peng

Driven by powerful image diffusion models, recent research has achieved the automatic creation of 3D objects from textual or visual guidance.

Knowledge Transfer across Multiple Principal Component Analysis Studies

no code implementations12 Mar 2024 Zeyu Li, Kangxiang Qin, Yong He, Wang Zhou, Xinsheng Zhang

In the first step, we integrate the shared subspace information across multiple studies by a proposed method named as Grassmannian barycenter, instead of directly performing PCA on the pooled dataset.

Activity Recognition Transfer Learning

Oceanship: A Large-Scale Dataset for Underwater Audio Target Recognition

1 code implementation4 Jan 2024 Zeyu Li, Suncheng Xiang, Tong Yu, Jingsheng Gao, Jiacheng Ruan, Yanping Hu, Ting Liu, Yuzhuo Fu

While audio retrieval tasks are well-established in general audio classification, they have not been explored in the context of underwater audio recognition.

Attribute Audio Classification +3

MEAOD: Model Extraction Attack against Object Detectors

no code implementations22 Dec 2023 Zeyu Li, Chenghui Shi, Yuwen Pu, Xuhong Zhang, Yu Li, Jinbao Li, Shouling Ji

The widespread use of deep learning technology across various industries has made deep neural network models highly valuable and, as a result, attractive targets for potential attackers.

Active Learning model +4

Geo2SigMap: High-Fidelity RF Signal Mapping Using Geographic Databases

1 code implementation21 Dec 2023 Yiming Li, Zeyu Li, Zhihui Gao, Tingjun Chen

First, we develop an automated framework that seamlessly integrates three open-source tools: OpenStreetMap (geographic databases), Blender (computer graphics), and Sionna (ray tracing), enabling the efficient generation of large-scale 3D building maps and ray tracing models.

Dissecting the Runtime Performance of the Training, Fine-tuning, and Inference of Large Language Models

no code implementations7 Nov 2023 Longteng Zhang, Xiang Liu, Zeyu Li, Xinglin Pan, Peijie Dong, Ruibo Fan, Rui Guo, Xin Wang, Qiong Luo, Shaohuai Shi, Xiaowen Chu

For end users, our benchmark and findings help better understand different optimization techniques, training and inference frameworks, together with hardware platforms in choosing configurations for deploying LLMs.

Quantization

CluCDD:Contrastive Dialogue Disentanglement via Clustering

1 code implementation16 Feb 2023 Jingsheng Gao, Zeyu Li, Suncheng Xiang, Ting Liu, Yuzhuo Fu

A huge number of multi-participant dialogues happen online every day, which leads to difficulty in understanding the nature of dialogue dynamics for both humans and machines.

Clustering Contrastive Learning +1

Powering Comparative Classification with Sentiment Analysis via Domain Adaptive Knowledge Transfer

1 code implementation EMNLP 2021 Zeyu Li, Yilong Qin, Zihan Liu, Wei Wang

We study Comparative Preference Classification (CPC) which aims at predicting whether a preference comparison exists between two entities in a given sentence and, if so, which entity is preferred over the other.

Graph Neural Network Question Answering +3

Recommend for a Reason: Unlocking the Power of Unsupervised Aspect-Sentiment Co-Extraction

1 code implementation Findings (EMNLP) 2021 Zeyu Li, Wei Cheng, Reema Kshetramade, John Houser, Haifeng Chen, Wei Wang

Compliments and concerns in reviews are valuable for understanding users' shopping interests and their opinions with respect to specific aspects of certain items.

Towards Visual Explainable Active Learning for Zero-Shot Classification

no code implementations15 Aug 2021 Shichao Jia, Zeyu Li, Nuo Chen, Jiawan Zhang

This paper proposes a visual explainable active learning approach with its design and implementation called semantic navigator to solve the above problems.

Active Learning Attribute +2

Learning Gender-Neutral Word Embeddings

1 code implementation EMNLP 2018 Jieyu Zhao, Yichao Zhou, Zeyu Li, Wei Wang, Kai-Wei Chang

Word embedding models have become a fundamental component in a wide range of Natural Language Processing (NLP) applications.

Word Embeddings

Peeking the Impact of Points of Interests on Didi

no code implementations6 Apr 2018 Yonghong Tian, Zeyu Li, Zhiwei Xu, Xuying Meng, Bing Zheng

Recently, the online car-hailing service, Didi, has emerged as a leader in the sharing economy.

Cannot find the paper you are looking for? You can Submit a new open access paper.