Search Results for author: Zhan Chen

Found 14 papers, 8 papers with code

Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs

1 code implementation 20 Feb 2025 Tao Ji, Bin Guo, Yuanbin Wu, Qipeng Guo, Lixing Shen, Zhan Chen, Xipeng Qiu, Qi Zhang, Tao Gui

For example, the KV cache size of Llama2-7B is reduced by 92.19%, with only a 0.5% drop in LongBench performance.

Quantization

Rethinking the Adversarial Robustness of Multi-Exit Neural Networks in an Attack-Defense Game

no code implementations CVPR 2025 Keyizhi Xu, Chi Zhang, Zhan Chen, Zhongyuan Wang, Chunxia Xiao, Chao Liang

Multi-exit neural networks represent a promising approach to enhancing model inference efficiency, yet like common neural networks, they suffer from significantly reduced robustness against adversarial attacks.

Adversarial Robustness

UNetMamba: An Efficient UNet-Like Mamba for Semantic Segmentation of High-Resolution Remote Sensing Images

1 code implementation 21 Aug 2024 Enze Zhu, Zhan Chen, Dingkai Wang, Hanru Shi, Xiaoxuan Liu, Lei Wang

Semantic segmentation of high-resolution remote sensing images is vital in downstream applications such as land-cover mapping, urban planning and disaster assessment. Existing Transformer-based methods face a trade-off between accuracy and efficiency, while the recently proposed Mamba is known for its efficiency.

Mamba Segmentation +2

HGNET: A Hierarchical Feature Guided Network for Occupancy Flow Field Prediction

no code implementations 1 Jul 2024 Zhan Chen, Chen Tang, Lu Xiong

Additionally, to enhance the temporal consistency and causal relationships of the predictions, we propose a Time Series Memory framework to learn the conditional distribution models of the prediction outputs at future time steps from multivariate time series.

Autonomous Driving multimodal interaction +3

HVDetFusion: A Simple and Robust Camera-Radar Fusion Framework

1 code implementation 21 Jul 2023 Kai Lei, Zhan Chen, Shuman Jia, Xiaoteng Zhang

In this study, we propose HVDetFusion, a new multi-modal detection algorithm that supports camera-only input for detection and can also fuse radar and camera data as input.

3D Object Detection Autonomous Driving +1

Contrastive Learning from Spatio-Temporal Mixed Skeleton Sequences for Self-Supervised Skeleton-Based Action Recognition

1 code implementation 7 Jul 2022 Zhan Chen, Hong Liu, Tianyu Guo, Zhengyan Chen, Pinhao Song, Hao Tang

First, SkeleMix uses the topological information of skeleton data to mix two skeleton sequences by randomly combining the cropped skeleton fragments (the trimmed view) with the remaining skeleton sequences (the truncated view).

Action Recognition Contrastive Learning +3

Multi-Scale Spatial Temporal Graph Convolutional Network for Skeleton-Based Action Recognition

1 code implementation 27 Jun 2022 Zhan Chen, Sicheng Li, Bing Yang, Qinghan Li, Hong Liu

To solve this problem, we present a multi-scale spatial graph convolution (MS-GC) module and a multi-scale temporal graph convolution (MT-GC) module to enrich the receptive field of the model in spatial and temporal dimensions.

Skeleton Based Action Recognition

Contrastive Learning from Extremely Augmented Skeleton Sequences for Self-supervised Action Recognition

1 code implementation 7 Dec 2021 Tianyu Guo, Hong Liu, Zhan Chen, Mengyuan Liu, Tao Wang, Runwei Ding

In this paper, to make better use of the movement patterns introduced by extreme augmentations, a Contrastive Learning framework utilizing Abundant Information Mining for self-supervised action Representation (AimCLR) is proposed.

Contrastive Learning Few-Shot Skeleton-Based Action Recognition +5

Lip Graph Assisted Audio-Visual Speech Recognition Using Bidirectional Synchronous Fusion

no code implementations Interspeech 2020 Hong Liu, Zhan Chen, Bing Yang

Second, the hybrid visual stream is combined with the audio stream by an attention-based bidirectional synchronous fusion which allows bidirectional information interaction to resolve the asynchrony between the two modalities during fusion.

Audio-Visual Speech Recognition Landmark-based Lipreading +2

Two Step Joint Model for Drug Drug Interaction Extraction

no code implementations 28 Aug 2020 Siliang Tang, Qi Zhang, Tianpeng Zheng, Mengdi Zhou, Zhan Chen, Lixing Shen, Xiang Ren, Yueting Zhuang, ShiLiang Pu, Fei Wu

When patients take medicine, particularly more than one drug at the same time, they should be alerted to possible drug-drug interactions.

Decoder Drug–drug Interaction Extraction +5
