no code implementations • 15 Apr 2025 • Qinyue Tong, Ziqian Lu, Jun Liu, Yangming Zheng, Zheming Lu
In this paper, we introduce a novel medical vision task: Medical Reasoning Segmentation and Detection (MedSD), which aims to comprehend implicit queries about medical images and generate the corresponding segmentation mask and bounding box for the target object.
1 code implementation • 9 Jan 2025 • Hao Wen, Ziqian Lu, Fengli Shen, Zhe-Ming Lu, Jialin Cui
We propose a new action recognition framework that introduces object nodes to supplement the missing interactive object information.
no code implementations • 22 Aug 2024 • Mushui Liu, Fangtai Wu, Bozheng Li, Ziqian Lu, Yunlong Yu, Xi Li
Few-shot learning (FSL) aims to recognize new concepts using a limited number of visual samples.
1 code implementation • 17 May 2024 • Mushui Liu, Jun Dan, Ziqian Lu, Yunlong Yu, Yingming Li, Xi Li
In this paper, we propose CM-UNet, comprising a CNN-based encoder for extracting local image features and a Mamba-based decoder for aggregating and integrating global information, facilitating efficient semantic segmentation of remote sensing images.
1 code implementation • 6 Dec 2023 • Mushui Liu, Weijie He, Ziqian Lu, Yunlong Yu
Prompt learning is a powerful technique for transferring Vision-Language Models (VLMs) such as CLIP to downstream tasks.
1 code implementation • 29 Sep 2023 • Zixuan Chen, Zewei He, Ziqian Lu, Xuecheng Sun, Zhe-Ming Lu
Accordingly, we first apply a prompt generation module (PGM) to generate a visual prompt, which serves as the reference for appropriate statistical perturbations of the mean and standard deviation.
no code implementations • 28 Sep 2023 • Zewei He, Zixuan Chen, Ziqian Lu, Xuecheng Sun, Zhe-Ming Lu
Thus, this paper presents a multi-receptive-field non-local network (MRFNLN) consisting of a multi-stream feature attention block (MSFAB) and a cross non-local block (CNLB).