Search Results for author: Mengcheng Lan

Found 12 papers, 9 papers with code

BrainPrompt: Multi-Level Brain Prompt Enhancement for Neurological Condition Identification

1 code implementation12 Apr 2025 Jiaxing Xu, Kai He, Yue Tang, Wei Li, Mengcheng Lan, Xia Dong, Yiping Ke, Mengling Feng

In this paper, we present BrainPrompt, an innovative framework that enhances Graph Neural Networks (GNNs) by integrating Large Language Models (LLMs) with knowledge-driven prompts, enabling more effective capture of complex, non-imaging information and external knowledge for neurological disease identification.

BrainOOD: Out-of-distribution Generalizable Brain Network Analysis

1 code implementation2 Feb 2025 Jiaxing Xu, Yongqiang Chen, Xia Dong, Mengcheng Lan, Tiancheng Huang, Qingtian Bian, James Cheng, Yiping Ke

Graph Neural Networks (GNNs) have shown promising in analyzing brain networks, but there are two major challenges in using GNNs: (1) distribution shifts in multi-site brain network data, leading to poor Out-of-Distribution (OOD) generalization, and (2) limited interpretability in identifying key brain regions critical to neurological disorders.

GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding

1 code implementation16 Nov 2024 Yue Zhou, Mengcheng Lan, Xiang Li, Litong Feng, Yiping Ke, Xue Jiang, Qingyun Li, Xue Yang, Wayne Zhang

Remote sensing (RS) visual grounding aims to use natural language expression to locate specific objects (in the form of the bounding box or segmentation mask) in RS images, enhancing human interaction with intelligent RS interpretation systems.

Language Modeling Language Modelling +3

Text4Seg: Reimagining Image Segmentation as Text Generation

1 code implementation13 Oct 2024 Mengcheng Lan, Chaofeng Chen, Yue Zhou, Jiaxing Xu, Yiping Ke, Xinjiang Wang, Litong Feng, Wayne Zhang

Multimodal Large Language Models (MLLMs) have shown exceptional capabilities in vision-language tasks; however, effectively integrating image segmentation into these models remains a significant challenge.

Image Segmentation Referring Expression +4

Multi-Atlas Brain Network Classification through Consistency Distillation and Complementary Information Fusion

no code implementations28 Sep 2024 Jiaxing Xu, Mengcheng Lan, Xia Dong, Kai He, Wei zhang, Qingtian Bian, Yiping Ke

Some recent methods have proposed utilizing multiple atlases, but they neglect consistency across atlases and lack ROI-level information exchange.

Contrasformer: A Brain Network Contrastive Transformer for Neurodegenerative Condition Identification

1 code implementation17 Sep 2024 Jiaxing Xu, Kai He, Mengcheng Lan, Qingtian Bian, Wei Li, Tieying Li, Yiping Ke, Miao Qiao

It generates a prior-knowledge-enhanced contrast graph to address the distribution shifts across sub-populations by a two-stream attention mechanism.

ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation

1 code implementation9 Aug 2024 Mengcheng Lan, Chaofeng Chen, Yiping Ke, Xinjiang Wang, Litong Feng, Wayne Zhang

ProxyCLIP leverages the spatial feature correspondence from VFMs as a form of proxy attention to augment CLIP, thereby inheriting the VFMs' robust local consistency and maintaining CLIP's exceptional zero-shot transfer capacity.

Open Vocabulary Semantic Segmentation Open-Vocabulary Semantic Segmentation +2

ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference

no code implementations17 Jul 2024 Mengcheng Lan, Chaofeng Chen, Yiping Ke, Xinjiang Wang, Litong Feng, Wayne Zhang

Despite the success of large-scale pretrained Vision-Language Models (VLMs) especially CLIP in various open-vocabulary tasks, their application to semantic segmentation remains challenging, producing noisy segmentation maps with mis-segmented regions.

Open Vocabulary Semantic Segmentation Open-Vocabulary Semantic Segmentation +1

Learning to Discover Knowledge: A Weakly-Supervised Partial Domain Adaptation Approach

1 code implementation20 Jun 2024 Mengcheng Lan, Min Meng, Jun Yu, Jigang Wu

As such, the key issues of WS-PDA are: 1) how to sufficiently discover the knowledge from the noisy labeled source domain and the unlabeled target domain, and 2) how to successfully adapt the knowledge across domains.

Partial Domain Adaptation

MIMO Is All You Need : A Strong Multi-In-Multi-Out Baseline for Video Prediction

1 code implementation9 Dec 2022 Shuliang Ning, Mengcheng Lan, Yanran Li, Chaofeng Chen, Qian Chen, Xunlai Chen, Xiaoguang Han, Shuguang Cui

The mainstream of the existing approaches for video prediction builds up their models based on a Single-In-Single-Out (SISO) architecture, which takes the current frame as input to predict the next frame in a recursive manner.

All Prediction +1

From Single to Multiple: Leveraging Multi-level Prediction Spaces for Video Forecasting

no code implementations21 Jul 2021 Mengcheng Lan, Shuliang Ning, Yanran Li, Qian Chen, Xunlai Chen, Xiaoguang Han, Shuguang Cui

Despite video forecasting has been a widely explored topic in recent years, the mainstream of the existing work still limits their models with a single prediction space but completely neglects the way to leverage their model with multi-prediction spaces.

Prediction Video Forecasting +1

Cannot find the paper you are looking for? You can Submit a new open access paper.