Search Results for author: Xiaomao Fan

Found 14 papers, 9 papers with code

ITCFN: Incomplete Triple-Modal Co-Attention Fusion Network for Mild Cognitive Impairment Conversion Prediction

1 code implementation20 Jan 2025 Xiangyang Hu, Xiangyu Shen, Yifei Sun, Xuhao Shan, Wenwen Min, Liyilei Su, Xiaomao Fan, Ahmed Elazab, Ruiquan Ge, Changmiao Wang, Xiaopeng Fan

To address these challenges, we propose an innovative multimodal approach for predicting MCI conversion, focusing specifically on the issues of missing positron emission tomography (PET) data and integrating diverse medical information.

VisionLLM-based Multimodal Fusion Network for Glottic Carcinoma Early Detection

no code implementations24 Dec 2024 Zhaohui Jin, Yi Shuai, Yongcheng Li, Lingcong Cai, Yun Li, Huifen Liu, Xiaomao Fan

The early detection of glottic carcinoma is critical for improving patient outcomes, as it enables timely intervention, preserves vocal function, and significantly reduces the risk of tumor progression and metastasis.

Language Modeling Language Modelling +1

Stain-aware Domain Alignment for Imbalance Blood Cell Classification

1 code implementation4 Dec 2024 Yongcheng Li, Lingcong Cai, Ying Lu, Xianghua Fu, Xiao Han, Ma Li, Wenxing Lai, Xiangzhong Zhang, Xiaomao Fan

In real-world scenarios, blood cell image datasets often present the issues of domain shift and data imbalance, posing challenges for accurate blood cell identification.

Contrastive Learning

A Survey of Stance Detection on Social Media: New Directions and Perspectives

no code implementations24 Sep 2024 BoWen Zhang, Genan Dai, Fuqiang Niu, Nan Yin, Xiaomao Fan, Senzhang Wang, Xiaochun Cao, Hu Huang

In modern digital environments, users frequently express opinions on contentious topics, providing a wealth of information on prevailing attitudes.

Marketing Stance Detection +1

Mamba-Enhanced Text-Audio-Video Alignment Network for Emotion Recognition in Conversations

1 code implementation8 Sep 2024 Xinran Li, Xiaomao Fan, Qingyang Wu, Xiaojiang Peng, Ye Li

MaTAV is with the advantages of aligning unimodal features to ensure consistency across different modalities and handling long input sequences to better capture contextual multimodal information.

Emotion Recognition Mamba +2

3D-LSPTM: An Automatic Framework with 3D-Large-Scale Pretrained Model for Laryngeal Cancer Detection Using Laryngoscopic Videos

no code implementations2 Sep 2024 Meiyu Qiu, Yun Li, Wenjun Huang, Haoyun Zhang, Weiping Zheng, Wenbin Lei, Xiaomao Fan

Laryngeal cancer is a malignant disease with a high morality rate in otorhinolaryngology, posing an significant threat to human health.

Ethics

Domain-invariant Representation Learning via Segment Anything Model for Blood Cell Classification

1 code implementation14 Aug 2024 Yongcheng Li, Lingcong Cai, Ying Lu, Cheng Lin, Yupeng Zhang, Jingyan Jiang, Genan Dai, BoWen Zhang, Jingzhou Cao, Xiangzhong Zhang, Xiaomao Fan

To address this issue, we propose a novel framework of domain-invariant representation learning (DoRL) via segment anything model (SAM) for blood cell classification.

Representation Learning

Towards Cross-Domain Single Blood Cell Image Classification via Large-Scale LoRA-based Segment Anything Model

1 code implementation13 Aug 2024 Yongcheng Li, Lingcong Cai, Ying Lu, Yupeng Zhang, Jingyan Jiang, Genan Dai, BoWen Zhang, Jingzhou Cao, Xiangzhong Zhang, Xiaomao Fan

Accurate classification of blood cells plays a vital role in hematological analysis as it aids physicians in diagnosing various medical conditions.

Image Classification

SAM-FNet: SAM-Guided Fusion Network for Laryngo-Pharyngeal Tumor Detection

1 code implementation10 Aug 2024 Jia Wei, Yun Li, Meiyu Qiu, Hongyu Chen, Xiaomao Fan, Wenbin Lei

However, they are still limited in their capabilities to accurately locate the lesion region and capture the discriminative feature information between the global and local branches.

Diagnostic Semantic Segmentation

Core Knowledge Learning Framework for Graph Adaptation and Scalability Learning

no code implementations2 Jul 2024 BoWen Zhang, Zhichao Huang, Genan Dai, Guangning Xu, Xiaomao Fan, Hu Huang

\method{} comprises several key modules, including the core subgraph knowledge submodule, graph domain adaptation module, and few-shot learning module for downstream tasks.

Domain Adaptation Few-Shot Learning +3

UMETTS: A Unified Framework for Emotional Text-to-Speech Synthesis with Multimodal Prompts

1 code implementation29 Apr 2024 Zhi-Qi Cheng, Xiang Li, Jun-Yan He, Junyao Chen, Xiaomao Fan, Xiaojiang Peng, Alexander G. Hauptmann

Emotional Text-to-Speech (E-TTS) synthesis has garnered significant attention in recent years due to its potential to revolutionize human-computer interaction.

Contrastive Learning Speech Synthesis +2

Video-based Smoky Vehicle Detection with A Coarse-to-Fine Framework

no code implementations8 Jul 2022 Xiaojiang Peng, Xiaomao Fan, Qingyang Wu, Jieyan Zhao, Pan Gao

Moreover, we present a new Coarse-to-fine Deep Smoky vehicle detection (CoDeS) framework for efficient smoky vehicle detection.

Cannot find the paper you are looking for? You can Submit a new open access paper.