Search Results for author: Yutong Xie

Found 50 papers, 32 papers with code

A Survey of Medical Vision-and-Language Applications and Their Techniques

no code implementations19 Nov 2024 Qi Chen, Ruoshan Zhao, Sinuo Wang, Vu Minh Hieu Phan, Anton Van Den Hengel, Johan Verjans, Zhibin Liao, Minh-Son To, Yong Xia, Jian Chen, Yutong Xie, Qi Wu

Unlike general vision-and-language models trained on diverse, non-specialized datasets, MVLMs are purpose-built for the medical domain, automatically extracting and interpreting critical information from medical images and textual reports to support clinical decision-making.

Decision Making Image-text Retrieval +5

MedUniSeg: 2D and 3D Medical Image Segmentation via a Prompt-driven Universal Model

1 code implementation8 Oct 2024 Yiwen Ye, Ziyang Chen, Jianpeng Zhang, Yutong Xie, Yong Xia

In this paper, we introduce MedUniSeg, a prompt-driven universal segmentation model designed for 2D and 3D multi-task segmentation across diverse modalities and domains.

Image Segmentation Medical Image Segmentation +2

A Novel Perspective for Multi-modal Multi-label Skin Lesion Classification

no code implementations19 Sep 2024 Yuan Zhang, Yutong Xie, Hu Wang, Jodie C Avery, M Louise Hull, Gustavo Carneiro

The efficacy of deep learning-based Computer-Aided Diagnosis (CAD) methods for skin diseases relies on analyzing multiple data modalities (i. e., clinical+dermoscopic images, and patient metadata) and addressing the challenges of multi-label classification.

Lesion Classification Multi-Label Classification +1

AdaCBM: An Adaptive Concept Bottleneck Model for Explainable and Accurate Diagnosis

1 code implementation4 Aug 2024 Townim F. Chowdhury, Vu Minh Hieu Phan, Kewen Liao, Minh-Son To, Yutong Xie, Anton Van Den Hengel, Johan W. Verjans, Zhibin Liao

This paper takes an unconventional approach by re-examining the CBM framework through the lens of its geometrical representation as a simple linear classification system.

Classification Transfer Learning +1

XLIP: Cross-modal Attention Masked Modelling for Medical Language-Image Pre-Training

1 code implementation28 Jul 2024 Biao Wu, Yutong Xie, Zeyu Zhang, Minh Hieu Phan, Qi Chen, Ling Chen, Qi Wu

To this end, this paper proposes a XLIP (Masked modelling for medical Language-Image Pre-training) framework to enhance pathological learning and feature learning via unpaired data.

Contrastive Learning Language Modelling

Structural Attention: Rethinking Transformer for Unpaired Medical Image Synthesis

1 code implementation27 Jun 2024 Vu Minh Hieu Phan, Yutong Xie, BoWen Zhang, Yuankai Qi, Zhibin Liao, Antonios Perperidis, Son Lam Phung, Johan W. Verjans, Minh-Son To

To address this, we introduce UNet Structured Transformer (UNest), a novel architecture incorporating structural inductive biases for unpaired medical image synthesis.

Anatomy Image Generation

MASSW: A New Dataset and Benchmark Tasks for AI-Assisted Scientific Workflows

1 code implementation10 Jun 2024 Xingjian Zhang, Yutong Xie, Jin Huang, Jinge Ma, Zhaoying Pan, Qijia Liu, Ziyang Xiong, Tolga Ergen, Dongsub Shim, Honglak Lee, Qiaozhu Mei

Scientific innovation relies on detailed workflows, which include critical steps such as analyzing literature, generating ideas, validating these ideas, interpreting results, and inspiring follow-up research.

Navigate

CAPE: CAM as a Probabilistic Ensemble for Enhanced DNN Interpretation

1 code implementation CVPR 2024 Townim Faisal Chowdhury, Kewen Liao, Vu Minh Hieu Phan, Minh-Son To, Yutong Xie, Kevin Hung, David Ross, Anton Van Den Hengel, Johan W. Verjans, Zhibin Liao

Deep Neural Networks (DNNs) are widely used for visual classification tasks, but their complex computation process and black-box nature hinder decision transparency and interpretability.

Decision Making

Decomposing Disease Descriptions for Enhanced Pathology Detection: A Multi-Aspect Vision-Language Pre-training Framework

2 code implementations CVPR 2024 Vu Minh Hieu Phan, Yutong Xie, Yuankai Qi, Lingqiao Liu, Liyang Liu, BoWen Zhang, Zhibin Liao, Qi Wu, Minh-Son To, Johan W. Verjans

Medical vision language pre-training (VLP) has emerged as a frontier of research, enabling zero-shot pathological recognition by comparing the query image with the textual descriptions for each disease.

Language Modelling Large Language Model

Continual Self-supervised Learning: Towards Universal Multi-modal Medical Data Representation Learning

1 code implementation CVPR 2024 Yiwen Ye, Yutong Xie, Jianpeng Zhang, Ziyang Chen, Qi Wu, Yong Xia

In this paper, we reconsider versatile self-supervised learning from the perspective of continual learning and propose MedCoSS, a continuous self-supervised learning approach for multi-modal medical data.

Continual Learning Continual Self-Supervised Learning +3

Segment Together: A Versatile Paradigm for Semi-Supervised Medical Image Segmentation

no code implementations20 Nov 2023 Qingjie Zeng, Yutong Xie, Zilin Lu, Mengkang Lu, Yicheng Wu, Yong Xia

Therefore, in this paper, we introduce a \textbf{Ver}satile \textbf{Semi}-supervised framework (VerSemi) to point out a new perspective that integrates various tasks into a unified model with a broad label space, to exploit more unlabeled data for semi-supervised medical image segmentation.

Benchmarking Image Segmentation +3

A Turing Test: Are AI Chatbots Behaviorally Similar to Humans?

no code implementations19 Nov 2023 Qiaozhu Mei, Yutong Xie, Walter Yuan, Matthew O. Jackson

Their behaviors are often distinct from average and modal human behaviors, in which case they tend to behave on the more altruistic and cooperative end of the distribution.

Fairness

3D TransUNet: Advancing Medical Image Segmentation through Vision Transformers

3 code implementations11 Oct 2023 Jieneng Chen, Jieru Mei, Xianhang Li, Yongyi Lu, Qihang Yu, Qingyue Wei, Xiangde Luo, Yutong Xie, Ehsan Adeli, Yan Wang, Matthew Lungren, Lei Xing, Le Lu, Alan Yuille, Yuyin Zhou

In this paper, we extend the 2D TransUNet architecture to a 3D network by building upon the state-of-the-art nnU-Net architecture, and fully exploring Transformers' potential in both the encoder and decoder design.

Decoder Image Segmentation +4

Discrepancy Matters: Learning from Inconsistent Decoder Features for Consistent Semi-supervised Medical Image Segmentation

1 code implementation26 Sep 2023 Qingjie Zeng, Yutong Xie, Zilin Lu, Mengkang Lu, Yong Xia

Semi-supervised learning (SSL) has been proven beneficial for mitigating the issue of limited labeled data especially on the task of volumetric medical image segmentation.

Decoder Image Segmentation +3

BHSD: A 3D Multi-Class Brain Hemorrhage Segmentation Dataset

1 code implementation22 Aug 2023 Biao Wu, Yutong Xie, Zeyu Zhang, Jinchao Ge, Kaspar Yaxley, Suzan Bahadir, Qi Wu, Yifan Liu, Minh-Son To

Intracranial hemorrhage (ICH) is a pathological condition characterized by bleeding inside the skull or brain, which can be attributed to various factors.

Image Segmentation Medical Image Segmentation +2

Transformer-based Annotation Bias-aware Medical Image Segmentation

no code implementations2 Jun 2023 Zehui Liao, Yutong Xie, Shishuai Hu, Yong Xia

This paper proposes a Transformer-based Annotation Bias-aware (TAB) medical image segmentation model, which tackles the annotator-related bias via modeling annotator preference and stochastic errors.

Decoder Image Segmentation +3

Attention Mechanisms in Medical Image Segmentation: A Survey

no code implementations29 May 2023 Yutong Xie, Bing Yang, Qingbiao Guan, Jianpeng Zhang, Qi Wu, Yong Xia

This paper systematically reviews the basic principles of attention mechanisms and their applications in medical image segmentation.

Image Segmentation Medical Image Segmentation +4

Act Like a Radiologist: Radiology Report Generation across Anatomical Regions

2 code implementations26 May 2023 Qi Chen, Yutong Xie, Biao Wu, Xiaomin Chen, James Ang, Minh-Son To, Xiaojun Chang, Qi Wu

To address these issues, we propose X-RGen, a radiologist-minded report generation framework across six anatomical regions.

 Ranked #1 on Medical Report Generation on IU X-Ray (using extra training data)

Decoder Medical Report Generation +1

UniSeg: A Prompt-driven Universal Segmentation Model as well as A Strong Representation Learner

1 code implementation7 Apr 2023 Yiwen Ye, Yutong Xie, Jianpeng Zhang, Ziyang Chen, Yong Xia

Moreover, UniSeg also beats other pre-trained models on two downstream datasets, providing the community with a high-quality pre-trained model for 3D medical image segmentation.

Decoder Image Segmentation +3

A Prompt Log Analysis of Text-to-Image Generation Systems

1 code implementation8 Mar 2023 Yutong Xie, Zhaoying Pan, Jinge Ma, Luo Jie, Qiaozhu Mei

Despite the plenty of efforts to improve the generative models, there is limited work on understanding the information needs of the users of these systems at scale.

Text-to-Image Generation

Diffusion Model for Generative Image Denoising

no code implementations5 Feb 2023 Yutong Xie, Minne Yuan, Bin Dong, Quanzheng Li

In supervised learning for image denoising, usually the paired clean images and noisy images are collected or synthesised to train a denoising model.

Image Denoising

PEFAT: Boosting Semi-Supervised Medical Image Classification via Pseudo-Loss Estimation and Feature Adversarial Training

no code implementations CVPR 2023 Qingjie Zeng, Yutong Xie, Zilin Lu, Yong Xia

In this paper, we propose a novel Pseudo-loss Estimation and Feature Adversarial Training semi-supervised framework, termed as PEFAT, to boost the performance of multi-class and multi-label medical image classification from the point of loss distribution modeling and adversarial training.

Image Classification Semi-supervised Medical Image Classification

Instance-dependent Label Distribution Estimation for Learning with Label Noise

no code implementations16 Dec 2022 Zehui Liao, Shishuai Hu, Yutong Xie, Yong Xia

These methods heavily rely on the existence of anchor points or the quality of pseudo ones, and the global NTM can hardly provide accurate label transition information for each sample, since the label noise in real applications is mostly instance-dependent.

Image Classification

Learning from partially labeled data for multi-organ and tumor segmentation

1 code implementation13 Nov 2022 Yutong Xie, Jianpeng Zhang, Yong Xia, Chunhua Shen

To address this, we propose a Transformer based dynamic on-demand network (TransDoDNet) that learns to segment organs and tumors on multiple partially labeled datasets.

Image Segmentation Medical Image Segmentation +4

ClusTR: Exploring Efficient Self-attention via Clustering for Vision Transformers

no code implementations28 Aug 2022 Yutong Xie, Jianpeng Zhang, Yong Xia, Anton Van Den Hengel, Qi Wu

Besides, we further extend the clustering-guided attention from single-scale to multi-scale, which is conducive to dense prediction tasks.

Clustering Diversity +1

Region-Aware Metric Learning for Open World Semantic Segmentation via Meta-Channel Aggregation

1 code implementation17 May 2022 Hexin Dong, ZiFan Chen, Mingze Yuan, Yutong Xie, Jie Zhao, Fei Yu, Bin Dong, Li Zhang

Therefore, we propose a method called region-aware metric learning (RAML), which first separates the regions of the images and generates region-aware features for further metric learning.

Anomaly Segmentation Few-Shot Learning +3

Measurement-conditioned Denoising Diffusion Probabilistic Model for Under-sampled Medical Image Reconstruction

1 code implementation5 Mar 2022 Yutong Xie, Quanzheng Li

We propose a novel and unified method, measurement-conditioned denoising diffusion probabilistic model (MC-DDPM), for under-sampled medical image reconstruction based on DDPM.

Denoising MRI Reconstruction

Multi-View Graph Representation for Programming Language Processing: An Investigation into Algorithm Detection

1 code implementation25 Feb 2022 Ting Long, Yutong Xie, Xianyu Chen, Weinan Zhang, Qinxiang Cao, Yong Yu

We thoroughly evaluate our proposed MVG approach in the context of algorithm detection, an important and challenging subfield of PLP.

Graph Neural Network

Trained Model in Supervised Deep Learning is a Conditional Risk Minimizer

1 code implementation8 Feb 2022 Yutong Xie, Dufan Wu, Bin Dong, Quanzheng Li

We proved that a trained model in supervised deep learning minimizes the conditional risk for each input (Theorem 2. 1).

Deep Learning Image Super-Resolution

How Much Space Has Been Explored? Measuring the Chemical Space Covered by Databases and Machine-Generated Molecules

no code implementations22 Dec 2021 Yutong Xie, Ziqiao Xu, Jiaqi Ma, Qiaozhu Mei

We further evaluate how well the existing databases and generation models cover the chemical space in terms of #Circles.

Drug Discovery

UniMiSS: Universal Medical Self-Supervised Learning via Breaking Dimensionality Barrier

1 code implementation17 Dec 2021 Yutong Xie, Jianpeng Zhang, Yong Xia, Qi Wu

In this paper, we advocate bringing a wealth of 2D images like chest X-rays as compensation for the lack of 3D data, aiming to build a universal medical self-supervised representation learning framework, called UniMiSS.

Image Classification Medical Image Analysis +3

Learning from Ambiguous Labels for Lung Nodule Malignancy Prediction

1 code implementation23 Apr 2021 Zehui Liao, Yutong Xie, Shishuai Hu, Yong Xia

According to the consistency and reliability of their annotations, we divide nodules into three sets: a consistent and reliable set (CR-Set), an inconsistent set (IC-Set), and a low reliable set (LR-Set).

counterfactual

CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image Segmentation

1 code implementation4 Mar 2021 Yutong Xie, Jianpeng Zhang, Chunhua Shen, Yong Xia

Convolutional neural networks (CNNs) have been the de facto standard for nowadays 3D medical image segmentation.

Image Segmentation Inductive Bias +4

A General Computational Framework to Measure the Expressiveness of Complex Networks using a Tight Upper Bound of Linear Regions

no code implementations1 Jan 2021 Yutong Xie, Gaoxiang Chen, Quanzheng Li

Inspired by the proof of this upper bound and the framework of matrix computation in \citet{hinz2019framework}, we propose a general computational approach to compute a tight upper bound of regions number for theoretically any network structures (e. g. DNN with all kind of skip connections and residual structures).

A General Computational Framework to Measure the Expressiveness of Complex Networks Using a Tighter Upper Bound of Linear Regions

no code implementations8 Dec 2020 Yutong Xie, Gaoxiang Chen, Quanzheng Li

Inspired by the proof of this upper bound and theframework of matrix computation in Hinz & Van de Geer (2019), we propose ageneral computational approach to compute a tight upper bound of regions numberfor theoretically any network structures (e. g. DNN with all kind of skip connec-tions and residual structures).

Inter-slice Context Residual Learning for 3D Medical Image Segmentation

1 code implementation28 Nov 2020 Jianpeng Zhang, Yutong Xie, Yan Wang, Yong Xia

In this paper, we propose the 3D context residual network (ConResNet) for the accurate segmentation of 3D medical images.

Brain Tumor Segmentation Decoder +4

PGL: Prior-Guided Local Self-supervised Learning for 3D Medical Image Segmentation

no code implementations25 Nov 2020 Yutong Xie, Jianpeng Zhang, Zehui Liao, Yong Xia, Chunhua Shen

In this paper, we propose a PriorGuided Local (PGL) self-supervised model that learns the region-wise local consistency in the latent feature space.

Image Segmentation Medical Image Segmentation +3

DoDNet: Learning to segment multi-organ and tumors from multiple partially labeled datasets

1 code implementation CVPR 2021 Jianpeng Zhang, Yutong Xie, Yong Xia, Chunhua Shen

To address this, we propose a dynamic on-demand network (DoDNet) that learns to segment multiple organs and tumors on partially labeled datasets.

Image Segmentation Medical Image Segmentation +4

Pairwise Relation Learning for Semi-supervised Gland Segmentation

no code implementations6 Aug 2020 Yutong Xie, Jianpeng Zhang, Zhibin Liao, Chunhua Shen, Johan Verjans, Yong Xia

In this paper, we propose the pairwise relation-based semi-supervised (PRS^2) model for gland segmentation on histology images.

Relation Relation Network +1

Viral Pneumonia Screening on Chest X-ray Images Using Confidence-Aware Anomaly Detection

1 code implementation27 Mar 2020 Jianpeng Zhang, Yutong Xie, Guansong Pang, Zhibin Liao, Johan Verjans, Wenxin Li, Zongji Sun, Jian He, Yi Li, Chunhua Shen, Yong Xia

In this paper, we formulate the task of differentiating viral pneumonia from non-viral pneumonia and healthy controls into an one-class classification-based anomaly detection problem, and thus propose the confidence-aware anomaly detection (CAAD) model, which consists of a shared feature extractor, an anomaly detection module, and a confidence prediction module.

Binary Classification Classification +2

A Mutual Bootstrapping Model for Automated Skin Lesion Segmentation and Classification

1 code implementation8 Mar 2019 Yutong Xie, Jianpeng Zhang, Yong Xia, Chunhua Shen

Our results suggest that it is possible to boost the performance of skin lesion segmentation and classification simultaneously via training a unified model to perform both tasks in a mutual bootstrapping way.

Classification General Classification +3

Visual Rhythm Prediction with Feature-Aligning Network

no code implementations29 Jan 2019 Yutong Xie, Haiyang Wang, Yan Hao, Zihao Xu

In this paper, we propose a data-driven visual rhythm prediction method, which overcomes the previous works' deficiency that predictions are made primarily by human-crafted hard rules.

Optical Flow Estimation

A Multi-Level Deep Ensemble Model for Skin Lesion Classification in Dermoscopy Images

no code implementations23 Jul 2018 Yutong Xie, Jianpeng Zhang, Yong Xia

A multi-level deep ensemble (MLDE) model that can be trained in an 'end to end' manner is proposed for skin lesion classification in dermoscopy images.

General Classification Lesion Classification +1

Classification of Medical Images and Illustrations in the Biomedical Literature Using Synergic Deep Learning

no code implementations28 Jun 2017 Jianpeng Zhang, Yong Xia, Qi Wu, Yutong Xie

The Classification of medical images and illustrations in the literature aims to label a medical image according to the modality it was produced or label an illustration according to its production attributes.

General Classification Image Classification +2

Cannot find the paper you are looking for? You can Submit a new open access paper.