Search Results for author: ZongYuan Ge

Found 67 papers, 29 papers with code

Simple Online and Realtime Tracking

55 code implementations • 2 Feb 2016 • Alex Bewley, ZongYuan Ge, Lionel Ott, Fabio Ramos, Ben Upcroft

This paper explores a pragmatic approach to multiple object tracking where the main focus is to associate objects efficiently for online and realtime applications.

Ranked #2 on Multi-Object Tracking on MOT15

Multi-Object Tracking Multiple Object Tracking

3,712

Paper
Code

Mutual Consistency Learning for Semi-supervised Medical Image Segmentation

2 code implementations • 21 Sep 2021 • Yicheng Wu, ZongYuan Ge, Donghao Zhang, Minfeng Xu, Lei Zhang, Yong Xia, Jianfei Cai

In this paper, we propose a novel mutual consistency network (MC-Net+) to effectively exploit the unlabeled data for semi-supervised medical image segmentation.

Image Segmentation Segmentation +2

1,988

Paper
Code

Exploring Smoothness and Class-Separation for Semi-supervised Medical Image Segmentation

2 code implementations • 2 Mar 2022 • Yicheng Wu, Zhonghua Wu, Qianyi Wu, ZongYuan Ge, Jianfei Cai

The pixel-level smoothness forces the model to generate invariant results under adversarial perturbations.

Image Segmentation Semantic Segmentation +1

1,988

Paper
Code

Semi-supervised Left Atrium Segmentation with Mutual Consistency Training

3 code implementations • 4 Mar 2021 • Yicheng Wu, Minfeng Xu, ZongYuan Ge, Jianfei Cai, Lei Zhang

Such mutual consistency encourages the two decoders to have consistent and low-entropy predictions and enables the model to gradually capture generalized features from these unlabeled challenging regions.

Image Segmentation Left Atrium Segmentation +4

272

Paper
Code

Hierarchical Neural Architecture Search for Deep Stereo Matching

1 code implementation • NeurIPS 2020 • Xuelian Cheng, Yiran Zhong, Mehrtash Harandi, Yuchao Dai, Xiaojun Chang, Tom Drummond, Hongdong Li, ZongYuan Ge

To reduce the human efforts in neural network design, Neural Architecture Search (NAS) has been applied with remarkable success to various high-level vision tasks such as classification and semantic segmentation.

Ranked #2 on Stereo Disparity Estimation on Scene Flow

Neural Architecture Search Semantic Segmentation +3

252

Paper
Code

Implicit Motion Handling for Video Camouflaged Object Detection

1 code implementation • CVPR 2022 • Xuelian Cheng, Huan Xiong, Deng-Ping Fan, Yiran Zhong, Mehrtash Harandi, Tom Drummond, ZongYuan Ge

We propose a new video camouflaged object detection (VCOD) framework that can exploit both short-term dynamics and long-term temporal consistency to detect camouflaged objects from video frames.

Ranked #2 on Camouflaged Object Segmentation on Camouflaged Animal Dataset (using extra training data)

Camouflaged Object Segmentation Motion Estimation +4

Paper
Code

LMPT: Prompt Tuning with Class-Specific Embedding Loss for Long-tailed Multi-Label Visual Recognition

1 code implementation • 8 May 2023 • Peng Xia, Di Xu, Lie Ju, Ming Hu, Jun Chen, ZongYuan Ge

Long-tailed multi-label visual recognition (LTML) task is a highly challenging task due to the label co-occurrence and imbalanced data distribution.

Ranked #1 on Long-tail Learning on COCO-MLT (using extra training data)

Long-tail Learning

Paper
Code

SurgicalPart-SAM: Part-to-Whole Collaborative Prompting for Surgical Instrument Segmentation

2 code implementations • 22 Dec 2023 • Wenxi Yue, Jing Zhang, Kun Hu, Qiuxia Wu, ZongYuan Ge, Yong Xia, Jiebo Luo, Zhiyong Wang

Specifically, we achieve this by proposing (1) Collaborative Prompts that describe instrument structures via collaborating category-level and part-level texts; (2) Cross-Modal Prompt Encoder that encodes text prompts jointly with visual embeddings into discriminative part-level representations; and (3) Part-to-Whole Adaptive Fusion and Hierarchical Decoding that adaptively fuse the part-level representations into a whole for accurate instrument segmentation in surgical scenarios.

Segmentation Semantic Segmentation

Paper
Code

EndoSurf: Neural Surface Reconstruction of Deformable Tissues with Stereo Endoscope Videos

1 code implementation • 21 Jul 2023 • Ruyi Zha, Xuelian Cheng, Hongdong Li, Mehrtash Harandi, ZongYuan Ge

We constrain the learned shape by tailoring multiple regularization strategies and disentangling geometry and appearance.

Surface Reconstruction

Paper
Code

HGCLIP: Exploring Vision-Language Models with Graph Representations for Hierarchical Understanding

1 code implementation • 23 Nov 2023 • Peng Xia, Xingtong Yu, Ming Hu, Lie Ju, Zhiyong Wang, Peibo Duan, ZongYuan Ge

We explore constructing the class hierarchy into a graph, with its nodes representing the textual or image features of each category.

Fine-Grained Visual Recognition Graph Representation Learning

Paper
Code

Diversified and Personalized Multi-rater Medical Image Segmentation

1 code implementation • 20 Mar 2024 • Yicheng Wu, Xiangde Luo, Zhe Xu, Xiaoqing Guo, Lie Ju, ZongYuan Ge, Wenjun Liao, Jianfei Cai

To address it, the common practice is to gather multiple annotations from different experts, leading to the setting of multi-rater medical image segmentation.

Image Segmentation Medical Image Segmentation +2

Paper
Code

Node Representation Learning in Graph via Node-to-Neighbourhood Mutual Information Maximization

1 code implementation • CVPR 2022 • Wei Dong, Junsheng Wu, Yi Luo, ZongYuan Ge, Peng Wang

In this work, we present a simple-yet-effective self-supervised node representation learning strategy via directly maximizing the mutual information between the hidden representations of nodes and their neighbourhood, which can be theoretically justified by its link to graph smoothing.

Node Classification Representation Learning

Paper
Code

Hunting Attributes: Context Prototype-Aware Learning for Weakly Supervised Semantic Segmentation

1 code implementation • 12 Mar 2024 • Feilong Tang, Zhongxing Xu, Zhaojun Qu, Wei Feng, Xingjian Jiang, ZongYuan Ge

Inspired by prototype learning theory, we propose leveraging prototype awareness to capture diverse and fine-grained feature attributes of instances.

Learning Theory Weakly supervised Semantic Segmentation +1

Paper
Code

Medical Matting: A New Perspective on Medical Segmentation with Uncertainty

1 code implementation • 18 Jun 2021 • Lin Wang, Lie Ju, Xin Wang, Wanji He, Donghao Zhang, Yelin Huang, Zhiwen Yang, Xuan Yao, Xin Zhao, Xiufen Ye, ZongYuan Ge

None of them investigate the influence of the ambiguous nature of the lesion itself. Inspired by image matting, this paper introduces alpha matte as a soft mask to represent uncertain areas in medical scenes and accordingly puts forward a new uncertainty quantification method to fill the gap of uncertainty research for lesion structure.

Image Matting Image Segmentation +3

Paper
Code

Deep Laparoscopic Stereo Matching with Transformers

1 code implementation • 25 Jul 2022 • Xuelian Cheng, Yiran Zhong, Mehrtash Harandi, Tom Drummond, Zhiyong Wang, ZongYuan Ge

The self-attention mechanism, successfully employed with the transformer structure is shown promise in many computer vision tasks including image recognition, and object detection.

object-detection Object Detection +2

Paper
Code

EPVT: Environment-aware Prompt Vision Transformer for Domain Generalization in Skin Lesion Recognition

1 code implementation • 4 Apr 2023 • Siyuan Yan, Chi Liu, Zhen Yu, Lie Ju, Dwarikanath Mahapatrainst, Victoria Mar, Monika Janda, Peter Soyer, ZongYuan Ge

Concretely, EPVT leverages a set of domain prompts, each of which plays as a domain expert, to capture domain-specific knowledge; and a shared prompt for general knowledge over the entire dataset.

Domain Generalization General Knowledge

Paper
Code

Prompt-driven Latent Domain Generalization for Medical Image Classification

2 code implementations • 5 Jan 2024 • Siyuan Yan, Chi Liu, Zhen Yu, Lie Ju, Dwarikanath Mahapatra, Brigid Betz-Stablein, Victoria Mar, Monika Janda, Peter Soyer, ZongYuan Ge

To address these challenges, we propose a novel DG framework for medical image classification without relying on domain labels, called Prompt-driven Latent Domain Generalization (PLDG).

Domain Generalization Image Classification +1

Paper
Code

Vision-Language Navigation with Random Environmental Mixup

1 code implementation • ICCV 2021 • Chong Liu, Fengda Zhu, Xiaojun Chang, Xiaodan Liang, ZongYuan Ge, Yi-Dong Shen

Then, we cross-connect the key views of different scenes to construct augmented scenes.

Ranked #38 on Vision and Language Navigation on VLN Challenge

Data Augmentation Navigate +1

Paper
Code

Out-of-Distribution Detection for Long-tailed and Fine-grained Skin Lesion Images

1 code implementation • 30 Jun 2022 • Deval Mehta, Yaniv Gal, Adrian Bowling, Paul Bonnington, ZongYuan Ge

Through this approach, 1) First, we target the mixup amongst middle and tail classes to address the long-tail problem.

Lesion Classification Multi-class Classification +3

Paper
Code

Adversarial Discriminative Sim-to-real Transfer of Visuo-motor Policies

1 code implementation • 18 Sep 2017 • Fangyi Zhang, Jürgen Leitner, ZongYuan Ge, Michael Milford, Peter Corke

Policies can be transferred to real environments with only 93 labelled and 186 unlabelled real images.

Paper
Code

EventRPG: Event Data Augmentation with Relevance Propagation Guidance

1 code implementation • 14 Mar 2024 • Mingyuan Sun, Donghao Zhang, ZongYuan Ge, Jiaxu Wang, Jia Li, Zheng Fang, Renjing Xu

Based on this, we propose EventRPG, which leverages relevance propagation on the spiking neural network for more efficient augmentation.

Action Recognition Data Augmentation +1

Paper
Code

TPMIL: Trainable Prototype Enhanced Multiple Instance Learning for Whole Slide Image Classification

1 code implementation • 1 May 2023 • Litao Yang, Deval Mehta, Sidong Liu, Dwarikanath Mahapatra, Antonio Di Ieva, ZongYuan Ge

Due to the high resolution of the WSI and the unavailability of patch-level annotations, WSI classification is usually formulated as a weakly supervised problem, which relies on multiple instance learning (MIL) based on patches of a WSI.

Image Classification Multiple Instance Learning +1

Paper
Code

Adversarial Pulmonary Pathology Translation for Pairwise Chest X-ray Data Augmentation

1 code implementation • 11 Oct 2019 • Yunyan Xing, ZongYuan Ge, Rui Zeng, Dwarikanath Mahapatra, Jarrel Seah, Meng Law, Tom Drummond

We demonstrate the effectiveness of our model on two tasks: (i) we invite certified radiologists to assess the quality of the generated synthetic images against real and other state-of-the-art generative models, and (ii) data augmentation to improve the performance of disease localisation.

Data Augmentation Image-to-Image Translation +1

Paper
Code

Unsupervised Domain Adaptive Fundus Image Segmentation with Category-level Regularization

1 code implementation • 8 Jul 2022 • Wei Feng, Lin Wang, Lie Ju, Xin Zhao, Xin Wang, Xiaoyu Shi, ZongYuan Ge

Existing unsupervised domain adaptation methods based on adversarial learning have achieved good performance in several medical imaging tasks.

Image Segmentation Semantic Segmentation +1

Paper
Code

T-Person-GAN: Text-to-Person Image Generation with Identity-Consistency and Manifold Mix-Up

1 code implementation • 18 Aug 2022 • Deyin Liu, Lin Yuanbo Wu, Bo Li, ZongYuan Ge

Our architecture is orthogonal to StackGAN++ , and focuses on person image generation, with all of them together to enrich the spectrum of GANs for the image generation task.

Text-to-Image Generation

Paper
Code

Improving Deep Lesion Detection Using 3D Contextual and Spatial Attention

1 code implementation • 9 Jul 2019 • Qingyi Tao, ZongYuan Ge, Jianfei Cai, Jianxiong Yin, Simon See

Secondly, in CT scans, the lesions are often indistinguishable from the background since the lesion and non-lesion areas may have very similar appearances.

Computed Tomography (CT) Lesion Detection +2

Paper
Code

Differentiable Neural Architecture Search in Equivalent Space with Exploration Enhancement

1 code implementation • NeurIPS 2020 • Miao Zhang, Huiqi Li, Shirui Pan, Xiaojun Chang, ZongYuan Ge, Steven Su

A probabilistic exploration enhancement method is accordingly devised to encourage intelligent exploration during the architecture search in the latent space, to avoid local optimal in architecture search.

Bilevel Optimization Neural Architecture Search

Paper
Code

Generative OpenMax for Multi-Class Open Set Classification

no code implementations • 24 Jul 2017 • ZongYuan Ge, Sergey Demyanov, Zetao Chen, Rahil Garnavi

We present a conceptually new and flexible method for multi-class open set classification.

Classification General Classification +2

Paper
Add Code

Exploiting Temporal Information for DCNN-based Fine-Grained Object Classification

no code implementations • 1 Aug 2016 • ZongYuan Ge, Chris McCool, Conrad Sanderson, Peng Wang, Lingqiao Liu, Ian Reid, Peter Corke

Fine-grained classification is a relatively new field that has concentrated on using information from a single image, while ignoring the enormous potential of using video data to improve classification.

Classification General Classification

Paper
Add Code

Fine-Grained Classification via Mixture of Deep Convolutional Neural Networks

no code implementations • 30 Nov 2015 • ZongYuan Ge, Alex Bewley, Christopher Mccool, Ben Upcroft, Peter Corke, Conrad Sanderson

We present a novel deep convolutional neural network (DCNN) system for fine-grained image classification, called a mixture of DCNNs (MixDCNN).

Classification Fine-Grained Image Classification +1

Paper
Add Code

Subset Feature Learning for Fine-Grained Category Classification

no code implementations • 9 May 2015 • Zongyuan Ge, Christopher Mccool, Conrad Sanderson, Peter Corke

Fine-grained categorisation has been a challenging problem due to small inter-class variation, large intra-class variation and low number of training images.

Classification General Classification +1

Paper
Add Code

Modelling Local Deep Convolutional Neural Network Features to Improve Fine-Grained Image Classification

no code implementations • 27 Feb 2015 • ZongYuan Ge, Chris McCool, Conrad Sanderson, Peter Corke

We propose a local modelling approach using deep convolutional neural networks (CNNs) for fine-grained image classification.

Fine-Grained Image Classification General Classification +1

Paper
Add Code

Chest X-rays Classification: A Multi-Label and Fine-Grained Problem

no code implementations • 19 Jul 2018 • Zongyuan Ge, Dwarikanath Mahapatra, Suman Sedai, Rahil Garnavi, Rajib Chakravorty

In this work we have proposed a novel error function, Multi-label Softmax Loss (MSML), to specifically address the properties of multiple labels and imbalanced data.

General Classification Image Classification +1

Paper
Add Code

Deep multiscale convolutional feature learning for weakly supervised localization of chest pathologies in X-ray images

no code implementations • 22 Aug 2018 • Suman Sedai, Dwarikanath Mahapatra, ZongYuan Ge, Rajib Chakravorty, Rahil Garnavi

We propose a novel weakly supervised method to localize chest pathologies using class aware deep multiscale feature learning.

General Classification

Paper
Add Code

Geometry-constrained Car Recognition Using a 3D Perspective Network

no code implementations • 19 Mar 2019 • Rui Zeng, ZongYuan Ge, Simon Denman, Sridha Sridharan, Clinton Fookes

Unlike existing methods which only use attention mechanisms to locate 2D discriminative information, our work learns a novel 3D perspective feature representation of a vehicle, which is then fused with 2D appearance feature to predict the category.

Paper
Add Code

Training Data Independent Image Registration With GANs Using Transfer Learning And Segmentation Information

no code implementations • 25 Mar 2019 • Dwarikanath Mahapatra, ZongYuan Ge

Registration is an important task in automated medical image analysis.

Image Registration Transfer Learning

Paper
Add Code

ZSTAD: Zero-Shot Temporal Activity Detection

no code implementations • CVPR 2020 • Lingling Zhang, Xiaojun Chang, Jun Liu, Minnan Luo, Sen Wang, ZongYuan Ge, Alexander Hauptmann

An integral part of video analysis and surveillance is temporal activity detection, which means to simultaneously recognize and localize activities in long untrimmed videos.

Action Detection Activity Detection

Paper
Add Code

Bridge the Domain Gap Between Ultra-wide-field and Traditional Fundus Images via Adversarial Domain Adaptation

no code implementations • 23 Mar 2020 • Lie Ju, Xin Wang, Quan Zhou, Hu Zhu, Mehrtash Harandi, Paul Bonnington, Tom Drummond, ZongYuan Ge

We design a regularisation technique to regulate the domain adaptation.

Domain Adaptation Management

Paper
Add Code

Synergic Adversarial Label Learning for Grading Retinal Diseases via Knowledge Distillation and Multi-task Learning

no code implementations • 24 Mar 2020 • Lie Ju, Xin Wang, Xin Zhao, Huimin Lu, Dwarikanath Mahapatra, Paul Bonnington, ZongYuan Ge

In addition, we conduct additional experiments to show the effectiveness of SALL from the aspects of reliability and interpretability in the context of medical imaging application.

Classification General Classification +3

Paper
Add Code

Melanoma Diagnosis with Spatio-Temporal Feature Learning on Sequential Dermoscopic Images

no code implementations • 19 Jun 2020 • Zhen Yu, Jennifer Nguyen, Xiaojun Chang, John Kelly, Catriona Mclean, Lei Zhang, Victoria Mar, ZongYuan Ge

Existing studies for automated melanoma diagnosis are based on single-time point images of lesions.

Melanoma Diagnosis

Paper
Add Code

Robust early-learning: Hindering the memorization of noisy labels

no code implementations • ICLR 2021 • Xiaobo Xia, Tongliang Liu, Bo Han, Chen Gong, Nannan Wang, ZongYuan Ge, Yi Chang

The \textit{early stopping} method therefore can be exploited for learning with noisy labels.

Ranked #32 on Image Classification on mini WebVision 1.0 (ImageNet Top-1 Accuracy metric)

Learning with noisy labels Memorization

Paper
Add Code

Leveraging Regular Fundus Images for Training UWF Fundus Diagnosis Models via Adversarial Learning and Pseudo-Labeling

no code implementations • 27 Nov 2020 • Lie Ju, Xin Wang, Xin Zhao, Paul Bonnington, Tom Drummond, ZongYuan Ge

We propose the use of a modified cycle generative adversarial network (CycleGAN) model to bridge the gap between regular and UWF fundus and generate additional UWF fundus images for training.

Generative Adversarial Network Lesion Detection

Paper
Add Code

Improving Medical Image Classification with Label Noise Using Dual-uncertainty Estimation

no code implementations • 28 Feb 2021 • Lie Ju, Xin Wang, Lin Wang, Dwarikanath Mahapatra, Xin Zhao, Mehrtash Harandi, Tom Drummond, Tongliang Liu, ZongYuan Ge

In this paper, we systematically discuss and define the two common types of label noise in medical images - disagreement label noise from inconsistency expert opinions and single-target label noise from wrong diagnosis record.

Benchmarking General Classification +3

Paper
Add Code

Relational Subsets Knowledge Distillation for Long-tailed Retinal Diseases Recognition

no code implementations • 22 Apr 2021 • Lie Ju, Xin Wang, Lin Wang, Tongliang Liu, Xin Zhao, Tom Drummond, Dwarikanath Mahapatra, ZongYuan Ge

For example, there are estimated more than 40 different kinds of retinal diseases with variable morbidity, however with more than 30+ conditions are very rare from the global patient cohorts, which results in a typical long-tailed learning problem for deep learning-based screening models.

Knowledge Distillation

Paper
Add Code

Unsupervised Domain Adaptation for Retinal Vessel Segmentation with Adversarial Learning and Transfer Normalization

no code implementations • 4 Aug 2021 • Wei Feng, Lie Ju, Lin Wang, Kaimin Song, Xin Wang, Xin Zhao, Qingyi Tao, ZongYuan Ge

In this work, we explore unsupervised domain adaptation in retinal vessel segmentation by using entropy-based adversarial learning and transfer normalization layer to train a segmentation network, which generalizes well across domains and requires no annotation of the target domain.

Retinal Vessel Segmentation Segmentation +1

Paper
Add Code

IDENTIFYING CONCEALED OBJECTS FROM VIDEOS

no code implementations • 29 Sep 2021 • Xuelian Cheng, Huan Xiong, Deng-Ping Fan, Yiran Zhong, Mehrtash Harandi, Tom Drummond, ZongYuan Ge

The proposed SLT-Net leverages on both short-term dynamics and long-term temporal consistency to detect concealed objects in continuous video frames.

object-detection Object Detection

Paper
Add Code

Early Melanoma Diagnosis with Sequential Dermoscopic Images

no code implementations • 12 Oct 2021 • Zhen Yu, Jennifer Nguyen, Toan D Nguyen, John Kelly, Catriona Mclean, Paul Bonnington, Lei Zhang, Victoria Mar, ZongYuan Ge

In this study, we propose a framework for automated early melanoma diagnosis using sequential dermoscopic images.

Melanoma Diagnosis

Paper
Add Code

Evaluation of Various Open-Set Medical Imaging Tasks with Deep Neural Networks

no code implementations • 21 Oct 2021 • ZongYuan Ge, Xin Wang

The current generation of deep neural networks has achieved close-to-human results on "closed-set" image recognition; that is, the classes being evaluated overlap with the training classes.

Decision Making Open Set Learning

Paper
Add Code

Hierarchical Knowledge Guided Learning for Real-world Retinal Diseases Recognition

no code implementations • 17 Nov 2021 • Lie Ju, Zhen Yu, Lin Wang, Xin Zhao, Xin Wang, Paul Bonnington, ZongYuan Ge

From a modeling perspective, most deep learning models trained on these datasets may lack the ability to generalize to rare diseases where only a few available samples are presented for training.

Knowledge Distillation

Paper
Add Code

Medical Visual Question Answering: A Survey

no code implementations • 19 Nov 2021 • Zhihong Lin, Donghao Zhang, Qingyi Tao, Danli Shi, Gholamreza Haffari, Qi Wu, Mingguang He, ZongYuan Ge

Medical Visual Question Answering~(VQA) is a combination of medical artificial intelligence and popular VQA challenges.

Medical Visual Question Answering Question Answering +1

Paper
Add Code

Anomaly Detection in Retinal Images using Multi-Scale Deep Feature Sparse Coding

no code implementations • 27 Jan 2022 • Sourya Dipta Das, Saikat Dutta, Nisarg A. Shah, Dwarikanath Mahapatra, ZongYuan Ge

Convolutional Neural Network models have successfully detected retinal illness from optical coherence tomography (OCT) and fundus images.

Anomaly Detection

Paper
Add Code

Label uncertainty-guided multi-stream model for disease screening

no code implementations • 28 Jan 2022 • Chi Liu, ZongYuan Ge, Mingguang He, Xiaotong Han

The main idea is dividing the images into simple and hard cases by uncertainty information, and then developing a multi-stream network to deal with different cases separately.

Paper
Add Code

Flexible Sampling for Long-tailed Skin Lesion Classification

no code implementations • 7 Apr 2022 • Lie Ju, Yicheng Wu, Lin Wang, Zhen Yu, Xin Zhao, Xin Wang, Paul Bonnington, ZongYuan Ge

To address this, in this paper, we propose a curriculum learning-based framework called Flexible Sampling for the long-tailed skin lesion classification task.

Classification Lesion Classification +1

Paper
Add Code

Camera Adaptation for Fundus-Image-Based CVD Risk Estimation

1 code implementation • 18 Jun 2022 • Zhihong Lin, Danli Shi, Donghao Zhang, Xianwen Shang, Mingguang He, ZongYuan Ge

Most high-quality retinography databases ready for research are collected from high-end fundus cameras, and there is a significant domain discrepancy between different cameras.

Paper
Code

Pseudo-Pair based Self-Similarity Learning for Unsupervised Person Re-identification

no code implementations • 9 Jul 2022 • Lin Wu, Deyin Liu, Wenying Zhang, Dapeng Chen, ZongYuan Ge, Farid Boussaid, Mohammed Bennamoun, Jialie Shen

In this paper, we present a pseudo-pair based self-similarity learning approach for unsupervised person re-ID without human annotations.

Unsupervised Person Re-Identification

Paper
Add Code

Leukocyte Classification using Multimodal Architecture Enhanced by Knowledge Distillation

no code implementations • 17 Aug 2022 • Litao Yang, Deval Mehta, Dwarikanath Mahapatra, ZongYuan Ge

Our unique contribution is two-fold - 1) We present a first of its kind multimodal WBC dataset for WBC classification; 2) We develop a high performing multimodal architecture which is also efficient and low in complexity at the same time.

Classification Knowledge Distillation

Paper
Add Code

Skin Lesion Recognition with Class-Hierarchy Regularized Hyperbolic Embeddings

no code implementations • 13 Sep 2022 • Zhen Yu, Toan Nguyen, Yaniv Gal, Lie Ju, Shekhar S. Chandra, Lei Zhang, Paul Bonnington, Victoria Mar, Zhiyong Wang, ZongYuan Ge

Accordingly, the learned prototypes preserve the semantic class relations in the embedding space and we can predict the label of an image by assigning its feature to the nearest hyperbolic class prototype.

Paper
Add Code

3D Matting: A Soft Segmentation Method Applied in Computed Tomography

no code implementations • 16 Sep 2022 • Lin Wang, Xiufen Ye, Donghao Zhang, Wanji He, Lie Ju, Xin Wang, Wei Feng, Kaimin Song, Xin Zhao, ZongYuan Ge

It can be caused by many factors, such as the imaging properties, pathological anatomy, and the weak representation of the binary masks, which brings challenges to accurate 3D segmentation.

Anatomy Image Matting

Paper
Add Code

3D Matting: A Benchmark Study on Soft Segmentation Method for Pulmonary Nodules Applied in Computed Tomography

no code implementations • 11 Oct 2022 • Lin Wang, Xiufen Ye, Donghao Zhang, Wanji He, Lie Ju, Yi Luo, Huan Luo, Xin Wang, Wei Feng, Kaimin Song, Xin Zhao, ZongYuan Ge

In this work, we introduce the image matting into the 3D scenes and use the alpha matte, i. e., a soft mask, to describe lesions in a 3D medical image.

Binarization Image Matting

Paper
Add Code

Multimorbidity Content-Based Medical Image Retrieval Using Proxies

no code implementations • 22 Nov 2022 • Yunyan Xing, Benjamin J. Meyer, Mehrtash Harandi, Tom Drummond, ZongYuan Ge

Medical imaging data, such as radiology images, are often multimorbidity; a single sample may have more than one pathology present.

Content-Based Image Retrieval Decision Making +3

Paper
Add Code

Towards Trustable Skin Cancer Diagnosis via Rewriting Model's Decision

no code implementations • CVPR 2023 • Siyuan Yan, Zhen Yu, Xuelin Zhang, Dwarikanath Mahapatra, Shekhar S. Chandra, Monika Janda, Peter Soyer, ZongYuan Ge

We introduce a human-in-the-loop framework in the model training process such that users can observe and correct the model's decision logic when confounding behaviors happen.

Decision Making

Paper
Add Code

Towards Open-Scenario Semi-supervised Medical Image Classification

no code implementations • 8 Apr 2023 • Lie Ju, Yicheng Wu, Wei Feng, Zhen Yu, Lin Wang, Zhuoting Zhu, ZongYuan Ge

Therefore, in this paper, we proposed a unified framework to leverage these unseen unlabeled data for open-scenario semi-supervised medical image classification.

Domain Adaptation Image Classification +1

Paper
Add Code

Ugly Ducklings or Swans: A Tiered Quadruplet Network with Patient-Specific Mining for Improved Skin Lesion Classification

no code implementations • 18 Sep 2023 • Nathasha Naranpanawa, H. Peter Soyer, Adam Mothershaw, Gayan K. Kulatilleke, ZongYuan Ge, Brigid Betz-Stablein, Shekhar S. Chandra

An ugly duckling is an obviously different skin lesion from surrounding lesions of an individual, and the ugly duckling sign is a criterion used to aid in the diagnosis of cutaneous melanoma by differentiating between highly suspicious and benign lesions.

Lesion Classification Metric Learning +1

Paper
Add Code

Privacy-preserving Early Detection of Epileptic Seizures in Videos

no code implementations • 15 Sep 2023 • Deval Mehta, Shobi Sivathamboo, Hugh Simpson, Patrick Kwan, Terence O`Brien, ZongYuan Ge

In this work, we contribute towards the development of video-based epileptic seizure classification by introducing a novel framework (SETR-PKD), which could achieve privacy-preserved early detection of seizures in videos.

Knowledge Distillation Optical Flow Estimation +2

Paper
Add Code

Towards Novel Class Discovery: A Study in Novel Skin Lesions Clustering

no code implementations • 28 Sep 2023 • Wei Feng, Lie Ju, Lin Wang, Kaimin Song, ZongYuan Ge

We conducted extensive experiments on the dermatology dataset ISIC 2019, and the experimental results show that our approach can effectively leverage knowledge from known categories to discover new semantic categories.

Clustering Contrastive Learning +2

Paper
Add Code

Revamping AI Models in Dermatology: Overcoming Critical Challenges for Enhanced Skin Lesion Diagnosis

no code implementations • 2 Nov 2023 • Deval Mehta, Brigid Betz-Stablein, Toan D Nguyen, Yaniv Gal, Adrian Bowling, Martin Haskett, Maithili Sashindranath, Paul Bonnington, Victoria Mar, H Peter Soyer, ZongYuan Ge

For a clinical image, our model generates three outputs: a hierarchical prediction, an alert for out-of-distribution images, and a recommendation for dermoscopy if clinical image alone is insufficient for diagnosis.

Paper
Add Code

A Multimodal Feature Distillation with CNN-Transformer Network for Brain Tumor Segmentation with Incomplete Modalities

1 code implementation • 22 Apr 2024 • Ming Kang, Fung Fung Ting, Raphaël C. -W. Phan, ZongYuan Ge, Chee-Ming Ting

Our ablation study demonstrates the importance of the proposed modules with CNN-Transformer networks and the convolutional blocks in Transformer for improving the performance of brain tumor segmentation with missing modalities.

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.