Search Results for author: Miaojing Shi

Found 49 papers, 24 papers with code

Does a Rising Tide Lift All Boats? Bias Mitigation for AI-based CMR Segmentation

1 code implementation21 Mar 2025 Tiarna Lee, Esther Puyol-Antón, Bram Ruijsink, Miaojing Shi, Andrew P. King

We aim to investigate the impact of common bias mitigation methods to address bias between Black and White subjects in AI-based CMR segmentation models.

All Image Segmentation +1

Enhancing Generalized Few-Shot Semantic Segmentation via Effective Knowledge Transfer

1 code implementation20 Dec 2024 Xinyue Chen, Miaojing Shi, Zijian Zhou, Lianghua He, Sophia Tsoka

Furthermore, existing GFSS approaches suffer from a lack of contextual information for novel classes due to their limited samples, we thereby introduce a context consistency learning scheme to transfer the contextual knowledge from base to novel classes.

Classifier calibration Generalized Few-Shot Semantic Segmentation +2

SEG-SAM: Semantic-Guided SAM for Unified Medical Image Segmentation

no code implementations17 Dec 2024 Shuangping Huang, Hao Liang, Qingfeng Wang, Chulong Zhong, Zijian Zhou, Miaojing Shi

First, to avoid the potential conflict between binary and semantic predictions, we introduce a semantic-aware decoder independent of SAM's original decoder, specialized for both semantic segmentation on the prompted object and classification on unprompted objects in images.

Decoder Image Segmentation +3

Optimizing Dense Visual Predictions Through Multi-Task Coherence and Prioritization

no code implementations4 Dec 2024 Maxime Fontana, Michael Spratling, Miaojing Shi

To enhance cross-task coherence, we introduce a trace-back method that improves both cross-task geometric and predictive features.

Multi-Task Learning

Aligning Few-Step Diffusion Models with Dense Reward Difference Learning

1 code implementation18 Nov 2024 Ziyi Zhang, Li Shen, Sen Zhang, Deheng Ye, Yong Luo, Miaojing Shi, Bo Du, DaCheng Tao

Experimental results demonstrate that SDPO consistently outperforms prior methods in reward-based alignment across diverse step configurations, underscoring its robust step generalization capabilities.

Denoising

OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models

1 code implementation15 Jul 2024 Zijian Zhou, Zheng Zhu, Holger Caesar, Miaojing Shi

In this paper, we focus on the task of open-set relation prediction integrated with a pretrained open-set panoptic segmentation model to achieve true open-set panoptic scene graph generation (OpenPSG).

Graph Generation object-detection +6

Bootstrapping Vision-language Models for Self-supervised Remote Physiological Measurement

no code implementations11 Jul 2024 Zijie Yue, Miaojing Shi, Hanli Wang, Shuai Ding, Qijun Chen, Shanlin Yang

Next, we introduce a frequency-oriented vision-text pair generation method by carefully creating contrastive spatio-temporal maps from positive and negative samples and designing proper text prompts to describe their relative ratios of signal frequencies.

Contrastive Learning Photoplethysmography (PPG) +1

CholecInstanceSeg: A Tool Instance Segmentation Dataset for Laparoscopic Surgery

1 code implementation23 Jun 2024 Oluwatosin Alabi, Ko Ko Zayar Toe, Zijian Zhou, Charlie Budd, Nicholas Raison, Miaojing Shi, Tom Vercauteren

Derived from the existing CholecT50 and Cholec80 datasets, CholecInstanceSeg provides novel annotations for laparoscopic cholecystectomy procedures in patients.

Instance Segmentation Segmentation +1

Memory-guided Network with Uncertainty-based Feature Augmentation for Few-shot Semantic Segmentation

no code implementations1 Jun 2024 Xinyue Chen, Miaojing Shi

To alleviate this dependence, few-shot semantic segmentation (FSS) is introduced to leverage the model trained on base classes with sufficient data into the segmentation of novel classes with few data.

Few-Shot Semantic Segmentation Segmentation +1

Enhancing surgical instrument segmentation: integrating vision transformer insights with adapter

1 code implementation International Journal of Computer Assisted Radiology and Surgery 2024 Meng Wei, Miaojing Shi, Tom Vercauteren

Given the scarcity of annotated data in this field, our work aims to develop a model that achieves competitive performance with training on limited datasets, while also enhancing model robustness in various surgical scenarios.

Image Segmentation Medical Image Segmentation +2

Large Model driven Radiology Report Generation with Clinical Quality Reinforcement Learning

no code implementations11 Mar 2024 Zijian Zhou, Miaojing Shi, Meng Wei, Oluwatosin Alabi, Zijie Yue, Tom Vercauteren

Finally, to better reflect the clinical significant and insignificant errors that radiologists would normally assign in the report, we introduce a novel clinical quality reinforcement learning strategy.

Decoder Language Modeling +4

AdaTreeFormer: Few Shot Domain Adaptation for Tree Counting from a Single High-Resolution Image

1 code implementation5 Feb 2024 Hamed Amini Amirkolaee, Miaojing Shi, Lianghua He, Mark Mulligan

Experimental results show that AdaTreeFormer significantly surpasses the state of the art, \eg in the cross domain from the Yosemite to Jiangsu dataset, it achieves a reduction of 15. 9 points in terms of the absolute counting errors and an increase of 10. 8\% in the accuracy of the detected trees' locations.

Domain Adaptation

Multitask Learning in Minimally Invasive Surgical Vision: A Review

no code implementations16 Jan 2024 Oluwatosin Alabi, Tom Vercauteren, Miaojing Shi

Recent advancements in machine learning and computer vision have led to successful applications in analyzing videos obtained from MIS with the promise of alleviating challenges in MIS videos.

Action Understanding

Boosting Object Detection with Zero-Shot Day-Night Domain Adaptation

2 code implementations CVPR 2024 Zhipeng Du, Miaojing Shi, Jiankang Deng

Previous methods mitigate this issue by exploring image enhancement or object detection techniques with real low-light image datasets.

Domain Adaptation Image Enhancement +3

VLPrompt: Vision-Language Prompting for Panoptic Scene Graph Generation

3 code implementations27 Nov 2023 Zijian Zhou, Miaojing Shi, Holger Caesar

Leveraging the recent progress in Large Language Models (LLMs), we propose to use language information to assist relation prediction, particularly for rare relations.

Graph Generation Panoptic Scene Graph Generation +2

IMITATE: Clinical Prior Guided Hierarchical Vision-Language Pre-training

1 code implementation11 Oct 2023 Che Liu, Sibo Cheng, Miaojing Shi, Anand Shah, Wenjia Bai, Rossella Arcucci

The framework derives multi-level visual features from the chest X-ray (CXR) images and separately aligns these features with the descriptive and the conclusive text encoded in the hierarchical medical report.

Contrastive Learning Descriptive

An investigation into the impact of deep learning model choice on sex and race bias in cardiac MR segmentation

no code implementations25 Aug 2023 Tiarna Lee, Esther Puyol-Antón, Bram Ruijsink, Keana Aitcheson, Miaojing Shi, Andrew P. King

However, the severity and nature of the bias varies between the models, highlighting the importance of model choice when attempting to train fair AI-based segmentation models for medical imaging tasks.

Image Segmentation Segmentation +1

SegMatch: A semi-supervised learning method for surgical instrument segmentation

no code implementations9 Aug 2023 Meng Wei, Charlie Budd, Luis C. Garcia-Peraza-Herrera, Reuben Dorent, Miaojing Shi, Tom Vercauteren

Surgical instrument segmentation is recognised as a key enabler to provide advanced surgical assistance and improve computer assisted interventions.

Ranked #4 on Medical Image Segmentation on ROBUST-MIS (using extra training data)

Medical Image Segmentation Pseudo Label +2

When Multi-Task Learning Meets Partial Supervision: A Computer Vision Review

1 code implementation25 Jul 2023 Maxime Fontana, Michael Spratling, Miaojing Shi

Second, it presents the different challenges arising from such a multi-objective optimisation scheme.

Benchmarking Multi-Task Learning

TreeFormer: a Semi-Supervised Transformer-based Framework for Tree Counting from a Single High Resolution Image

1 code implementation12 Jul 2023 Hamed Amini Amirkolaee, Miaojing Shi, Mark Mulligan

Automatic tree density estimation and counting using single aerial and satellite images is a challenging task in photogrammetry and remote sensing, yet has an important role in forest management.

Decoder Density Estimation

Multi-modal Large Language Model Enhanced Pseudo 3D Perception Framework for Visual Commonsense Reasoning

no code implementations30 Jan 2023 Jian Zhu, Hanli Wang, Miaojing Shi

On the other hand, BLIP-2 as an MLLM is employed to process images and texts, and the referring expressions in texts involving specific visual objects are modified with linguistic object labels to serve as comprehensible MLLM inputs.

Language Modeling Language Modelling +2

Domain-General Crowd Counting in Unseen Scenarios

1 code implementation5 Dec 2022 Zhipeng Du, Jiankang Deng, Miaojing Shi

In this paper, we aim to train a model based on a single source domain which can generalize well on any unseen domain.

Crowd Counting Disentanglement +2

Facial Video-based Remote Physiological Measurement via Self-supervised Learning

1 code implementation27 Oct 2022 Zijie Yue, Miaojing Shi, Shuai Ding

Facial video-based remote physiological measurement aims to estimate remote photoplethysmography (rPPG) signals from human face videos and then measure multiple vital signs (e. g. heart rate, respiration frequency) from rPPG signals.

Heart Rate Variability Photoplethysmography (PPG) +1

A systematic study of race and sex bias in CNN-based cardiac MR segmentation

no code implementations4 Sep 2022 Tiarna Lee, Esther Puyol-Anton, Bram Ruijsink, Miaojing Shi, Andrew P. King

We present the first systematic study of the impact of training set imbalance on race and sex bias in CNN-based segmentation.

Management Segmentation

Redesigning Multi-Scale Neural Network for Crowd Counting

1 code implementation4 Aug 2022 Zhipeng Du, Miaojing Shi, Jiankang Deng, Stefanos Zafeiriou

In this work, we redesign the multi-scale neural network by introducing a hierarchical mixture of density experts, which hierarchically merges multi-scale density maps for crowd counting.

Crowd Counting

Enhancing Space-time Video Super-resolution via Spatial-temporal Feature Interaction

1 code implementation18 Jul 2022 Zijie Yue, Miaojing Shi

A popular solution is to first increase the frame rate of the video; then perform feature refinement among different frame features; and last increase the spatial resolutions of these features.

Space-time Video Super-resolution Video Super-Resolution

Dam reservoir extraction from remote sensing imagery using tailored metric learning strategies

1 code implementation12 Jul 2022 Arnout van Soesbergen, Zedong Chu, Miaojing Shi, Mark Mulligan

Extensive experiments were conducted on this benchmark in the water body segmentation task, dam reservoir recognition task, and the joint dam reservoir extraction task.

Metric Learning Segmentation

Boosting Zero-shot Learning via Contrastive Optimization of Attribute Representations

1 code implementation8 Jul 2022 Yu Du, Miaojing Shi, Fangyun Wei, Guoqi Li

In this paper, we propose a new framework to boost ZSL by explicitly learning attribute prototypes beyond images and contrastively optimizing them with attribute-level features within images.

Attribute Zero-Shot Learning

Learning to Prompt for Open-Vocabulary Object Detection with Vision-Language Model

1 code implementation CVPR 2022 Yu Du, Fangyun Wei, Zihe Zhang, Miaojing Shi, Yue Gao, Guoqi Li

In this paper, we introduce a novel method, detection prompt (DetPro), to learn continuous prompt representations for open-vocabulary object detection based on the pre-trained vision-language model.

Image Classification Language Modeling +7

MFNet: Multi-class Few-shot Segmentation Network with Pixel-wise Metric Learning

no code implementations30 Oct 2021 Miao Zhang, Miaojing Shi, Li Li

Last, to enhance the embedding space learning, an additional pixel-wise metric learning module is introduced with triplet loss formulated on the pixel-level embedding of the input image.

Few-Shot Semantic Segmentation Image Classification +4

Learning to Recommend Items to Wikidata Editors

no code implementations13 Jul 2021 Kholoud Alghamdi, Miaojing Shi, Elena Simperl

The system uses a hybrid of content-based and collaborative filtering techniques to rank items for editors relying on both item features and item-editor previous interaction.

Collaborative Filtering Recommendation Systems

Fast Fourier Intrinsic Network

no code implementations9 Nov 2020 Yanlin Qian, Miaojing Shi, Joni-Kristian Kämäräinen, Jiri Matas

We address the problem of decomposing an image into albedo and shading.

Restoring Negative Information in Few-Shot Object Detection

1 code implementation NeurIPS 2020 Yukuan Yang, Fangyun Wei, Miaojing Shi, Guoqi Li

In this paper, we restore the negative information in few-shot object detection by introducing a new negative- and positive-representative based metric learning framework and a new inference scheme with negative and positive representatives.

Few-Shot Learning Few-Shot Object Detection +4

Defending Adversarial Examples via DNN Bottleneck Reinforcement

no code implementations12 Aug 2020 Wenqing Liu, Miaojing Shi, Teddy Furon, Li Li

This paper presents a DNN bottleneck reinforcement scheme to alleviate the vulnerability of Deep Neural Networks (DNN) against adversarial attacks.

Towards Unsupervised Crowd Counting via Regression-Detection Bi-knowledge Transfer

no code implementations12 Aug 2020 Yuting Liu, Zheng Wang, Miaojing Shi, Shin'ichi Satoh, Qijun Zhao, Hongyu Yang

We formulate the mutual transformations between the outputs of regression- and detection-based models as two scene-agnostic transformers which enable knowledge distillation between the two models.

Crowd Counting Knowledge Distillation +3

Training Object Detectors from Few Weakly-Labeled and Many Unlabeled Images

no code implementations arXiv 2019 Zhaohui Yang, Miaojing Shi, Chao Xu, Vittorio Ferrari, Yannis Avrithis

Weakly-supervised object detection attempts to limit the amount of supervision by dispensing the need for bounding boxes, but still assumes image-level labels on the entire training set.

Ranked #29 on Weakly Supervised Object Detection on PASCAL VOC 2012 test (using extra training data)

object-detection Weakly Supervised Object Detection

Point in, Box out: Beyond Counting Persons in Crowds

no code implementations CVPR 2019 Yuting Liu, Miaojing Shi, Qijun Zhao, Xiaofang Wang

In the end, we propose a curriculum learning strategy to train the network from images of relatively accurate and easy pseudo ground truth first.

Crowd Counting regression

Revisiting Perspective Information for Efficient Crowd Counting

no code implementations CVPR 2019 Miaojing Shi, Zhaohui Yang, Chao Xu, Qijun Chen

Modern crowd counting methods employ deep neural networks to estimate crowd counts via crowd density regressions.

Crowd Counting regression

Crowd counting via scale-adaptive convolutional neural network

1 code implementation13 Nov 2017 Lu Zhang, Miaojing Shi, Qiaobo Chen

The task of crowd counting is to automatically estimate the pedestrian number in crowd images.

Crowd Counting

Weakly Supervised Object Localization Using Things and Stuff Transfer

no code implementations ICCV 2017 Miaojing Shi, Holger Caesar, Vittorio Ferrari

We propose to help weakly supervised object localization for classes where location annotations are not available, by transferring things and stuff knowledge from a source set with available annotations.

Multiple Instance Learning Object +2

Weakly Supervised Object Localization Using Size Estimates

no code implementations15 Aug 2016 Miaojing Shi, Vittorio Ferrari

We present a technique for weakly supervised object localization (WSOL), building on the observation that WSOL algorithms usually work better on images with bigger objects.

Object Weakly-Supervised Object Localization

Early Burst Detection for Memory-Efficient Image Retrieval

no code implementations CVPR 2015 Miaojing Shi, Yannis Avrithis, Herve Jegou

Then, we show the interest of using this strategy in an asymmetrical manner, with only the database features being aggregated but not those of the query.

Image Retrieval Retrieval

Cannot find the paper you are looking for? You can Submit a new open access paper.