Search Results for author: Miaojing Shi

Found 38 papers, 14 papers with code

Large Model driven Radiology Report Generation with Clinical Quality Reinforcement Learning

no code implementations11 Mar 2024 Zijian Zhou, Miaojing Shi, Meng Wei, Oluwatosin Alabi, Zijie Yue, Tom Vercauteren

Finally, to better reflect the clinical significant and insignificant errors that radiologists would normally assign in the report, we introduce a novel clinical quality reinforcement learning strategy.

Language Modelling Large Language Model +1

AdaTreeFormer: Few Shot Domain Adaptation for Tree Counting from a Single High-Resolution Image

no code implementations5 Feb 2024 Hamed Amini Amirkolaee, Miaojing Shi, Lianghua He, Mark Mulligan

For the latter, an attention-to-adapt mechanism is introduced to distill relevant information from different domains while generating tree density maps; a hierarchical cross-domain feature alignment scheme is proposed that progressively aligns the features from the source and target domains.

Domain Adaptation

Multitask Learning in Minimally Invasive Surgical Vision: A Review

no code implementations16 Jan 2024 Oluwatosin Alabi, Tom Vercauteren, Miaojing Shi

Recent advancements in machine learning and computer vision have led to successful applications in analyzing videos obtained from MIS with the promise of alleviating challenges in MIS videos.

Action Understanding

Boosting Object Detection with Zero-Shot Day-Night Domain Adaptation

2 code implementations2 Dec 2023 Zhipeng Du, Miaojing Shi, Jiankang Deng

Previous methods mitigate this issue by exploring image enhancement or object detection techniques with real low-light image datasets.

Domain Adaptation Image Enhancement +3

VLPrompt: Vision-Language Prompting for Panoptic Scene Graph Generation

1 code implementation27 Nov 2023 Zijian Zhou, Miaojing Shi, Holger Caesar

Panoptic Scene Graph Generation (PSG) aims at achieving a comprehensive image understanding by simultaneously segmenting objects and predicting relations among objects.

Graph Generation Panoptic Scene Graph Generation +1

IMITATE: Clinical Prior Guided Hierarchical Vision-Language Pre-training

no code implementations11 Oct 2023 Che Liu, Sibo Cheng, Miaojing Shi, Anand Shah, Wenjia Bai, Rossella Arcucci

The framework derives multi-level visual features from the chest X-ray (CXR) images and separately aligns these features with the descriptive and the conclusive text encoded in the hierarchical medical report.

Contrastive Learning Descriptive

An investigation into the impact of deep learning model choice on sex and race bias in cardiac MR segmentation

no code implementations25 Aug 2023 Tiarna Lee, Esther Puyol-Antón, Bram Ruijsink, Keana Aitcheson, Miaojing Shi, Andrew P. King

However, the severity and nature of the bias varies between the models, highlighting the importance of model choice when attempting to train fair AI-based segmentation models for medical imaging tasks.

Image Segmentation Segmentation +1

SegMatch: A semi-supervised learning method for surgical instrument segmentation

no code implementations9 Aug 2023 Meng Wei, Charlie Budd, Luis C. Garcia-Peraza-Herrera, Reuben Dorent, Miaojing Shi, Tom Vercauteren

Surgical instrument segmentation is recognised as a key enabler to provide advanced surgical assistance and improve computer assisted interventions.

Pseudo Label Segmentation +1

TreeFormer: a Semi-Supervised Transformer-based Framework for Tree Counting from a Single High Resolution Image

1 code implementation12 Jul 2023 Hamed Amini Amirkolaee, Miaojing Shi, Mark Mulligan

Automatic tree density estimation and counting using single aerial and satellite images is a challenging task in photogrammetry and remote sensing, yet has an important role in forest management.

Density Estimation

Multi-modal Large Language Model Enhanced Pseudo 3D Perception Framework for Visual Commonsense Reasoning

no code implementations30 Jan 2023 Jian Zhu, Hanli Wang, Miaojing Shi

On the other hand, BLIP-2 as an MLLM is employed to process images and texts, and the referring expressions in texts involving specific visual objects are modified with linguistic object labels to serve as comprehensible MLLM inputs.

Language Modelling Large Language Model +1

Domain-General Crowd Counting in Unseen Scenarios

1 code implementation5 Dec 2022 Zhipeng Du, Jiankang Deng, Miaojing Shi

In this paper, we aim to train a model based on a single source domain which can generalize well on any unseen domain.

Crowd Counting Disentanglement +2

Facial Video-based Remote Physiological Measurement via Self-supervised Learning

1 code implementation27 Oct 2022 Zijie Yue, Miaojing Shi, Shuai Ding

Facial video-based remote physiological measurement aims to estimate remote photoplethysmography (rPPG) signals from human face videos and then measure multiple vital signs (e. g. heart rate, respiration frequency) from rPPG signals.

Heart Rate Variability Photoplethysmography (PPG) +1

A systematic study of race and sex bias in CNN-based cardiac MR segmentation

no code implementations4 Sep 2022 Tiarna Lee, Esther Puyol-Anton, Bram Ruijsink, Miaojing Shi, Andrew P. King

We present the first systematic study of the impact of training set imbalance on race and sex bias in CNN-based segmentation.

Management Segmentation

Redesigning Multi-Scale Neural Network for Crowd Counting

1 code implementation4 Aug 2022 Zhipeng Du, Miaojing Shi, Jiankang Deng, Stefanos Zafeiriou

In this work, we redesign the multi-scale neural network by introducing a hierarchical mixture of density experts, which hierarchically merges multi-scale density maps for crowd counting.

Crowd Counting

Enhancing Space-time Video Super-resolution via Spatial-temporal Feature Interaction

1 code implementation18 Jul 2022 Zijie Yue, Miaojing Shi

A popular solution is to first increase the frame rate of the video; then perform feature refinement among different frame features; and last increase the spatial resolutions of these features.

Space-time Video Super-resolution Video Super-Resolution

Dam reservoir extraction from remote sensing imagery using tailored metric learning strategies

1 code implementation12 Jul 2022 Arnout van Soesbergen, Zedong Chu, Miaojing Shi, Mark Mulligan

Extensive experiments were conducted on this benchmark in the water body segmentation task, dam reservoir recognition task, and the joint dam reservoir extraction task.

Metric Learning Segmentation

Boosting Zero-shot Learning via Contrastive Optimization of Attribute Representations

1 code implementation8 Jul 2022 Yu Du, Miaojing Shi, Fangyun Wei, Guoqi Li

In this paper, we propose a new framework to boost ZSL by explicitly learning attribute prototypes beyond images and contrastively optimizing them with attribute-level features within images.

Attribute Zero-Shot Learning

Learning to Prompt for Open-Vocabulary Object Detection with Vision-Language Model

1 code implementation CVPR 2022 Yu Du, Fangyun Wei, Zihe Zhang, Miaojing Shi, Yue Gao, Guoqi Li

In this paper, we introduce a novel method, detection prompt (DetPro), to learn continuous prompt representations for open-vocabulary object detection based on the pre-trained vision-language model.

Image Classification Language Modelling +5

MFNet: Multi-class Few-shot Segmentation Network with Pixel-wise Metric Learning

no code implementations30 Oct 2021 Miao Zhang, Miaojing Shi, Li Li

Last, to enhance the embedding space learning, an additional pixel-wise metric learning module is introduced with triplet loss formulated on the pixel-level embedding of the input image.

Few-Shot Semantic Segmentation Image Classification +3

Learning to Recommend Items to Wikidata Editors

no code implementations13 Jul 2021 Kholoud Alghamdi, Miaojing Shi, Elena Simperl

The system uses a hybrid of content-based and collaborative filtering techniques to rank items for editors relying on both item features and item-editor previous interaction.

Collaborative Filtering Recommendation Systems

Fast Fourier Intrinsic Network

no code implementations9 Nov 2020 Yanlin Qian, Miaojing Shi, Joni-Kristian Kämäräinen, Jiri Matas

We address the problem of decomposing an image into albedo and shading.

Restoring Negative Information in Few-Shot Object Detection

1 code implementation NeurIPS 2020 Yukuan Yang, Fangyun Wei, Miaojing Shi, Guoqi Li

In this paper, we restore the negative information in few-shot object detection by introducing a new negative- and positive-representative based metric learning framework and a new inference scheme with negative and positive representatives.

Few-Shot Learning Few-Shot Object Detection +4

Towards Unsupervised Crowd Counting via Regression-Detection Bi-knowledge Transfer

no code implementations12 Aug 2020 Yuting Liu, Zheng Wang, Miaojing Shi, Shin'ichi Satoh, Qijun Zhao, Hongyu Yang

We formulate the mutual transformations between the outputs of regression- and detection-based models as two scene-agnostic transformers which enable knowledge distillation between the two models.

Crowd Counting Knowledge Distillation +3

Defending Adversarial Examples via DNN Bottleneck Reinforcement

no code implementations12 Aug 2020 Wenqing Liu, Miaojing Shi, Teddy Furon, Li Li

This paper presents a DNN bottleneck reinforcement scheme to alleviate the vulnerability of Deep Neural Networks (DNN) against adversarial attacks.

Training Object Detectors from Few Weakly-Labeled and Many Unlabeled Images

no code implementations arXiv 2019 Zhaohui Yang, Miaojing Shi, Chao Xu, Vittorio Ferrari, Yannis Avrithis

Weakly-supervised object detection attempts to limit the amount of supervision by dispensing the need for bounding boxes, but still assumes image-level labels on the entire training set.

Ranked #23 on Weakly Supervised Object Detection on PASCAL VOC 2012 test (using extra training data)

object-detection Weakly Supervised Object Detection

Point in, Box out: Beyond Counting Persons in Crowds

no code implementations CVPR 2019 Yuting Liu, Miaojing Shi, Qijun Zhao, Xiaofang Wang

In the end, we propose a curriculum learning strategy to train the network from images of relatively accurate and easy pseudo ground truth first.

Crowd Counting regression

Revisiting Perspective Information for Efficient Crowd Counting

no code implementations CVPR 2019 Miaojing Shi, Zhaohui Yang, Chao Xu, Qijun Chen

Modern crowd counting methods employ deep neural networks to estimate crowd counts via crowd density regressions.

Crowd Counting regression

Crowd counting via scale-adaptive convolutional neural network

1 code implementation13 Nov 2017 Lu Zhang, Miaojing Shi, Qiaobo Chen

The task of crowd counting is to automatically estimate the pedestrian number in crowd images.

Crowd Counting

Weakly Supervised Object Localization Using Things and Stuff Transfer

no code implementations ICCV 2017 Miaojing Shi, Holger Caesar, Vittorio Ferrari

We propose to help weakly supervised object localization for classes where location annotations are not available, by transferring things and stuff knowledge from a source set with available annotations.

Multiple Instance Learning Object +2

Weakly Supervised Object Localization Using Size Estimates

no code implementations15 Aug 2016 Miaojing Shi, Vittorio Ferrari

We present a technique for weakly supervised object localization (WSOL), building on the observation that WSOL algorithms usually work better on images with bigger objects.

Object Weakly-Supervised Object Localization

Early Burst Detection for Memory-Efficient Image Retrieval

no code implementations CVPR 2015 Miaojing Shi, Yannis Avrithis, Herve Jegou

Then, we show the interest of using this strategy in an asymmetrical manner, with only the database features being aggregated but not those of the query.

Image Retrieval Retrieval

Cannot find the paper you are looking for? You can Submit a new open access paper.