Search Results for author: Miaojing Shi

Found 38 papers, 14 papers with code

Large Model driven Radiology Report Generation with Clinical Quality Reinforcement Learning

no code implementations • 11 Mar 2024 • Zijian Zhou, Miaojing Shi, Meng Wei, Oluwatosin Alabi, Zijie Yue, Tom Vercauteren

Finally, to better reflect the clinical significant and insignificant errors that radiologists would normally assign in the report, we introduce a novel clinical quality reinforcement learning strategy.

Language Modelling Large Language Model +1

Paper
Add Code

AdaTreeFormer: Few Shot Domain Adaptation for Tree Counting from a Single High-Resolution Image

no code implementations • 5 Feb 2024 • Hamed Amini Amirkolaee, Miaojing Shi, Lianghua He, Mark Mulligan

For the latter, an attention-to-adapt mechanism is introduced to distill relevant information from different domains while generating tree density maps; a hierarchical cross-domain feature alignment scheme is proposed that progressively aligns the features from the source and target domains.

Domain Adaptation

Paper
Add Code

Multitask Learning in Minimally Invasive Surgical Vision: A Review

no code implementations • 16 Jan 2024 • Oluwatosin Alabi, Tom Vercauteren, Miaojing Shi

Recent advancements in machine learning and computer vision have led to successful applications in analyzing videos obtained from MIS with the promise of alleviating challenges in MIS videos.

Action Understanding

Paper
Add Code

Boosting Object Detection with Zero-Shot Day-Night Domain Adaptation

2 code implementations • 2 Dec 2023 • Zhipeng Du, Miaojing Shi, Jiankang Deng

Previous methods mitigate this issue by exploring image enhancement or object detection techniques with real low-light image datasets.

Domain Adaptation Image Enhancement +3

Paper
Code

VLPrompt: Vision-Language Prompting for Panoptic Scene Graph Generation

1 code implementation • 27 Nov 2023 • Zijian Zhou, Miaojing Shi, Holger Caesar

Panoptic Scene Graph Generation (PSG) aims at achieving a comprehensive image understanding by simultaneously segmenting objects and predicting relations among objects.

Ranked #1 on Panoptic Scene Graph Generation on PSG Dataset

Graph Generation Panoptic Scene Graph Generation +1

Paper
Code

IMITATE: Clinical Prior Guided Hierarchical Vision-Language Pre-training

no code implementations • 11 Oct 2023 • Che Liu, Sibo Cheng, Miaojing Shi, Anand Shah, Wenjia Bai, Rossella Arcucci

The framework derives multi-level visual features from the chest X-ray (CXR) images and separately aligns these features with the descriptive and the conclusive text encoded in the hierarchical medical report.

Contrastive Learning Descriptive

Paper
Add Code

An investigation into the impact of deep learning model choice on sex and race bias in cardiac MR segmentation

no code implementations • 25 Aug 2023 • Tiarna Lee, Esther Puyol-Antón, Bram Ruijsink, Keana Aitcheson, Miaojing Shi, Andrew P. King

However, the severity and nature of the bias varies between the models, highlighting the importance of model choice when attempting to train fair AI-based segmentation models for medical imaging tasks.

Image Segmentation Segmentation +1

Paper
Add Code

SegMatch: A semi-supervised learning method for surgical instrument segmentation

no code implementations • 9 Aug 2023 • Meng Wei, Charlie Budd, Luis C. Garcia-Peraza-Herrera, Reuben Dorent, Miaojing Shi, Tom Vercauteren

Surgical instrument segmentation is recognised as a key enabler to provide advanced surgical assistance and improve computer assisted interventions.

Pseudo Label Segmentation +1

Paper
Add Code

When Multi-Task Learning Meets Partial Supervision: A Computer Vision Review

no code implementations • 25 Jul 2023 • Maxime Fontana, Michael Spratling, Miaojing Shi

Second, it presents the different challenges arising from such a multi-objective optimisation scheme.

Benchmarking Multi-Task Learning

Paper
Add Code

TreeFormer: a Semi-Supervised Transformer-based Framework for Tree Counting from a Single High Resolution Image

1 code implementation • 12 Jul 2023 • Hamed Amini Amirkolaee, Miaojing Shi, Mark Mulligan

Automatic tree density estimation and counting using single aerial and satellite images is a challenging task in photogrammetry and remote sensing, yet has an important role in forest management.

Density Estimation

Paper
Code

LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation

1 code implementation • 14 Jun 2023 • Linfeng Yuan, Miaojing Shi, Zijie Yue, Qijun Chen

Referring video object segmentation (RVOS) aims to segment the target instance referred by a given text expression in a video clip.

Ranked #12 on Referring Expression Segmentation on Refer-YouTube-VOS (2021 public validation) (using extra training data)

Referring Expression Segmentation Referring Video Object Segmentation +2

Paper
Code

HiLo: Exploiting High Low Frequency Relations for Unbiased Panoptic Scene Graph Generation

1 code implementation • ICCV 2023 • Zijian Zhou, Miaojing Shi, Holger Caesar

Existing unbiased methods tackle the long-tail problem by data/loss rebalancing to favor low-frequency relations.

Ranked #2 on Panoptic Scene Graph Generation on PSG Dataset

Panoptic Scene Graph Generation Scene Understanding

Paper
Code

Multi-modal Large Language Model Enhanced Pseudo 3D Perception Framework for Visual Commonsense Reasoning

no code implementations • 30 Jan 2023 • Jian Zhu, Hanli Wang, Miaojing Shi

On the other hand, BLIP-2 as an MLLM is employed to process images and texts, and the referring expressions in texts involving specific visual objects are modified with linguistic object labels to serve as comprehensible MLLM inputs.

Language Modelling Large Language Model +1

Paper
Add Code

Domain-General Crowd Counting in Unseen Scenarios

1 code implementation • 5 Dec 2022 • Zhipeng Du, Jiankang Deng, Miaojing Shi

In this paper, we aim to train a model based on a single source domain which can generalize well on any unseen domain.

Crowd Counting Disentanglement +2

Paper
Code

Exploring and Eliciting Needs and Preferences from Editors for Wikidata Recommendations

no code implementations • 4 Dec 2022 • Kholoud Alghamdi, Miaojing Shi, Elena Simperl

Our aim with this paper is to elicit the user requirements for a Wikidata recommendations system.

Recommendation Systems

Paper
Add Code

$α$ DARTS Once More: Enhancing Differentiable Architecture Search by Masked Image Modeling

no code implementations • 18 Nov 2022 • Bicheng Guo, Shuxuan Guo, Miaojing Shi, Peng Chen, Shibo He, Jiming Chen, Kaicheng Yu

Differentiable architecture search (DARTS) has been a mainstream direction in automatic machine learning.

Ranked #10 on Neural Architecture Search on NAS-Bench-201, CIFAR-100

Neural Architecture Search

Paper
Add Code

Facial Video-based Remote Physiological Measurement via Self-supervised Learning

1 code implementation • 27 Oct 2022 • Zijie Yue, Miaojing Shi, Shuai Ding

Facial video-based remote physiological measurement aims to estimate remote photoplethysmography (rPPG) signals from human face videos and then measure multiple vital signs (e. g. heart rate, respiration frequency) from rPPG signals.

Heart Rate Variability Photoplethysmography (PPG) +1

Paper
Code

A systematic study of race and sex bias in CNN-based cardiac MR segmentation

no code implementations • 4 Sep 2022 • Tiarna Lee, Esther Puyol-Anton, Bram Ruijsink, Miaojing Shi, Andrew P. King

We present the first systematic study of the impact of training set imbalance on race and sex bias in CNN-based segmentation.

Management Segmentation

Paper
Add Code

Redesigning Multi-Scale Neural Network for Crowd Counting

1 code implementation • 4 Aug 2022 • Zhipeng Du, Miaojing Shi, Jiankang Deng, Stefanos Zafeiriou

In this work, we redesign the multi-scale neural network by introducing a hierarchical mixture of density experts, which hierarchically merges multi-scale density maps for crowd counting.

Crowd Counting

Paper
Code

Enhancing Space-time Video Super-resolution via Spatial-temporal Feature Interaction

1 code implementation • 18 Jul 2022 • Zijie Yue, Miaojing Shi

A popular solution is to first increase the frame rate of the video; then perform feature refinement among different frame features; and last increase the spatial resolutions of these features.

Space-time Video Super-resolution Video Super-Resolution

Paper
Code

Dam reservoir extraction from remote sensing imagery using tailored metric learning strategies

1 code implementation • 12 Jul 2022 • Arnout van Soesbergen, Zedong Chu, Miaojing Shi, Mark Mulligan

Extensive experiments were conducted on this benchmark in the water body segmentation task, dam reservoir recognition task, and the joint dam reservoir extraction task.

Metric Learning Segmentation

Paper
Code

Boosting Zero-shot Learning via Contrastive Optimization of Attribute Representations

1 code implementation • 8 Jul 2022 • Yu Du, Miaojing Shi, Fangyun Wei, Guoqi Li

In this paper, we propose a new framework to boost ZSL by explicitly learning attribute prototypes beyond images and contrastively optimizing them with attribute-level features within images.

Attribute Zero-Shot Learning

Paper
Code

Learning to Prompt for Open-Vocabulary Object Detection with Vision-Language Model

1 code implementation • CVPR 2022 • Yu Du, Fangyun Wei, Zihe Zhang, Miaojing Shi, Yue Gao, Guoqi Li

In this paper, we introduce a novel method, detection prompt (DetPro), to learn continuous prompt representations for open-vocabulary object detection based on the pre-trained vision-language model.

Image Classification Language Modelling +5

159

Paper
Code

MFNet: Multi-class Few-shot Segmentation Network with Pixel-wise Metric Learning

no code implementations • 30 Oct 2021 • Miao Zhang, Miaojing Shi, Li Li

Last, to enhance the embedding space learning, an additional pixel-wise metric learning module is introduced with triplet loss formulated on the pixel-level embedding of the input image.

Few-Shot Semantic Segmentation Image Classification +3

Paper
Add Code

Learning to Recommend Items to Wikidata Editors

no code implementations • 13 Jul 2021 • Kholoud Alghamdi, Miaojing Shi, Elena Simperl

The system uses a hybrid of content-based and collaborative filtering techniques to rank items for editors relying on both item features and item-editor previous interaction.

Collaborative Filtering Recommendation Systems

Paper
Add Code

Detecting Human-Object Interaction with Mixed Supervision

no code implementations • 10 Nov 2020 • Suresh Kirthi Kumaraswamy, Miaojing Shi, Ewa Kijak

Human object interaction (HOI) detection is an important task in image understanding and reasoning.

Human-Object Interaction Detection Object +1

Paper
Add Code

Fast Fourier Intrinsic Network

no code implementations • 9 Nov 2020 • Yanlin Qian, Miaojing Shi, Joni-Kristian Kämäräinen, Jiri Matas

We address the problem of decomposing an image into albedo and shading.

Paper
Add Code

Restoring Negative Information in Few-Shot Object Detection

1 code implementation • NeurIPS 2020 • Yukuan Yang, Fangyun Wei, Miaojing Shi, Guoqi Li

In this paper, we restore the negative information in few-shot object detection by introducing a new negative- and positive-representative based metric learning framework and a new inference scheme with negative and positive representatives.

Few-Shot Learning Few-Shot Object Detection +4

Paper
Code

Towards Unsupervised Crowd Counting via Regression-Detection Bi-knowledge Transfer

no code implementations • 12 Aug 2020 • Yuting Liu, Zheng Wang, Miaojing Shi, Shin'ichi Satoh, Qijun Zhao, Hongyu Yang

We formulate the mutual transformations between the outputs of regression- and detection-based models as two scene-agnostic transformers which enable knowledge distillation between the two models.

Crowd Counting Knowledge Distillation +3

Paper
Add Code

Defending Adversarial Examples via DNN Bottleneck Reinforcement

no code implementations • 12 Aug 2020 • Wenqing Liu, Miaojing Shi, Teddy Furon, Li Li

This paper presents a DNN bottleneck reinforcement scheme to alleviate the vulnerability of Deep Neural Networks (DNN) against adversarial attacks.

Paper
Add Code

Active Crowd Counting with Limited Supervision

no code implementations • ECCV 2020 • Zhen Zhao, Miaojing Shi, Xiaoxiao Zhao, Li Li

To learn a reliable people counter from crowd images, head center annotations are normally required.

Active Learning Crowd Counting +1

Paper
Add Code

Training Object Detectors from Few Weakly-Labeled and Many Unlabeled Images

no code implementations • arXiv 2019 • Zhaohui Yang, Miaojing Shi, Chao Xu, Vittorio Ferrari, Yannis Avrithis

Weakly-supervised object detection attempts to limit the amount of supervision by dispensing the need for bounding boxes, but still assumes image-level labels on the entire training set.

Ranked #23 on Weakly Supervised Object Detection on PASCAL VOC 2012 test (using extra training data)

object-detection Weakly Supervised Object Detection

Paper
Add Code

Point in, Box out: Beyond Counting Persons in Crowds

no code implementations • CVPR 2019 • Yuting Liu, Miaojing Shi, Qijun Zhao, Xiaofang Wang

In the end, we propose a curriculum learning strategy to train the network from images of relatively accurate and easy pseudo ground truth first.

Crowd Counting regression

Paper
Add Code

Revisiting Perspective Information for Efficient Crowd Counting

no code implementations • CVPR 2019 • Miaojing Shi, Zhaohui Yang, Chao Xu, Qijun Chen

Modern crowd counting methods employ deep neural networks to estimate crowd counts via crowd density regressions.

Crowd Counting regression

Paper
Add Code

Crowd counting via scale-adaptive convolutional neural network

1 code implementation • 13 Nov 2017 • Lu Zhang, Miaojing Shi, Qiaobo Chen

The task of crowd counting is to automatically estimate the pedestrian number in crowd images.

Crowd Counting

107

Paper
Code

Weakly Supervised Object Localization Using Things and Stuff Transfer

no code implementations • ICCV 2017 • Miaojing Shi, Holger Caesar, Vittorio Ferrari

We propose to help weakly supervised object localization for classes where location annotations are not available, by transferring things and stuff knowledge from a source set with available annotations.

Multiple Instance Learning Object +2

Paper
Add Code

Weakly Supervised Object Localization Using Size Estimates

no code implementations • 15 Aug 2016 • Miaojing Shi, Vittorio Ferrari

We present a technique for weakly supervised object localization (WSOL), building on the observation that WSOL algorithms usually work better on images with bigger objects.

Object Weakly-Supervised Object Localization

Paper
Add Code

Early Burst Detection for Memory-Efficient Image Retrieval

no code implementations • CVPR 2015 • Miaojing Shi, Yannis Avrithis, Herve Jegou

Then, we show the interest of using this strategy in an asymmetrical manner, with only the database features being aggregated but not those of the query.

Image Retrieval Retrieval

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.