Weakly-Supervised Object Localization
76 papers with code • 8 benchmarks • 3 datasets
Weakly supervised object localization (WSOL) learns to localize objects with only image-level labels, no object level labels (bonding boxes, etc.,) is needed. It is more attractive since image-level labels are much easier and cheaper to obtain.
Libraries
Use these libraries to find Weakly-Supervised Object Localization models and implementationsLatest papers with no code
Diverse Instance Discovery: Vision-Transformer for Instance-Aware Multi-Label Image Recognition
Finally, we propose a weakly supervised object localization-based approach to extract multi-scale local features, to form a multi-view pipeline.
Anti-Adversarially Manipulated Attributions for Weakly Supervised Semantic Segmentation and Object Localization
This manipulation is realized in an anti-adversarial manner, so that the original image is perturbed along pixel gradients in directions opposite to those used in an adversarial attack.
Bridging the Gap between Classification and Localization for Weakly Supervised Object Localization
Weakly supervised object localization aims to find a target object region in a given image with only weak supervision, such as image-level labels.
Learning Consistency from High-quality Pseudo-labels for Weakly Supervised Object Localization
In the second stage, we propose a simple and effective method for evaluating the confidence of pseudo-labels based on classification discrimination, and by learning consistency from high-quality pseudo-labels, we further refine the localization network to get better localization performance.
CaFT: Clustering and Filter on Tokens of Transformer for Weakly Supervised Object Localization
Therefore, we propose Clustering and Filter of Tokens (CaFT) with Vision Transformer (ViT) backbone to solve this problem in another way.
LCTR: On Awakening the Local Continuity of Transformer for Weakly Supervised Object Localization
In this paper, we propose a novel framework built upon the transformer, termed LCTR (Local Continuity TRansformer), which targets at enhancing the local perception capability of global features among long-range feature dependencies.
SSA: Semantic Structure Aware Inference for Weakly Pixel-Wise Dense Predictions without Cost
The pixel-wise dense prediction tasks based on weakly supervisions currently use Class Attention Maps (CAM) to generate pseudo masks as ground-truth.
Self-Taught Cross-Domain Few-Shot Learning with Weakly Supervised Object Localization and Task-Decomposition
However, the target domain is absolutely unknown during the training on the source domain, which results in lacking directed guidance for target tasks.
Weakly Supervised Foreground Learning for Weakly Supervised Localization and Detection
Modern deep learning models require large amounts of accurately annotated data, which is often difficult to satisfy.
Improving Few-shot Learning with Weakly-supervised Object Localization
In this work, we propose a novel framework that generates class representations by extracting features from class-relevant regions of the images.