TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK	REMOVE
Semi-Supervised Image Classification	CIFAR-100, 400 Labels	SemiReward	Percentage error	15.62	# 1
Semi-Supervised Image Classification	ImageNet - 1% labeled data	SemiReward	Top 1 Accuracy	59.64%	# 32

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/semireward-a-general-reward-model-for-semi/semi-supervised-image-classification-on-cifar-8)](https://paperswithcode.com/sota/semi-supervised-image-classification-on-cifar-8?p=semireward-a-general-reward-model-for-semi)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/semireward-a-general-reward-model-for-semi/semi-supervised-image-classification-on-1)](https://paperswithcode.com/sota/semi-supervised-image-classification-on-1?p=semireward-a-general-reward-model-for-semi)`

SemiReward: A General Reward Model for Semi-supervised Learning

4 Oct 2023 · Siyuan Li, Weiyang Jin, Zedong Wang, Fang Wu, Zicheng Liu, Cheng Tan, Stan Z. Li ·

Semi-supervised learning (SSL) has witnessed great progress with various improvements in the self-training framework with pseudo labeling. The main challenge is how to distinguish high-quality pseudo labels against the confirmation bias. However, existing pseudo-label selection strategies are limited to pre-defined schemes or complex hand-crafted policies specially designed for classification, failing to achieve high-quality labels, fast convergence, and task versatility simultaneously. To these ends, we propose a Semi-supervised Reward framework (SemiReward) that predicts reward scores to evaluate and filter out high-quality pseudo labels, which is pluggable to mainstream SSL methods in wide task types and scenarios. To mitigate confirmation bias, SemiReward is trained online in two stages with a generator model and subsampling strategy. With classification and regression tasks on 13 standard SSL benchmarks across three modalities, extensive experiments verify that SemiReward achieves significant performance gains and faster convergence speeds upon Pseudo Label, FlexMatch, and Free/SoftMatch. Code and models are available at https://github.com/Westlake-AI/SemiReward.

PDF Abstract

Code

Add Remove Mark official

Westlake-AI/SemiReward official

Tasks

Add Remove

Few-Shot Image Classification

Image Classification

Pseudo Label

Semi-supervised Audio Classification

Semi-Supervised Image Classification

Semi-Supervised Text Classification

Transfer Learning

Datasets

ImageNet

CIFAR-100

STL-10

AG News

EuroSAT

ESC-50

UrbanSound8K Yahoo! Answers Yelp Review Polarity

FSDnoisy18k

AgeDB

Results from the Paper

Edit

Ranked #1 on Semi-Supervised Image Classification on CIFAR-100, 400 Labels

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Result	Benchmark
Semi-Supervised Image Classification	CIFAR-100, 400 Labels	SemiReward	Percentage error	15.62	# 1		Compare
Semi-Supervised Image Classification	ImageNet - 1% labeled data	SemiReward	Top 1 Accuracy	59.64%	# 32		Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

SemiReward: A General Reward Model for Semi-supervised Learning

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove