Unsharp Masking Layer: Injecting Prior Knowledge in Convolutional Networks for Image Classification

ICANN 2019 · Jose Carranza-Rojas, Saul Calderon-Ramirez, Adán Mora-Fallas, Michael Granados-Menani, Jordina Torrents-Barrena

Image enhancement refers to the enrichment of certain image features such as edges, boundaries, or contrast. The main objective is to process the original image so that the overall performance of visualization, classification, and segmentation tasks is considerably improved. Traditional techniques require manual fine-tuning of the parameters that control enhancement behavior. Recent Convolutional Neural Network (CNN) approaches frequently apply these techniques as an enhancement pre-processing step. In this work, we present the first intrinsic CNN pre-processing layer based on the well-known unsharp masking algorithm. The proposed layer injects prior knowledge about how to enhance the image by adding high-frequency information to the input, which emphasizes meaningful image features. The layer optimizes the unsharp masking parameters during model training, without any manual intervention. We evaluate the network performance and impact on two applications: CIFAR100 image classification and the PlantCLEF identification challenge. The results show a significant improvement over popular CNNs, with gains of 9.49% and 2.42% on PlantCLEF and the general-purpose CIFAR100, respectively. The unsharp enhancement layer boosts accuracy at negligible performance cost on simple CNN models, because the directly injected prior knowledge improves their robustness.
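As a rough illustration of the classical unsharp masking operation the abstract builds on, the sketch below adds a scaled high-frequency residual (the image minus its Gaussian blur) back to the input. It is a plain NumPy forward pass only, not the authors' layer and not the code in maeotaku/pytorch_usm. The function names and the parameters `sigma` (blur width) and `amount` (sharpening strength) are assumptions chosen for illustration; in the proposed layer, parameters of this kind would be optimized during training rather than fixed by hand.

```python
import numpy as np


def gaussian_kernel1d(sigma: float, radius: int) -> np.ndarray:
    """Return a normalized 1-D Gaussian kernel of length 2 * radius + 1."""
    offsets = np.arange(-radius, radius + 1, dtype=np.float64)
    kernel = np.exp(-(offsets ** 2) / (2.0 * sigma ** 2))
    return kernel / kernel.sum()


def gaussian_blur(image: np.ndarray, sigma: float) -> np.ndarray:
    """Blur a 2-D image with a separable Gaussian, replicating edge pixels."""
    radius = max(1, int(np.ceil(3.0 * sigma)))
    kernel = gaussian_kernel1d(sigma, radius)
    padded = np.pad(image.astype(np.float64), radius, mode="edge")
    # Filter along rows first, then along columns (separable convolution).
    rows = np.apply_along_axis(
        lambda r: np.convolve(r, kernel, mode="valid"), 1, padded
    )
    return np.apply_along_axis(
        lambda c: np.convolve(c, kernel, mode="valid"), 0, rows
    )


def unsharp_mask(image: np.ndarray, sigma: float = 1.0, amount: float = 0.5) -> np.ndarray:
    """Classical unsharp masking: image + amount * (image - blurred image).

    `sigma` and `amount` are hypothetical names for the hand-tuned values
    that a trainable layer would learn instead.
    """
    image = image.astype(np.float64)
    high_freq = image - gaussian_blur(image, sigma)  # high-frequency residual
    return image + amount * high_freq


if __name__ == "__main__":
    # A vertical step edge: sharpening overshoots on both sides of the edge.
    step = np.zeros((8, 8))
    step[:, 4:] = 1.0
    sharpened = unsharp_mask(step, sigma=1.0, amount=0.5)
    print(sharpened[0].round(3))
```

Flat regions pass through unchanged because the blur of a constant is the same constant, so only edges and fine detail are amplified. A larger `amount` strengthens the effect, and `amount = 0` returns the input as-is.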

Code

maeotaku/pytorch_usm

Tasks

General Classification

Image Classification

Image Enhancement

Scene Text Detection

Datasets

CIFAR-100

ICDAR 2013

Results from the Paper

Ranked #13 on Scene Text Detection on ICDAR 2013

Task	Dataset	Model	Metric Name	Metric Value	Global Rank
Image Classification	CIFAR-100	ResNet20+UnsharpMaskLayer	Percentage correct	60.36	#186
Scene Text Detection	ICDAR 2013	USM (COCO TS + ICDAR–2013)	F-Measure	80.40%	#13

Methods

1x1 Convolution • Average Pooling • Batch Normalization • Bottleneck Residual Block • Convolution • Global Average Pooling • Kaiming Initialization • Max Pooling • ReLU • Residual Block • Residual Connection • ResNet
