RADAM: Texture Recognition through Randomized Aggregated Encoding of Deep Activation Maps

Texture analysis is a classical yet challenging task in computer vision to which deep neural networks are actively being applied. Most approaches build feature-aggregation modules around a pre-trained backbone and then fine-tune the new architecture on specific texture recognition tasks. Here we propose a new method named Random encoding of Aggregated Deep Activation Maps (RADAM), which extracts rich texture representations without ever changing the backbone. The technique encodes the output at different depths of a pre-trained deep convolutional network using a Randomized Autoencoder (RAE). The RAE is trained locally for each image using a closed-form solution, and its decoder weights are used to compose a 1-dimensional texture representation that is fed into a linear SVM. This means that no fine-tuning or backpropagation is needed. We evaluate RADAM on several texture benchmarks and achieve state-of-the-art results under different computational budgets. Our results suggest that pre-trained backbones may not require additional fine-tuning for texture recognition if their learned representations are better encoded.
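To make the pipeline concrete, the sketch below shows the general recipe in NumPy: aggregate activation maps from several depths into one matrix with a row per spatial position, fit a per-image RAE whose random encoder is fixed and whose decoder has a closed-form ridge solution, and use the flattened decoder weights as the descriptor for a linear SVM. The hidden width `q`, the ridge term `lam`, the `tanh` non-linearity, and the block-average pooling used for aggregation are illustrative assumptions, not the paper's exact formulation.

```python
import numpy as np
from sklearn.svm import LinearSVC

def aggregate(maps):
    """Stack activation maps from several backbone depths into one
    (n_pixels, total_channels) matrix. Each map is (h, w, c); every
    map is block-average pooled to the smallest spatial grid."""
    hs = min(m.shape[0] for m in maps)
    ws = min(m.shape[1] for m in maps)
    pooled = []
    for m in maps:
        fh, fw = m.shape[0] // hs, m.shape[1] // ws
        m = m[:fh * hs, :fw * ws]                       # crop to a divisible size
        m = m.reshape(hs, fh, ws, fw, -1).mean(axis=(1, 3))
        pooled.append(m)
    x = np.concatenate(pooled, axis=-1)                 # (hs, ws, total_channels)
    return x.reshape(-1, x.shape[-1])                   # one row per spatial position

def rae_descriptor(X, q=16, lam=1e-3, seed=0):
    """Encode the aggregated maps X of ONE image with a Randomized
    Autoencoder and return the flattened decoder weights as a 1-D
    texture descriptor. No backpropagation is involved."""
    rng = np.random.default_rng(seed)                   # fixed seed: same random encoder for all images
    z = X.shape[1]
    P = rng.standard_normal((z, q))                     # random, untrained encoder weights
    H = np.tanh(X @ P)                                  # hidden representation
    # Closed-form ridge solution for the linear decoder W: H @ W ~= X.
    W = np.linalg.solve(H.T @ H + lam * np.eye(q), H.T @ X)  # (q, z)
    return W.ravel()                                    # descriptor of length q * z

# Hypothetical usage: `all_maps[i]` holds the activation tensors of image i,
# one per chosen backbone depth; `labels` are the class labels.
# descriptors = np.stack([rae_descriptor(aggregate(m)) for m in all_maps])
# clf = LinearSVC().fit(descriptors, labels)
```

Solving the decoder in closed form is what removes training from the loop: fitting one small ridge system per image is cheap compared with backpropagating through the backbone, and the frozen random encoder keeps descriptors from different images in a comparable space.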

Results from the Paper


 Ranked #1 on Image Classification on DTD (using extra training data)

Task                  Dataset          Model                Metric        Value  Global Rank
Image Classification  DTD              RADAM (ConvNeXt-L)   Accuracy (%)  84.0   #1
Image Classification  FMD (materials)  RADAM (ConvNeXt-L)   Accuracy (%)  95.2   #1
Image Classification  KTH-TIPS2        RADAM (ConvNeXt-XL)  Accuracy (%)  94.4   #1

Methods


AutoEncoder • RADAM • RAE • SVM