TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Malware Classification	Malimg Dataset	Gray-scale IMG CNN	Accuracy (10-fold)	0.9848	# 1
Malware Classification	Malimg Dataset	Gray-scale IMG CNN	Macro F1 (10-fold)	0.9580	# 1
Malware Classification	Microsoft Malware Classification Challenge	LBP features + XGBoost	Accuracy (5-fold)	0.951	# 5
Malware Classification	Microsoft Malware Classification Challenge	Haralick features + XGBoost	Accuracy (5-fold)	0.9550	# 4
Malware Classification	Microsoft Malware Classification Challenge	Gray-scale IMG CNN	Accuracy (10-fold)	0.9750	# 16
Malware Classification	Microsoft Malware Classification Challenge	Gray-scale IMG CNN	LogLoss	0.184483	# 11
Malware Classification	Microsoft Malware Classification Challenge	Gray-scale IMG CNN	Macro F1 (10-fold)	0.9400	# 15
Malware Classification	Microsoft Malware Classification Challenge	Gray-scale IMG CNN	Accuracy (5-fold)	0.973	# 3

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/using-convolutional-neural-networks-for-1/malware-classification-on-malimg-dataset)](https://paperswithcode.com/sota/malware-classification-on-malimg-dataset?p=using-convolutional-neural-networks-for-1)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/using-convolutional-neural-networks-for-1/malware-classification-on-microsoft-malware)](https://paperswithcode.com/sota/malware-classification-on-microsoft-malware?p=using-convolutional-neural-networks-for-1)`

Using Convolutional Neural Networks for Classification of Malware represented as Images

27 Aug 2018 · Daniel Gibert, Carles Mateu, Jordi Planes & Ramon Vicens ·

The number of malicious files detected every year are counted by millions. One of the main reasons for these high volumes of different files is the fact that, in order to evade detection, malware authors add mutation. This means that malicious files belonging to the same family, with the same malicious behavior, are constantly modified or obfuscated using several techniques, in such a way that they look like different files. In order to be effective in analyzing and classifying such large amounts of files, we need to be able to categorize them into groups and identify their respective families on the basis of their behavior. In this paper, malicious software is visualized as gray scale images since its ability to capture minor changes while retaining the global structure helps to detect variations. Motivated by the visual similarity between malware samples of the same family, we propose a file agnostic deep learning approach for malware categorization to efficiently group malicious software into families based on a set of discriminant patterns extracted from their visualization as images. The suitability of our approach is evaluated against two benchmarks: the MalImg dataset and the Microsoft Malware Classification Challenge dataset. Experimental comparison demonstrates its superior performance with respect to state-of-the-art techniques.

PDF

Code

Add Remove Mark official

danielgibert/mlw_classification_cnn… official

Tasks

Add Remove

General Classification

Malware Classification

Datasets

Microsoft Malware Classification Challenge

160_subset

Malimg

Results from the Paper

Add Remove

Ranked #1 on Malware Classification on Malimg Dataset

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Malware Classification	Malimg Dataset	Gray-scale IMG CNN	Accuracy (10-fold)	0.9848	# 1	Compare
Malware Classification	Malimg Dataset	Gray-scale IMG CNN	Macro F1 (10-fold)	0.9580	# 1	Compare
Malware Classification	Microsoft Malware Classification Challenge	LBP features + XGBoost	Accuracy (5-fold)	0.951	# 5	Compare
Malware Classification	Microsoft Malware Classification Challenge	Haralick features + XGBoost	Accuracy (5-fold)	0.9550	# 4	Compare
Malware Classification	Microsoft Malware Classification Challenge	Gray-scale IMG CNN	Accuracy (10-fold)	0.9750	# 16	Compare
			LogLoss	0.184483	# 11	Compare
			Macro F1 (10-fold)	0.9400	# 15	Compare
			Accuracy (5-fold)	0.973	# 3	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Using Convolutional Neural Networks for Classification of Malware represented as Images

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove