Deep-CAPTCHA: a deep learning based CAPTCHA solver for vulnerability assessment

15 Jun 2020 · Zahra Noury, Mahdi Rezaei

CAPTCHA is a human-centred test to distinguish a human operator from bots, attacking programs, or other computerised agents that try to imitate human intelligence. In this research, we investigate a way to crack visual CAPTCHA tests with an automated, deep-learning-based solution. The goal of this research is to investigate the weaknesses and vulnerabilities of CAPTCHA generator systems, and hence to support the development of more robust CAPTCHAs without the risks of manual trial-and-error efforts. We develop a Convolutional Neural Network called Deep-CAPTCHA to achieve this goal. The proposed platform is able to investigate both numerical and alphanumerical CAPTCHAs. To train an efficient model, we generated a dataset of 500,000 CAPTCHAs. In this paper, we present our customised deep neural network model and review the research gaps, the existing challenges, and solutions to cope with these issues. Our network achieves a high cracking accuracy of 98.94% and 98.31% on the numerical and the alphanumerical test datasets, respectively. This means more work is required to develop robust CAPTCHAs that automated artificial agents cannot crack. As the outcome of this research, we identify some efficient techniques to improve the security of CAPTCHAs, based on the performance analysis conducted on the Deep-CAPTCHA model.
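The accuracy figures in the abstract can be read as the share of test CAPTCHAs that the network solves. The following is a minimal sketch of such a metric, assuming a CAPTCHA counts as cracked only when the whole predicted string matches the label; the abstract does not state the exact criterion, and the function name `cracking_accuracy` and the sample strings are hypothetical.

```python
def cracking_accuracy(predictions, labels):
    """Return the percentage of CAPTCHAs whose predicted string equals the label.

    Assumption: a CAPTCHA is 'cracked' only on an exact whole-string match.
    """
    if len(predictions) != len(labels):
        raise ValueError("predictions and labels must have the same length")
    if not labels:
        raise ValueError("need at least one sample")
    # Count exact matches between each predicted string and its ground truth.
    correct = sum(pred == truth for pred, truth in zip(predictions, labels))
    return 100.0 * correct / len(labels)


# Hypothetical sample data: four CAPTCHAs, one of them misread in its last digit.
labels = ["48213", "A7K2Q", "90017", "ZX81B"]
predictions = ["48213", "A7K2Q", "90011", "ZX81B"]

accuracy = cracking_accuracy(predictions, labels)
print(f"{accuracy:.2f}%")
```

Under a per-character criterion the same sample would score higher, since only one character out of twenty is wrong, so the choice of criterion matters when comparing reported numbers.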


Datasets


| Task              | Dataset                                     | Model                       | Metric Name | Metric Value | Global Rank |
|-------------------|---------------------------------------------|-----------------------------|-------------|--------------|-------------|
| CAPTCHA Detection | Python alphanumerical Image-Captcha Library | Deep-CAPTCHA alphanumerical | Accuracy    | 98.3         | #1          |
| CAPTCHA Detection | Python alphanumerical Image-Captcha Library | TOD-CNN [20]                | Accuracy    | 92.37        | #6          |
| CAPTCHA Detection | Python alphanumerical Image-Captcha Library | VGG 16 [37]                 | Accuracy    | 97.5         | #3          |
| CAPTCHA Detection | Python alphanumerical Image-Captcha Library | SVHN Network * [14]         | Accuracy    | 97.84        | #2          |
| CAPTCHA Detection | Python alphanumerical Image-Captcha Library | ZF [37]                     | Accuracy    | 96.6         | #5          |
| CAPTCHA Detection | Python alphanumerical Image-Captcha Library | VGG_CNN_M_1024 [37]         | Accuracy    | 97.2         | #4          |
| CAPTCHA Detection | Python numerical Image-Captcha Library      | Deep-CAPTCHA numerical      | Accuracy    | 98.9         | #1          |
| CAPTCHA Detection | Python numerical Image-Captcha Library      | Wei et al. - CNN [36]       | Accuracy    | 98.43        | #3          |
| CAPTCHA Detection | Python numerical Image-Captcha Library      | Wei et al. - SVM [36]       | Accuracy    | 98.81        | #2          |

Methods