A novel CAPTCHA solver framework using deep skipping Convolutional Neural Networks

Shida Lu; Kai Huang; Talha Meraj; Hafiz Tayyab Rauf

doi:10.7717/peerj-cs.879

PeerJ Computer Science (Apr 2022)

A novel CAPTCHA solver framework using deep skipping Convolutional Neural Networks

Shida Lu,
Kai Huang,
Talha Meraj,
Hafiz Tayyab Rauf

Affiliations

Shida Lu: State Grid Information & Communication Company, SMEPC, Shanghai, China
Kai Huang: Shanghai Shineenergy Information Technology Development Co., Ltd., Shanghai, China
Talha Meraj: COMSATS Institute of Information Technology, Islamabad, Pakistan
Hafiz Tayyab Rauf: University of Bradford, Bradford, United Kingdom

DOI: https://doi.org/10.7717/peerj-cs.879
Journal volume & issue: Vol. 8
p. e879

Abstract

Read online Read online

A Completely Automated Public Turing Test to tell Computers and Humans Apart (CAPTCHA) is used in web systems to secure authentication purposes; it may break using Optical Character Recognition (OCR) type methods. CAPTCHA breakers make web systems highly insecure. However, several techniques to break CAPTCHA suggest CAPTCHA designers about their designed CAPTCHA’s need improvement to prevent computer vision-based malicious attacks. This research primarily used deep learning methods to break state-of-the-art CAPTCHA codes; however, the validation scheme and conventional Convolutional Neural Network (CNN) design still need more confident validation and multi-aspect covering feature schemes. Several public datasets are available of text-based CAPTCHa, including Kaggle and other dataset repositories where self-generation of CAPTCHA datasets are available. The previous studies are dataset-specific only and cannot perform well on other CAPTCHA’s. Therefore, the proposed study uses two publicly available datasets of 4- and 5-character text-based CAPTCHA images to propose a CAPTCHA solver. Furthermore, the proposed study used a skip-connection-based CNN model to solve a CAPTCHA. The proposed research employed 5-folds on data that delivers 10 different CNN models on two datasets with promising results compared to the other studies.

Published in PeerJ Computer Science

ISSN: 2376-5992 (Online)
Publisher: PeerJ Inc.
Country of publisher: United States
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://peerj.com/computer-science/

About the journal

Abstract

Keywords