A deep learning‐based attack on text CAPTCHAs by using object detection techniques

Jiawei Nian; Ping Wang; Haichang Gao; Xiaoyan Guo

doi:10.1049/ise2.12047

IET Information Security (Mar 2022)

A deep learning‐based attack on text CAPTCHAs by using object detection techniques

Jiawei Nian,
Ping Wang,
Haichang Gao,
Xiaoyan Guo

Affiliations

Jiawei Nian: School of Computer Science and Technology Xidian University Xi'an Shaanxi China
Ping Wang: School of Computer Science and Technology Xidian University Xi'an Shaanxi China
Haichang Gao: School of Computer Science and Technology Xidian University Xi'an Shaanxi China
Xiaoyan Guo: School of Computer Science and Technology Xidian University Xi'an Shaanxi China

DOI: https://doi.org/10.1049/ise2.12047
Journal volume & issue: Vol. 16, no. 2
pp. 97 – 110

Abstract

Read online

Abstract Text‐based CAPTCHAs have been widely deployed by many popular websites, and many have been attacked. However, most previous cracks were based on classification algorithms that typically rely on a series of preprocessing operations or on many training samples, thus making such attacks complicated and costly. In this study, a simple, generic, fast and end‐to‐end attack based on advanced object detection technologies is introduced. The proposed attack combines a feature extraction module, a character location and recognition module and a coordinate matching module. The experiments show that the attack can break a wide range of real‐world text CAPTCHAs deployed by the 50 most popular websites on Alexa.com and that the method achieves a high attack accuracy with only 2000 samples at an attack speed of less than 0.10 s. The attack was also evaluated on four click‐based CAPTCHAs that cannot be attacked in the end‐to‐end manner used by previous attacks, and the results demonstrated that within one step, the proposed approach achieves high success rates on both click‐based CAPTCHAs and schemes based on large‐scale character sets, such as Chinese character sets.

Published in IET Information Security

ISSN: 1751-8709 (Print); 1751-8717 (Online)
Publisher: Hindawi-IET
Country of publisher: United Kingdom
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Electronics: Computer engineering. Computer hardware; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://ietresearch.onlinelibrary.wiley.com/journal/ietis

About the journal

Abstract

Keywords