Make complex CAPTCHAs simple: A fast text captcha solver based on a small number of samples

Cited by: 14

Authors:

Wang, Yao ^{[1,2]}

Wei, Yuliang ^{[1,2]}

Zhang, Mingjin ^{[1]}

Liu, Yang ^{[1]}

Wang, Bailing ^{[2]}

Affiliations:

[1] Harbin Inst Technol, Sch Comp Sci & Technol, Weihai 264209, Peoples R China

[2] Harbin Inst Technol, Res Inst Cyberspace Secur, Harbin 150001, Peoples R China

Source:

INFORMATION SCIENCES | 2021 / Vol. 578

Funding:

National Key Research and Development Program of China;

Keywords:

Text CAPTCHAs; Deep learning; Generative adversarial networks; Vision algorithm; ATTACK; MODEL;

DOI:

10.1016/j.ins.2021.07.040

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Text-based captchas are still widely used by many websites such as Wikipedia and Microsoft despite the emergence of many alternative captchas. Recently, the design of text-based captchas has become more and more complex to resist attacks from automatic cracking programs. However, most of the existing captcha solving methods have certain shortcomings, such as insufficient accuracy, poor generalization performance, and the need for a large number of labeled samples. This study proposes a fast captcha solver that can effectively break text-based captchas with complex security features using a small amount of labeled data. The solver was achieved by constructing a captcha transformation model based on generative adversarial networks to simplify the captcha images before character segmentation and recognition. Results showed that the proposed captcha solver achieved a high success rate of over 96% character accuracy and 74% captcha accuracy for all evaluated schemes. Moreover, the average time to process a single captcha image using a laptop GPU was only 4-8 ms. The effectiveness of this work may encourage captcha designers to reconsider a more secure human-machine distinction mechanism. (c) 2021 Elsevier Inc. All rights reserved.


Pages: 181-194

Page count: 14

