Visible to the public Deep CAPTCHA Recognition Using Encapsulated Preprocessing and Heterogeneous Datasets

TitleDeep CAPTCHA Recognition Using Encapsulated Preprocessing and Heterogeneous Datasets
Publication TypeConference Paper
Year of Publication2022
AuthorsKimbrough, Turhan, Tian, Pu, Liao, Weixian, Blasch, Erik, Yu, Wei
Conference NameIEEE INFOCOM 2022 - IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS)
Date Publishedmay
KeywordsCAPTCHA, captchas, composability, Computer architecture, Computers, Conferences, Cybersecurity applications, Deep Learning, Human Behavior, performance evaluation, pubcrawl, Software systems, Tracking, Training
AbstractCAPTCHA (Completely Automated Public Turing test to tell Computers and Humans Apart) is an important security technique designed to deter bots from abusing software systems, which has broader applications in cyberspace. CAPTCHAs come in a variety of forms, including the deciphering of obfuscated text, transcribing of audio messages, and tracking mouse movement, among others. This paper focuses on using deep learning techniques to recognize text-based CAPTCHAs. In particular, our work focuses on generating training datasets using different CAPTCHA schemes, along with a pre-processing technique allowing for character-based recognition. We have encapsulated the CRABI (CAPTCHA Recognition with Attached Binary Images) framework to give an image multiple labels for improvement in feature extraction. Using real-world datasets, performance evaluations are conducted to validate the efficacy of our proposed approach on several neural network architectures (e.g., custom CNN architecture, VGG16, ResNet50, and MobileNet). The experimental results confirm that over 90% accuracy can be achieved on most models.
DOI10.1109/INFOCOMWKSHPS54753.2022.9798233
Citation Keykimbrough_deep_2022