Combining convolutional neural network and self-adaptive algorithm to defeat synthetic multi-digit text-based CAPTCHA Conference Paper uri icon


  • 2017 IEEE. We always use CAPTCHA(Completely Automated Public Turing test to Tell Computers and Humans Apart) to prevent automated bot for data entry. Although there are various kinds of CAPTCHAs, text-based scheme is still applied most widely, because it is one of the most convenient and user-friendly way for daily user [1]. The fact is that segmentations of different types of CAPTCHAs are not always the same, which means one of CAPTCHA's bottleneck is the segmentation. Once we could accurately split the character, the problem could be solved much easier. Unfortunately, the best way to divide them is still case by case, which is to say there is no universal way to achieve it. In this paper, we present a novel algorithm to achieve state-of-the-art performance, what was more, we also constructed a new convolutional neural network as an add-on recognition part to stabilize our state-of-the-art performance of the whole CAPTCHA system. The CAPTCHA datasets we are using is from the State Administration for Industry& Commerce of the People's Republic of China. In this datasets, there are totally 33 entrances of CAPTCHAs. In this experiments, we assume that each of the entrance is known. Results are provided showing how our algorithms work well towards these CAPTCHAs.

name of conference

  • 2017 IEEE International Conference on Industrial Technology (ICIT)

published proceedings


author list (cited authors)

  • Wang, Y. e., Huang, Y., Zheng, W. u., Zhou, Z., Liu, D., & Lu, M. i.

citation count

  • 21

complete list of authors

  • Wang, Ye||Huang, Yuanjiang||Zheng, Wu||Zhou, Zhi||Liu, Debin||Lu, Mi

publication date

  • January 2017