Development of a system for graphic captcha systems recognition using competing cellular automata
DOI:
https://doi.org/10.15587/1729-4061.2018.148307
Keywords:
competing cellular automaton, movable cellular automaton, captcha systems
Abstract
The use of competing cellular automata for recognizing complex captcha systems has been explored. For this purpose, the concept of competing cellular automata has been introduced, and a mathematical model of their functioning and interaction has been developed. The model, which is based on set theory, specifies moving cellular automata that shift to the neighboring states of characters and thereby implement their transition rules. Based on this model, a captcha image recognition system has been implemented using JavaFX 2.0 technology, which made it cross-platform and ensured correct functioning on different operating systems.
Libraries of cellular automata have been developed for the English alphabet. Each symbol of the alphabet is represented as a system of states and is matched with a cellular automaton whose states describe that symbol.
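The idea of one automaton per symbol, with the automata competing for the same image fragment, can be illustrated with a minimal Java sketch. It is not the authors' model: the class and method names, the three tiny glyph templates, and the overlap score (shared ink cells divided by the union of ink cells) are all assumptions chosen for brevity, and the automata here stay at a fixed position instead of moving across the image.

```java
import java.util.Arrays;
import java.util.List;

/** Illustrative sketch only; names, glyphs, and the scoring rule are assumptions. */
class CompetingAutomataSketch {

    /** One automaton per symbol; its states are the ink cells ('#') of a glyph template. */
    static final class SymbolAutomaton {
        final char symbol;
        final String[] glyph;

        SymbolAutomaton(char symbol, String... glyph) {
            this.symbol = symbol;
            this.glyph = glyph;
        }

        /**
         * Compares the template with the image inside the template-sized window
         * anchored at the top-left corner. Returns shared ink cells divided by
         * the union of ink cells (0.0 when both are empty).
         */
        double score(String[] image) {
            int both = 0;
            int either = 0;
            for (int r = 0; r < glyph.length; r++) {
                for (int c = 0; c < glyph[r].length(); c++) {
                    boolean expected = glyph[r].charAt(c) == '#';
                    boolean observed = r < image.length
                            && c < image[r].length()
                            && image[r].charAt(c) == '#';
                    if (expected && observed) both++;
                    if (expected || observed) either++;
                }
            }
            return either == 0 ? 0.0 : (double) both / either;
        }
    }

    /** Lets every automaton score the image; the highest score wins the competition. */
    static char recognize(List<SymbolAutomaton> library, String[] image) {
        char best = '?';
        double bestScore = -1.0;
        for (SymbolAutomaton automaton : library) {
            double s = automaton.score(image);
            if (s > bestScore) {
                bestScore = s;
                best = automaton.symbol;
            }
        }
        return best;
    }

    /** A hypothetical three-symbol library on a 3x5 grid. */
    static List<SymbolAutomaton> demoLibrary() {
        return Arrays.asList(
                new SymbolAutomaton('L', "#..", "#..", "#..", "#..", "###"),
                new SymbolAutomaton('T', "###", ".#.", ".#.", ".#.", ".#."),
                new SymbolAutomaton('I', ".#.", ".#.", ".#.", ".#.", ".#."));
    }

    public static void main(String[] args) {
        // An 'L' with two ink cells removed to imitate distortion.
        String[] distortedL = {"#..", "#..", "...", "#..", "##."};
        System.out.println(recognize(demoLibrary(), distortedL)); // prints L
    }
}
```

Because each automaton scores the image independently, the loop in `recognize` could be run in parallel without changing the result, which matches the parallelization advantage claimed for the method.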
The system was developed in the Java programming language, with the OpenCV library used for image handling, which made it possible to achieve high-quality recognition results. The architecture of the developed system for recognizing complex captcha images is presented as class diagrams of the main blocks, with a detailed description of each class. Computer experiments have been carried out with different sets of distorted characters used in actual captcha systems, and recognition quality indices of the developed software have been obtained.
It has been shown that the probability of obtaining the correct result of captcha image recognition exceeds 80 % with a degree of deformation of characters up to 20 %. With a degree of deformation of characters over 30 %, there is a high probability of false character recognition.
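A recognition rate of this kind can be computed by grouping experiment trials by deformation level and counting fully correct answers. The following Java sketch shows one way to do that; the `Trial` record layout, the method name, and the four sample trials are hypothetical and do not reproduce the paper's data or results.

```java
import java.util.Map;
import java.util.TreeMap;

/** Illustrative sketch only; the trial format and sample data are assumptions. */
class RecognitionRateSketch {

    /** One experiment: deformation level in percent, expected text, recognized text. */
    static final class Trial {
        final int deformationPercent;
        final String expected;
        final String recognized;

        Trial(int deformationPercent, String expected, String recognized) {
            this.deformationPercent = deformationPercent;
            this.expected = expected;
            this.recognized = recognized;
        }
    }

    /** Returns, for each deformation level, the share of fully correct recognitions. */
    static Map<Integer, Double> rateByDeformation(Trial[] trials) {
        Map<Integer, int[]> counts = new TreeMap<>(); // level -> {correct, total}
        for (Trial t : trials) {
            int[] c = counts.computeIfAbsent(t.deformationPercent, k -> new int[2]);
            if (t.expected.equals(t.recognized)) c[0]++;
            c[1]++;
        }
        Map<Integer, Double> rates = new TreeMap<>();
        for (Map.Entry<Integer, int[]> e : counts.entrySet()) {
            rates.put(e.getKey(), (double) e.getValue()[0] / e.getValue()[1]);
        }
        return rates;
    }

    public static void main(String[] args) {
        // Made-up trials for illustration, not measurements from the paper.
        Trial[] trials = {
            new Trial(10, "AB12", "AB12"),
            new Trial(10, "XK7Q", "XK7Q"),
            new Trial(30, "AB12", "A812"),
            new Trial(30, "XK7Q", "XK7Q")
        };
        System.out.println(rateByDeformation(trials)); // prints {10=1.0, 30=0.5}
    }
}
```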
The advantages of the method of text character recognition based on competing cellular automata include simplicity of rules of engagement, ability to parallelize the process of recognition easily, capability of recognition of distorted and partially overlapping characters that are the basis of modern captcha systems.
License
Copyright (c) 2018 Ivan Myroniv, Viktoriia Zhebka, Sergey Ostapov, Oleksander Val
This work is licensed under a Creative Commons Attribution 4.0 International License.