DEVELOPMENT OF FACE RECOGNITION SYSTEM IN A VIDEO STREAM WITH AUGMENTED REALITY
DOI:
https://doi.org/10.24025/2306-4412.3.2020.200277Keywords:
face recognition, augmented reality, image pre-processing, face detection, face landmarks estimationAbstract
In the paper, the problem of face recognition in a video stream with augmented reality is considered. The current state of this problem is investigated. The general process of face recognition and the basic concepts of augmented reality have been studied. The analysis of modern approaches to solving the face recognition problem is carried out, the strengths and weaknesses of the methods used have been found. A search is carried out for a method invariant to scaling, scene changes, head turns, changes in lighting, accessories, and changes in emotions. An algorithm, architecture, and the soft-ware system that solves the problem of face recognition in a video stream with the elements of aug-mented reality have been developed. A histogram of oriented gradients (HOG) is chosen as the method for face detection; face recognition functionality is developed on the basis of the convolutional neural network architecture – ResNet34. Experimental studies are carried out, the system has been tested on both one and several faces simultaneously. Estimate methods of the recognition quality for the developed software system are determined – the plotting of ROC-curves that show the dependence of the number of false positives on the detection accuracy (true positive rate) and measuring AUC. AUC =0.95 has been achieved during recognition of one face, and AUC = 0.83 – during recognition of sev-eral faces (maximum 4). Statement of the problem. Investigation and analysis of existing approaches to building face-to-face recognition technology in augmented reality systems by analyzing models, methods and algorithms for human face recognition, identifying strengths and weaknesses of existing solutions, choosing the best combination of detection and recognition methods. Analysis of recent research and publications. Approaches have been proposed for the formation of biometric face image templates that can be used for biometric verification or face identification. However, all recent facial recognition results have been obtained through the use of deep convolutional neural networks. In the work of Yu. V. Visilter et al., a convolutional neural network with a hash forest (ZNMHL), based on a convolutional network with a hash layer, has been obtained. J. Betty et al. have investigated how different factors influence recognition quality. The purpose of the article is to prove the effectiveness of the proposed approach based on the histogram of directed gradients and convolutional neural network architecture ResNet34 for the problem of face recognition in a video stream in augmented reality systems. Presenting main material. The basic concepts of augmented reality are analyzed. The process of face recognition is described. The description of the software for the solution of the problem is given and mathematical model is developed. The algorithm of work of the program of face recognition program developed by authors is detailed. The architecture of face recognition system in augmented reality video stream is designed. Results. A software system designed to recognize human faces in augmented reality video streams has shown satisfactory results. The application correctly recognizes the face, available in the database, in different conditions of lighting, head rotation, with the presence of accessory ditches, the closure of some parts of the face, changes in emotions, etc., similarly for recognizing multiple faces at the same time. The system has been tested on 520 examples: 4 people separately and together in different combinations under different conditions of lighting, noise, interference, accessories, emotions. Conclusion. Applying a neural network to the ResNet architecture with appropriate settings for detecting and recognizing human faces in augmented reality video streams is a good choice – this method is invariant to scaling, scene changes, head turns, light changes, accessories, and emotion changes. The system is a platform for further development. In particular, it is planned to conduct experimental studies using other methods of face recognition in a video stream and to perform a comparative analysis of the results, as well as to create a more convenient graphical interface of the program and adaptation for the mobile version.References
P. Sharma, R. N. Yadav, and K. V. Arya, "Pose-invariant face recognition using curvelet neural network", IET Biometrics, vol. 3, no. 3, pp. 128-138, Sept. 2014.
Y. Gong, S. Lazebnik, A. Gordo, and F. Perronnin, "Iterative quantization: A procrustean approach to learning binary codes for large-scale image retrieval", IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 35, iss. 12, pp. 2916-2929, 2012, doi: 10.1109/TPAMI.2012.193.
K. Grauman, and R. Fergus, "Learning binary hash codes for large-scale image search", in Machine Learning for Computer Vision, R. Cipolla, S. Battiato, and G. M. Farinella, Eds. Berlin, Heidelberg: Springer, 2013, pp. 49-87, doi: 10.1007/978-3-642-28661-2_3.
W. Wang, J. Yang, J. Xiao, S. Li, and D. Zhou, "Face recognition based on deep learning", in Human Centered Computing, Q. Zu, B. Hu, N. Gu, S. Seng, Eds. Springer, 2015, doi: 10.1007/978-3319-15554-8_73.
X. Wu, "Learning robust deep face representation", arXiv preprint, 2015. [Online]. Available: https://arxiv.org/pdf/ 1507.04844.pdf. Accessed on: April 8, 2020.
D. Chen, X. Cao, F. Wen, and J. Sun, "Blessing of dimensionality: High-dimensional feature and its efficient compression for face verification", Proc CVPR, pp. 3025-3032, 2013, doi: 10.1109/CVPR.2013.389.
H.-V. Nguyen, and L. Bai, "Cosine similarity metric learning for face verif-?cation", Proc ACCV, pp.709-720, 2010, doi: 10.1007/978-3-642-19309-5_55.
Yu. V. Vizilter, V. S. Gorbatsevich, A. V. Vorotnikov, and N. A. Kostromov, "Real time face identification using convolutional neural network and hash for-est", Kompyuternaya optika, no. 2, 2017. [Online]. Available: https://cyberleninka.ru/ article/n/identifikatsiya-lits-v-realnom-vremeni-s-ispolzovaniem-svyortochnyy-neyronnoy-seti-i-heshiruyuschego-lesa. Accessed: April 8, 2020.
Y. Sun, X. Wang, and X. Tang, "Deep learning face representation by joint identification-verification", Proc. 27th Int. Conf. on Neural Information Processing Systems, 2014, pp. 1988-1996.
J. Betty, I. Bülthoff, B. J. Mohlera, and I. M. Thornton, "Face recognition of full-bodied avatars by active observers in a virtual environment", Vision Research, vol. 157, pp. 242-251, 2019.
P. Milgram, and F. Kishino, "A taxonomy of mixed reality visual displays", IEICE Transactions on Information and Systems, vol. 77, no. 12, pp. 1321-1329, 1994. [Online]. Available: https://www.researchgate.net/publication/231514051_A_Taxonomy_of_Mixed_Reality_Visual_Displays. Accessed on: Apr. 8, 2020.
A. Nayyar, B. Mahapatra, D. Le, and G. Suseendran, "Virtual Reality (VR) & Augmented Reality (AR) technologies", Int. Journal of Engineering & Technology, vol. 7, pp.156-160 2018. [Online]. Available: https://www.researchgate.net/publication/324745910_Virtual_Reality_VR_Augmented_Reality_AR_technologies_for_tourism_and_hospitality_industry. Accessed on: Apr. 8, 2020.
K. He, X. Zhang, S. Ren, and J. Sun, "Deep residual learning for image recognition". [Online]. Available: https://arxiv.org/pdf/ 1512.03385.pdf. Accessed on: Apr. 8, 2020.
Downloads
Published
How to Cite
Issue
Section
URN
License
Copyright (c) 2020 Олена Борисівна Данченко, Олег Євгенович Іларіонов, Ганна Валеріївна Красовська, Тетяна Сергіївна Короткова The authors who publish in this journal agree to the following terms:The authors reserve the right to authorship of their work and give the journal the right to first publish this work under the terms of the Creative Commons Attribution License CC BY-NC, which allows other persons to freely distribute published work with a mandatory reference to authors of the original work and the first publication of the work in this journal.
Authors have the right to conclude separate additional agreements for the non-exclusive distribution of the paper in the form in which it was published by this journal (for example, posting work in electronic repository or publishing as part of a monograph), provided that the link to the first publication in this journal is maintained.
The journal policy allows and encourages authors to post on the Internet (for example, in repositories of institutions or on personal websites) the manuscript of work, both before the submission of this manuscript to the editorial staff, and during its editorial work, as it contributes to the emergence of productive scientific discussion and positively affects the efficiency and dynamics of published work citation (see The Effect of Open Access).