DEVELOPMENT OF FACE RECOGNITION SYSTEM IN A VIDEO STREAM WITH AUGMENTED REALITY

Authors

DOI:

https://doi.org/10.24025/2306-4412.3.2020.200277

Keywords:

face recognition, augmented reality, image pre-processing, face detection, face landmarks estimation

Abstract

In the paper, the problem of face recognition in a video stream with augmented reality is considered. The current state of this problem is investigated. The general process of face recognition and the basic concepts of augmented reality have been studied. The analysis of modern approaches to solving the face recognition problem is carried out, the strengths and weaknesses of the methods used have been found. A search is carried out for a method invariant to scaling, scene changes, head turns, changes in lighting, accessories, and changes in emotions. An algorithm, architecture, and the soft-ware system that solves the problem of face recognition in a video stream with the elements of aug-mented reality have been developed. A histogram of oriented gradients (HOG) is chosen as the method for face detection; face recognition functionality is developed on the basis of the convolutional neural network architecture – ResNet34. Experimental studies are carried out, the system has been tested on both one and several faces simultaneously. Estimate methods of the recognition quality for the developed software system are determined – the plotting of ROC-curves that show the dependence of the number of false positives on the detection accuracy (true positive rate) and measuring AUC. AUC =0.95 has been achieved during recognition of one face, and AUC = 0.83 – during recognition of sev-eral faces (maximum 4). Statement of the problem. Investigation and analysis of existing approaches to building face-to-face recognition technology in augmented reality systems by analyzing models, methods and algorithms for human face recognition, identifying strengths and weaknesses of existing solutions, choosing the best combination of detection and recognition methods. Analysis of recent research and publications. Approaches have been proposed for the formation of biometric face image templates that can be used for biometric verification or face identification. However, all recent facial recognition results have been obtained through the use of deep convolutional neural networks. In the work of Yu. V. Visilter et al., a convolutional neural network with a hash forest (ZNMHL), based on a convolutional network with a hash layer, has been obtained. J. Betty et al. have investigated how different factors influence recognition quality. The purpose of the article is to prove the effectiveness of the proposed approach based on the histogram of directed gradients and convolutional neural network architecture ResNet34 for the problem of face recognition in a video stream in augmented reality systems. Presenting main material. The basic concepts of augmented reality are analyzed. The process of face recognition is described. The description of the software for the solution of the problem is given and mathematical model is developed. The algorithm of work of the program of face recognition program developed by authors is detailed. The architecture of face recognition system in augmented reality video stream is designed. Results. A software system designed to recognize human faces in augmented reality video streams has shown satisfactory results. The application correctly recognizes the face, available in the database, in different conditions of lighting, head rotation, with the presence of accessory ditches, the closure of some parts of the face, changes in emotions, etc., similarly for recognizing multiple faces at the same time. The system has been tested on 520 examples: 4 people separately and together in different combinations under different conditions of lighting, noise, interference, accessories, emotions. Conclusion. Applying a neural network to the ResNet architecture with appropriate settings for detecting and recognizing human faces in augmented reality video streams is a good choice – this method is invariant to scaling, scene changes, head turns, light changes, accessories, and emotion changes. The system is a platform for further development. In particular, it is planned to conduct experimental studies using other methods of face recognition in a video stream and to perform a comparative analysis of the results, as well as to create a more convenient graphical interface of the program and adaptation for the mobile version.

Author Biographies

О.B. Danchenko, Cherkasy State Technological University

D.Sc., professor

О.E. Іlarionov, Taras Shevchenko National University of Kyiv

Ph.D.,associateprofessor

Н.V. Krasovska, Taras Shevchenko National University of Kyiv

Ph.D.,associateprofessor

Т.S. Коrоtkоvа, Taras Shevchenko National University of Kyiv

M.Sc.

References

P. Sharma, R. N. Yadav, and K. V. Arya, "Pose-invariant face recognition using curvelet neural network", IET Biometrics, vol. 3, no. 3, pp. 128-138, Sept. 2014.

Y. Gong, S. Lazebnik, A. Gordo, and F. Perronnin, "Iterative quantization: A procrustean approach to learning binary codes for large-scale image retrieval", IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 35, iss. 12, pp. 2916-2929, 2012, doi: 10.1109/TPAMI.2012.193.

K. Grauman, and R. Fergus, "Learning binary hash codes for large-scale image search", in Machine Learning for Computer Vision, R. Cipolla, S. Battiato, and G. M. Farinella, Eds. Berlin, Heidelberg: Springer, 2013, pp. 49-87, doi: 10.1007/978-3-642-28661-2_3.

W. Wang, J. Yang, J. Xiao, S. Li, and D. Zhou, "Face recognition based on deep learning", in Human Centered Computing, Q. Zu, B. Hu, N. Gu, S. Seng, Eds. Springer, 2015, doi: 10.1007/978-3319-15554-8_73.

X. Wu, "Learning robust deep face representation", arXiv preprint, 2015. [Online]. Available: https://arxiv.org/pdf/ 1507.04844.pdf. Accessed on: April 8, 2020.

D. Chen, X. Cao, F. Wen, and J. Sun, "Blessing of dimensionality: High-dimensional feature and its efficient compression for face verification", Proc CVPR, pp. 3025-3032, 2013, doi: 10.1109/CVPR.2013.389.

H.-V. Nguyen, and L. Bai, "Cosine similarity metric learning for face verif-?cation", Proc ACCV, pp.709-720, 2010, doi: 10.1007/978-3-642-19309-5_55.

Yu. V. Vizilter, V. S. Gorbatsevich, A. V. Vorotnikov, and N. A. Kostromov, "Real time face identification using convolutional neural network and hash for-est", Kompyuternaya optika, no. 2, 2017. [Online]. Available: https://cyberleninka.ru/ article/n/identifikatsiya-lits-v-realnom-vremeni-s-ispolzovaniem-svyortochnyy-neyronnoy-seti-i-heshiruyuschego-lesa. Accessed: April 8, 2020.

Y. Sun, X. Wang, and X. Tang, "Deep learning face representation by joint identification-verification", Proc. 27th Int. Conf. on Neural Information Processing Systems, 2014, pp. 1988-1996.

J. Betty, I. Bülthoff, B. J. Mohlera, and I. M. Thornton, "Face recognition of full-bodied avatars by active observers in a virtual environment", Vision Research, vol. 157, pp. 242-251, 2019.

P. Milgram, and F. Kishino, "A taxonomy of mixed reality visual displays", IEICE Transactions on Information and Systems, vol. 77, no. 12, pp. 1321-1329, 1994. [Online]. Available: https://www.researchgate.net/publication/231514051_A_Taxonomy_of_Mixed_Reality_Visual_Displays. Accessed on: Apr. 8, 2020.

A. Nayyar, B. Mahapatra, D. Le, and G. Suseendran, "Virtual Reality (VR) & Augmented Reality (AR) technologies", Int. Journal of Engineering & Technology, vol. 7, pp.156-160 2018. [Online]. Available: https://www.researchgate.net/publication/324745910_Virtual_Reality_VR_Augmented_Reality_AR_technologies_for_tourism_and_hospitality_industry. Accessed on: Apr. 8, 2020.

K. He, X. Zhang, S. Ren, and J. Sun, "Deep residual learning for image recognition". [Online]. Available: https://arxiv.org/pdf/ 1512.03385.pdf. Accessed on: Apr. 8, 2020.

Published

2020-11-23

How to Cite

Danchenko О. ., Іlarionov О. ., Krasovska Н. ., & Коrоtkоvа Т. . (2020). DEVELOPMENT OF FACE RECOGNITION SYSTEM IN A VIDEO STREAM WITH AUGMENTED REALITY. Bulletin of Cherkasy State Technological University, (3), 75–84. https://doi.org/10.24025/2306-4412.3.2020.200277

Issue

Section

Information Technologies

URN