Research of the informativeness the phase data of the user voice signal of the authentication system
DOI:
https://doi.org/10.30837/pt.2018.1.05Abstract
Ways to improve the efficiency of modern voice authentication systems in various access systems are investigated in the article. One of the main ways to increase the effectiveness of authentication systems under consideration is associated with the use of voice signal phase. The object of the research is the process of digital signal processing in voice authentication systems. The scientific problem of forming and using phase data of a voice signal of an au-thentication system is solved. The purpose of the research is to evaluate the informative value of phase data of a voice signal and determine their main characteristics. The formant information on the amplitude spectrum of the experimental voice signal was processed and its qualitative and quantitative characteristics were obtained. Then, the phase data of the analyzed signal were calculated, the features of their use were revealed, and the phase spectrum was obtained on the basis of the calculation results. Processing of the phase spectrum has shown that it is easier to select at least one and a half times bigger formant of a voice signal that can be used to authenticate a user. The approximation of the maxima of formants is performed using linear and quadratic polynomials. The qualitative and quantitative evaluation of the amplitude and phase spectra formant information has confirmed the hypothe-sis that the phase data are more informative. The presented research results should be used in voice authentication systems, in improving speech recogni-tion systems, as well as in solving speaker identification problems.References
“Otchet Cisco po informacionnoj bezopasnosti za 2018 god [Cisco 2018 annual cybersecurity report].” Cisco. 2018. https://www.cisco.com/c/dam/global/ru_ru/assets/offers/assets/cisco_2018_acr_ru.pdf.
BEIGI, H. Fundamentals of Speaker Recognition. NY: Springer, 2011.
OPPENHEJM, A.V.; LIM, DZH.S. “Vazhnost' fazy pri obrabotke signalov [The importance of phase in signal processing],” TIIJeR, t. 69, № 5, p. 39–54, 1981. (in Russian).
PAVLENKO, I.S.; PASTUSHENKO, N.S.; FAJZULAEVA, O.N. “Issledovanie fazovyh harakteristik golosovogo signala pol'zovatelja sistemy autentifikacii [Jelektronnyj resurs] [The study of the phase characteristics of the voice signal of the user authentication system],” Problemi telekomunіkacіj, n.2(21), p. 52-60, 2017. http://pt.journal.kh.ua/2017/2/1/172_pavlenko_signal.pdf. (In Russian).
SOROKIN, V.N. “Raspoznavanie lichnosti po golosu: analiticheskij obzor [Voice Recognition: An Analytical Review],” Informacionnye processy, t. 12, № 1, p. 1–30, 2012. (In Russian).
BESACIER, L.; BONASTRE, J.-F. “Subband architecture for automatic speaker recognition,” Signal Process, v. 80, p. 1245–1259, 2000. DOI: https://doi.org/10.1016/S0165-1684(00)00033-5.
LU, X.; DANG, J. “An investigation of dependencies between frequency components and speaker characteristics for text-independent speaker identification,” Speech Communication, v. 50, n. 4, p. 312–322, 2007. DOI: https://doi.org/10.1016/j.specom.2007.10.005.
FANT, G. Akusticheskaja teorija recheobrazovanija. Per. s angl. [Acoustic Theory of Speech Formation]. M.: Nauka, Glavnaja redakcija fiziko-matematicheskoj literatury, 1964. (In Russian).
BENDAT, DZH.; PIRSOL, A. Prikladnoj analiz sluchajnyh dannyh: Per. s angl. [Applied analysis of random data]. M.: Mir, 1989.
Пристатейна бібліографія
Отчет Cisco по информационной безопасности за 2018 год. – 68 с – Режим доступа: https://www.cisco.com/c/dam/global/ru_ru/assets/offers/assets/cisco_2018_acr_ru.pdf.
Beigi H. Fundamentals of Speaker Recognition. – NY: Springer, 2011. – 1029 c.
Оппенхейм А.В., Лим Дж.С. Важность фазы при обработке сигналов // ТИИЭР. – 1981. – Т. 69, № 5. – С. 39–54.
Павленко И.С. Исследование фазовых характеристик голосового сигнала пользователя системы аутентификации [Электронный ресурс] / И.С. Павленко, Н.С. Пастушенко, О.Н. Файзулаева // Проблеми телекомунікацій. – 2017. – № 2 (21). – С. 52 - 60. – Режим доступа к журн.: http://pt.journal.kh.ua/2017/2/1/172_pavlenko_signal.pdf.
Сорокин В.Н. Распознавание личности по голосу: аналитический обзор // Информационные процессы. – 2012. – Т. 12, № 1. – С. 1–30.
Besacier L., Bonastre J.-F. Subband architecture for automatic speaker recognition // Signal Process. – 2000. – V. 80. – P. 1245–1259.
Lu X., Dang J. An investigation of dependencies between frequency components and speaker characteristics for text-independent speaker identification // Speech Communication. – 2007. – V. 50, N 4. – P. 312–322.
Фант Г. Акустическая теория речеобразования / Г. Фант. Пер.с англ. – М.: Наука, Главная редакция физико-математической литературы, 1964. – 284 с.
Бендат Дж., Пирсол А. Прикладной анализ случайных данных: Пер. с англ. – М.: Мир, 1989. – 540 с.
Downloads
Published
Issue
Section
License
Copyright (c) 2019 Problemi telekomunìkacìj
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Authors who publish with this journal agree to the following terms:- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).