Research of the informativeness the phase data of the user voice signal of the authentication system

Authors

  • Mykola Savelijovych Pastushenko Kharkiv National University of Radio Electronics, Ukraine https://orcid.org/0000-0003-2664-1167
  • Vivaldo Gomes Montejro Pedro Kharkiv National University of Radio Electronics,
  • Olha Mykolaivna Faizulaieva Kharkiv National University of Radio Electronics,

DOI:

https://doi.org/10.30837/pt.2018.1.05

Abstract

Ways to improve the efficiency of modern voice authentication systems in various access systems are investigated in the article. One of the main ways to increase the effectiveness of authentication systems under consideration is associated with the use of voice signal phase. The object of the research is the process of digital signal processing in voice authentication systems. The scientific problem of forming and using phase data of a voice signal of an au-thentication system is solved. The purpose of the research is to evaluate the informative value of phase data of a voice signal and determine their main characteristics. The formant information on the amplitude spectrum of the experimental voice signal was processed and its qualitative and quantitative characteristics were obtained. Then, the phase data of the analyzed signal were calculated, the features of their use were revealed, and the phase spectrum was obtained on the basis of the calculation results. Processing of the phase spectrum has shown that it is easier to select at least one and a half times bigger formant of a voice signal that can be used to authenticate a user. The approximation of the maxima of formants is performed using linear and quadratic polynomials. The qualitative and quantitative evaluation of the amplitude and phase spectra formant information has confirmed the hypothe-sis that the phase data are more informative. The presented research results should be used in voice authentication systems, in improving speech recogni-tion systems, as well as in solving speaker identification problems.

References

“Otchet Cisco po informacionnoj bezopasnosti za 2018 god [Cisco 2018 annual cybersecurity report].” Cisco. 2018. https://www.cisco.com/c/dam/global/ru_ru/assets/offers/assets/cisco_2018_acr_ru.pdf.

BEIGI, H. Fundamentals of Speaker Recognition. NY: Springer, 2011.

OPPENHEJM, A.V.; LIM, DZH.S. “Vazhnost' fazy pri obrabotke signalov [The importance of phase in signal processing],” TIIJeR, t. 69, № 5, p. 39–54, 1981. (in Russian).

PAVLENKO, I.S.; PASTUSHENKO, N.S.; FAJZULAEVA, O.N. “Issledovanie fazovyh harakteristik golosovogo signala pol'zovatelja sistemy autentifikacii [Jelektronnyj resurs] [The study of the phase characteristics of the voice signal of the user authentication system],” Problemi telekomunіkacіj, n.2(21), p. 52-60, 2017. http://pt.journal.kh.ua/2017/2/1/172_pavlenko_signal.pdf. (In Russian).

SOROKIN, V.N. “Raspoznavanie lichnosti po golosu: analiticheskij obzor [Voice Recognition: An Analytical Review],” Informacionnye processy, t. 12, № 1, p. 1–30, 2012. (In Russian).

BESACIER, L.; BONASTRE, J.-F. “Subband architecture for automatic speaker recognition,” Signal Process, v. 80, p. 1245–1259, 2000. DOI: https://doi.org/10.1016/S0165-1684(00)00033-5.

LU, X.; DANG, J. “An investigation of dependencies between frequency components and speaker characteristics for text-independent speaker identification,” Speech Communication, v. 50, n. 4, p. 312–322, 2007. DOI: https://doi.org/10.1016/j.specom.2007.10.005.

FANT, G. Akusticheskaja teorija recheobrazovanija. Per. s angl. [Acoustic Theory of Speech Formation]. M.: Nauka, Glavnaja redakcija fiziko-matematicheskoj literatury, 1964. (In Russian).

BENDAT, DZH.; PIRSOL, A. Prikladnoj analiz sluchajnyh dannyh: Per. s angl. [Applied analysis of random data]. M.: Mir, 1989.

Пристатейна бібліографія

Отчет Cisco по информационной безопасности за 2018 год. – 68 с – Режим доступа: https://www.cisco.com/c/dam/global/ru_ru/assets/offers/assets/cisco_2018_acr_ru.pdf.

Beigi H. Fundamentals of Speaker Recognition. – NY: Springer, 2011. – 1029 c.

Оппенхейм А.В., Лим Дж.С. Важность фазы при обработке сигналов // ТИИЭР. – 1981. – Т. 69, № 5. – С. 39–54.

Павленко И.С. Исследование фазовых характеристик голосового сигнала пользователя системы аутентификации [Электронный ресурс] / И.С. Павленко, Н.С. Пастушенко, О.Н. Файзулаева // Проблеми телекомунікацій. – 2017. – № 2 (21). – С. 52 - 60. – Режим доступа к журн.: http://pt.journal.kh.ua/2017/2/1/172_pavlenko_signal.pdf.

Сорокин В.Н. Распознавание личности по голосу: аналитический обзор // Информационные процессы. – 2012. – Т. 12, № 1. – С. 1–30.

Besacier L., Bonastre J.-F. Subband architecture for automatic speaker recognition // Signal Process. – 2000. – V. 80. – P. 1245–1259.

Lu X., Dang J. An investigation of dependencies between frequency components and speaker characteristics for text-independent speaker identification // Speech Communication. – 2007. – V. 50, N 4. – P. 312–322.

Фант Г. Акустическая теория речеобразования / Г. Фант. Пер.с англ. – М.: Наука, Главная редакция физико-математической литературы, 1964. – 284 с.

Бендат Дж., Пирсол А. Прикладной анализ случайных данных: Пер. с англ. – М.: Мир, 1989. – 540 с.

Published

2018-12-11

Issue

Section

Articles