Influence of phase information on voice pre-processing signal in the authentication system
DOI:
https://doi.org/10.30837/pt.2024.1.04Abstract
The article analyzes and investigates directions for improving the quality characteristics of voice authentication systems in various access systems by improving the procedures for pre-processing registration materials. One of the main ways of improving the quality characteristics of user authentication systems, which was studied in the work, is the use of phase information of the voice signal. The actual scientific task of researching new procedures for pre-processing the voice signal of the user of the authentication system is being solved. The purpose of this work is to develop additional preprocessing procedures to reduce noise in voice signals of the authentication system. Refinement of pre-processing procedures was carried out based on the use of phase data of the voice signal. The results are obtained in the process of statistical analysis of simulation results using experimental model data of the authentication system. The phase space of the voice signal allows you to expand the possibilities of pre-processing due to the use of a priori information about the nature of changes in phase data. The scientific novelty of the obtained results lies in the fact that for the first time, a technique was developed, and experimental studies were carried out for the pre-processing of the user’s voice signal using the space of phase data. The practical significance of the obtained results is as follows: the phase information approximation interval was selected taking into account a priori data on the nature of its changes; an original linear approximation of phase data containing one harmonic of a voice signal is proposed; a mechanism for determining two harmonics in the phase data of a voice signal when using the proposed linear approximation is developed; the conducted experimental studies allow to develop a mechanism for compensation of random errors in registration materials. The presented research results are advisable for use in voice authentication systems, improvement of speech recognition systems, and solving speaker identification tasks.
Downloads
Published
Issue
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Authors who publish with this journal agree to the following terms:- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).