INFORMATION TECHNOLOGY FOR DETERMINING USEFUL DATA WHILE OPTIMIZING THE STRUCTURE AND MINIMIZING THE VOLUME OF THE DISTRIBUTED DATABASE NODE
DOI:
https://doi.org/10.24025/2306-4412.4.2019.184808Keywords:
distributed transaction, database management system, distributed database, distributed SQL-query, data replication, text parsing, parse tree, profiling, ANTLR, multidimensional analysis, classification task, neural network, data mining.Abstract
The paper deals with the tendency to move from "universal" accounting systems to specialized solutions usage. This requires the synchronization of distributed database data. It is noted that among the strategies of data distribution between distributed database nodes, the combined one is the most justified, but the main disadvantage consists in the existence of distributed transactions when handling data. The research aims to improve the general availability of data in the separate node of the distributed database and the efficiency of using software systems to work with database data by reducing the number of distributed requests. The goal is achieved by optimizing the structure of the distributed database node and minimizing the amount of data stored in it. To achieve the goal, users' query accounting subsystem and T-SQL grammar have been created, and SQL query code has been parsed. As a result, the queries are classified by the list of database tables that are found in the query, and, after performing more deyailed analysis, by the list of attributes and relation tuples. The last one is achieved by executing a set of queries with getting the primary key of each relation included in the query. Performing the complete analysis of the database tables attributes and tuples estimation is a very resource-intensive operation, so it cannot be performed every time the database data is changed. The research proposes to solve the problem of classification of new data by using the perceptron, which learns on the basis of pre-evaluated data based on SQL query parsing. Also, according to the need of performing the analysis of received data from the point of view of multiple dimensions, as well as probably their large amount, the data required for the analysis has been presented in the form of a multidimensional model.
References
N. Kozliuk, and S. Ugrimova, Warehouse accounting in trade enterprises. Feniks, 2005 [in Russian].
Trading, warehouse and CRM in the cloud. [Online]. Available: https://www.moysklad.ru
M. L. Dvoretskyi, S. V. Dvoretska, and S. Yu. Borovlova, " Web-based warehouse accounting application in non-automated outlets", Naukovi pratsi Chornomor. nats. un-tu im. Petra Mohyly kompleksu «Kyievo-Mohylianska akademiia». Seriia: Kompiuterni tekhnolohii: sci.-method. journ. Mykolaiv: Vyd-vo ChNU im. P. Mohyly, iss. 308, vol. 320, pp. 45-52, 2018 [in Ukrainian].
1C: Enterprise 8. Management of a trade enterprise for Ukraine. [Online]. Available: http://rarus.com.ua/torgovyy-i-skladskoy-uchet/1S_Predpriyatie_8_Upravlenie_torgovym_predpriyatiem_dlya_Ukrainy_
M. Kuznetsov, and I. Symdianov, MySQL 5. St. Petersburg: BKhV-Peterburg, 2010 [in Russian].
D. Petkovych, Microsoft SQL Server 2008: beginner's guide, transl. from Engl. St. Petersburg: BKhV-Peterburg, 2009 [in Russian].
T. Konnolli, and K. Begg, Databases. De-sign, implementation and maintenance. Theory and practice, 3rd ed., transl. from Engl. Moscow: Wiliams, 2003 [in Russian]
M. Tamer Özsu, and Patrick Valduriez. Principles of distributed database systems, 3rd ed. Springler, 2011.
Automatic synchronization of distributed databases in split mode. [Online]. Available: http://stimul.kiev.ua/materialy.htm?a=avtom ati-cheskaya_sinkhronizatsiya_raspredelennykh_baz_dannykh_v_razdelennom_rezh
D. D. Ulman, D. Uidom, and G. Garsya-Molina. Database systems: full course. Moscow: Wiliams, 2004 [in Russian].
Transaction Management (Database Engine component) [Online]. Available: https://technet.microsoft.com/ru-ru/library/ ms175523(v=sql.105).aspx
Rebekka M. Raiordan, Relational database fundamentals. Moscow: Russkaia Redakt-syia, 2001 [in Russian].
M. L. Dvoretskyi, "Design and evaluation of data warehouse structure and multidi-mensional database structure", Naukovi pratsi Chornomor. nats. un-tu im. Petra Mohyly kompleksu «Kyievo-Mohylianska akademiia». Seriia: Kompiuterni tekhnolo-hii: sci.-method. journ. Mykolaiv: Vyd-vo MDHU im. P. Mohyly, iss. 77, vol. 90, pp. 52-60, 2008 [in Ukrainian].
The use of synchronous and asynchronous database operations. [Online]. Available: http://help.adobe.com/ru_RU/as3/dev/WS5b3cc c516d4fbf351e63e3d118666ade46-7d39.html
M. L. Dvoretskyi, Ye. O. Davydenko, and S. Yu. Borovlova, "Design of the structure of a distributed database based on SQL query parsing". Naukovi pratsi Chornomor. nats. un-tu im. Petra Mohyly kompleksu «Kyievo-Mohylianska akademiia». Seriia: Kompiuterni tekhnolohii: sci.-method. journ. Mykolaiv: Vyd-vo MDHU im. P. Mohyly, iss. 275, vol. 287, pp. 53-61, 2016 [in Ukrainian].
M. Fisun, M. Dvoretskyi, A. Shved, and Y. Davydenko, "Query parsing in order to optimize distributed DB structure," in 9th IEEE Int. Conf. Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications (IDAACS), Buchar-est, 2017, pp. 172-178.doi: 10.1109/IDAACS.2017.8095071
What is an artificial neural network? Here’s everything you need to know. [Online]. Available: https://www.digitaltrends.com/ cool-tech/what-is-an-artificial-neural-network/
M. T. Fisun, M. L. Dvoretskyi, and A. V. Yukhatov, "Comparative analysis of methods for building olap systems using MS SQL SERVER and ORACLE", Naukovi pratsi Chornomor. nats. un-tu im. Petra Mohyly kompleksu «Kyievo-Mohylianska akademiia». Seriia: Kompiuterni tekhnolo-hii: sci.-method. journ. Mykolaiv: Vyd-vo MDHU im. P. Mohyly, iss. 271, vol. 283, pp. 36-42, 2016 [in Ukrainian].
Downloads
Published
How to Cite
Issue
Section
URN
License
Copyright (c) 2020 Михайло Леонідович Дворецький, Світлана Володимирівна Дворецька, Євген Олександрович Давиденко The authors who publish in this journal agree to the following terms:The authors reserve the right to authorship of their work and give the journal the right to first publish this work under the terms of the Creative Commons Attribution License CC BY-NC, which allows other persons to freely distribute published work with a mandatory reference to authors of the original work and the first publication of the work in this journal.
Authors have the right to conclude separate additional agreements for the non-exclusive distribution of the paper in the form in which it was published by this journal (for example, posting work in electronic repository or publishing as part of a monograph), provided that the link to the first publication in this journal is maintained.
The journal policy allows and encourages authors to post on the Internet (for example, in repositories of institutions or on personal websites) the manuscript of work, both before the submission of this manuscript to the editorial staff, and during its editorial work, as it contributes to the emergence of productive scientific discussion and positively affects the efficiency and dynamics of published work citation (see The Effect of Open Access).