Intellectual data analysis in relational information and analytical systems
DOI:
https://doi.org/10.30837/2522-9818.2025.4.101Keywords:
relational model, data mining, associative dependencies, database, decision tree, semantic network, information system.Abstract
The subject of the study is the methods of intellectual analysis, namely the construction of a decision tree, associative analysis, the identification of patterns between related events based on data presented by a relational model. The purpose of the study is to analyze the features of information units and data structures, using the example of relational systems that affect the technology of knowledge extraction. Tasks: the article solves the following tasks: to consider the relational data model as the most popular and effective data structure used in intelligent information systems for data processing and storage; to analyze the operations of relational algebra, the operational component of the relational data model regarding the application of aggregate functions; to develop a general formal statement of the problem of knowledge extraction from a relational database; to consider the concept of functional associative rules; the ID3 decision tree generation algorithm focused on data processing in relational systems is analyzed. The following methods are implemented: modern view and trends in the field of data mining; features of building information systems based on relational databases, relational algebra, theory of normalization of relations; analysis of literature on the topic of research; comparative analysis. Results achieved: the relational data model is considered as the most effective data structure used in intelligent information systems for data processing and storage. A group of aggregate functions of relational databases is identified and analyzed with respect to key attributes of the relation, which makes it possible to build logical dependencies between information units of the subject area being analyzed. The task of extracting knowledge from the database is formally formulated. The concept of functional associative rules is introduced. The ID3 decision tree generation algorithm focused on data processing in relational systems is carefully analyzed. The semantic network (SN), built on the basis of the proposed approach, allows to increase the efficiency of decision support systems. Conclusions: the universal approach proposed in the article to build a relational data model of an information system for searching for associative patterns in data allows to solve a whole class of typical tasks in which objects are connected by a "many-to-many" relationship or M →N. The relational database model is proposed as a universal information structure for solving associative analysis tasks and presenting knowledge in the form of a semantic network. The examples given in the article confirm the effectiveness of the developed and considered approaches to solving the problem of data mining in the environment of relational systems. Solving the problem of identifying knowledge in data will allow to improve the quality of management decisions made.
References
References
Xuanhe, Z., Chengliang, C., Guoliang, L., Ji, S. (2022), "Database Meets Artificial Intelligence: A Survey". IEEE Transactions on Knowledge and Data Engineering, Vol. 34, No. 3, Р. 1096–1116. DOI: 10.1109/TKDE.2020.2994641
Chen, J., Sun, J., Wang, G. (2022), "From Unmanned Systems to Autonomous Intelligent Systems". Engineering, 12(5), Р. 16–19. DOI: 10.1016/j.eng.2021.10.007
Zhang, F., Yuan, N. J., Lian, D., Xie, X., Ma, W.-Y. (2016), "Collaborative knowledge base embedding for recommender systems". Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Р. 353–362. DOI: 10.1145/2939672.2939673
Avrunin, O., Vlasov, O., Filatov, V. (2020), "Model of semantic integration of information systems properties in relay database reengineering problems". Innovative Technologies and Scientific Solutions for Industries, 4 (14), Р. 5–12. DOI: 10.30837/itssi.2020.14.005
Cappuzzo, R., Papotti, P., Thirumuruganathan, S. (2020), "Creating embeddings of heterogeneous relational datasets for data integration tasks". In Proceedings of the 2020 ACM SIGMOD international conference on management of data. Р. 1335 – 1349. DOI: 10.1145/3318464.3389742
Shi, C., Li, Y., Zhang, J., Sun, Y., Yu, P. S. (2017), "A Survey of Heterogeneous Information Network Analysis". IEEE Transactions on Knowledge and Data Engineering, 29(1), Р. 17–37. DOI: 10.1109/TKDE.2016.2598561
Parciak, M., Weytjens, S., Hens, N., Neven, F., Peeters, L. M., Vansummeren, S. (2025), "Measuring approximate functional dependencies: a comparative study". The VLDB Journal, 34(4). DOI: 10.1007/s00778-025-00931-x
Filatov, V., Doskalenko, S. (2018), "On the Approach to Searching for Functional Dependences of Data in Relational Systems". Innovative Technologies and Scientific Solutions for Industries, 1 (3), Р. 54–58. DOI: 10.30837/2522-9818.2018.3.054
Filatov, V., Semenets, V., Zolotukhin, O. (2019), "Synthesis of Semantic Model of Subject Area at Integration of Relational Databases". 2019 IEEE 8th International Conference on Advanced Optoelectronics and Lasers (CAOL). DOI: 10.1109/caol46282.2019.9019532
Filatov V., Kovalenko A. (2019), "Fuzzy Systems in Data Mining Tasks". Advances in Spatio-Temporal Segmentation of Visual Data. Studies in Computational Intelligence, Vol 876. Springer, Cham P. 243–274. DOI: 10.1007/978-3-030-35480-0_6
Glava, M., Malakhov, V. (2018), "Information Systems Reengineering Approach Based on the Model of Information Systems Domains". International Journal of Software Engineering and Computer Systems (IJSECS). Vol. 4, Р. 95–105. DOI: 10.15282/ijsecs.4.1.2018.8.0041
Codd, E. F. (1983), "A relational model of data for large shared data banks". Communications of the ACM. Vol. 26, No. 1. P. 64–69. DOI: 10.1145/357980.358007
Maier, D. (1983), The theory of relational databases. London: Pitman, 637 p.
Wan, X., Han, X., Wang, J., Li, J. (2024), "Efficient Discovery of Functional Dependencies on Massive Data". IEEE Transactions on Knowledge and Data Engineering, 36(1), Р. 107–121. DOI: 10.1109/tkde.2023.3288209
Wang, Y. (2025), "Design and Implementation of a General Data Collection System Architecture Based on Relational Database Technology. In: Xu, Z., Alrabaee, S., Loyola-González, O., Ab Rahman, N.H. (eds) Cyber Security Intelligence and Analytics. CSIA 2024. Lecture Notes in Networks and Systems, Vol 1351. Springer, Cham. DOI: 10.1007/978-3-031-88287-6_53
Date, C. J. (2003), Introduction to database systems. Pearson Education, Limited. 1024 p.
Hector Garcia-Molina, Jennifer Widom, Jeffrey D. Ullman (2013), Database systems the complete book. Pearson India Education, 1139 p.
Sliusarenko, T., Filatov, V. (2023), "Relational vs non-relational databases". Grail of science. No. 23. P. 269–271. DOI: 10.36074/grail-of-science.23.12.2022.41
Filatov, V. O., Yerokhin, A. L., Zolotukhin, O. V., Kudryavtseva, M. S. (2019), "Information space model in tasks of distributed mobile objects managing". Information Extraction and Processing, 2019(47), Р. 80–86. DOI: 10.15407/vidbir2019.47.080
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Our journal abides by the Creative Commons copyright rights and permissions for open access journals.
Authors who publish with this journal agree to the following terms:
Authors hold the copyright without restrictions and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License (CC BY-NC-SA 4.0) that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
Authors are able to enter into separate, additional contractual arrangements for the non-commercial and non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
Authors are permitted and encouraged to post their published work online (e.g., in institutional repositories or on their website) as it can lead to productive exchanges, as well as earlier and greater citation of published work.












