Intellectual data analysis in relational information and analytical systems

Authors

DOI:

https://doi.org/10.30837/2522-9818.2025.4.101

Keywords:

relational model, data mining, associative dependencies, database, decision tree, semantic network, information system.

Abstract

The subject of the study is the methods of intellectual analysis, namely the construction of a decision tree, associative analysis, the identification of patterns between related events based on data presented by a relational model. The purpose of the study is to analyze the features of information units and data structures, using the example of relational systems that affect the technology of knowledge extraction. Tasks: the article solves the following tasks: to consider the relational data model as the most popular and effective data structure used in intelligent information systems for data processing and storage; to analyze the operations of relational algebra, the operational component of the relational data model regarding the application of aggregate functions; to develop a general formal statement of the problem of knowledge extraction from a relational database; to consider the concept of functional associative rules; the ID3 decision tree generation algorithm focused on data processing in relational systems is analyzed. The following methods are implemented: modern view and trends in the field of data mining; features of building information systems based on relational databases, relational algebra, theory of normalization of relations; analysis of literature on the topic of research; comparative analysis. Results achieved: the relational data model is considered as the most effective data structure used in intelligent information systems for data processing and storage. A group of aggregate functions of relational databases is identified and analyzed with respect to key attributes of the relation, which makes it possible to build logical dependencies between information units of the subject area being analyzed. The task of extracting knowledge from the database is formally formulated. The concept of functional associative rules is introduced. The ID3 decision tree generation algorithm focused on data processing in relational systems is carefully analyzed. The semantic network (SN), built on the basis of the proposed approach, allows to increase the efficiency of decision support systems. Conclusions: the universal approach proposed in the article to build a relational data model of an information system for searching for associative patterns in data allows to solve a whole class of typical tasks in which objects are connected by a "many-to-many" relationship or MN. The relational database model is proposed as a universal information structure for solving associative analysis tasks and presenting knowledge in the form of a semantic network. The examples given in the article confirm the effectiveness of the developed and considered approaches to solving the problem of data mining in the environment of relational systems. Solving the problem of identifying knowledge in data will allow to improve the quality of management decisions made.

Author Biographies

Valentin Filatov, Kharkiv National University of Radio Electronics

Doctor of Sciences (Engineering), Professor, Professor of Artificial Intelligence Department

Oleh Zolotukhin, Kharkiv National University of Radio Electronics

PhD (Engineering Sciences), Associate Professor, Dean of Computer Science Faculty

Maryna Kudryavtseva, Kharkiv National University of Radio Electronics

PhD (Engineering Sciences), Associate Professor, Professor of Artificial Intelligence Department

References

References

Xuanhe, Z., Chengliang, C., Guoliang, L., Ji, S. (2022), "Database Meets Artificial Intelligence: A Survey". IEEE Transactions on Knowledge and Data Engineering, Vol. 34, No. 3, Р. 1096–1116. DOI: 10.1109/TKDE.2020.2994641

Chen, J., Sun, J., Wang, G. (2022), "From Unmanned Systems to Autonomous Intelligent Systems". Engineering, 12(5), Р. 16–19. DOI: 10.1016/j.eng.2021.10.007

Zhang, F., Yuan, N. J., Lian, D., Xie, X., Ma, W.-Y. (2016), "Collaborative knowledge base embedding for recommender systems". Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Р. 353–362. DOI: 10.1145/2939672.2939673

Avrunin, O., Vlasov, O., Filatov, V. (2020), "Model of semantic integration of information systems properties in relay database reengineering problems". Innovative Technologies and Scientific Solutions for Industries, 4 (14), Р. 5–12. DOI: 10.30837/itssi.2020.14.005

Cappuzzo, R., Papotti, P., Thirumuruganathan, S. (2020), "Creating embeddings of heterogeneous relational datasets for data integration tasks". In Proceedings of the 2020 ACM SIGMOD international conference on management of data. Р. 1335 – 1349. DOI: 10.1145/3318464.3389742

Shi, C., Li, Y., Zhang, J., Sun, Y., Yu, P. S. (2017), "A Survey of Heterogeneous Information Network Analysis". IEEE Transactions on Knowledge and Data Engineering, 29(1), Р. 17–37. DOI: 10.1109/TKDE.2016.2598561

Parciak, M., Weytjens, S., Hens, N., Neven, F., Peeters, L. M., Vansummeren, S. (2025), "Measuring approximate functional dependencies: a comparative study". The VLDB Journal, 34(4). DOI: 10.1007/s00778-025-00931-x

Filatov, V., Doskalenko, S. (2018), "On the Approach to Searching for Functional Dependences of Data in Relational Systems". Innovative Technologies and Scientific Solutions for Industries, 1 (3), Р. 54–58. DOI: 10.30837/2522-9818.2018.3.054

Filatov, V., Semenets, V., Zolotukhin, O. (2019), "Synthesis of Semantic Model of Subject Area at Integration of Relational Databases". 2019 IEEE 8th International Conference on Advanced Optoelectronics and Lasers (CAOL). DOI: 10.1109/caol46282.2019.9019532

Filatov V., Kovalenko A. (2019), "Fuzzy Systems in Data Mining Tasks". Advances in Spatio-Temporal Segmentation of Visual Data. Studies in Computational Intelligence, Vol 876. Springer, Cham P. 243–274. DOI: 10.1007/978-3-030-35480-0_6

Glava, M., Malakhov, V. (2018), "Information Systems Reengineering Approach Based on the Model of Information Systems Domains". International Journal of Software Engineering and Computer Systems (IJSECS). Vol. 4, Р. 95–105. DOI: 10.15282/ijsecs.4.1.2018.8.0041

Codd, E. F. (1983), "A relational model of data for large shared data banks". Communications of the ACM. Vol. 26, No. 1. P. 64–69. DOI: 10.1145/357980.358007

Maier, D. (1983), The theory of relational databases. London: Pitman, 637 p.

Wan, X., Han, X., Wang, J., Li, J. (2024), "Efficient Discovery of Functional Dependencies on Massive Data". IEEE Transactions on Knowledge and Data Engineering, 36(1), Р. 107–121. DOI: 10.1109/tkde.2023.3288209

Wang, Y. (2025), "Design and Implementation of a General Data Collection System Architecture Based on Relational Database Technology. In: Xu, Z., Alrabaee, S., Loyola-González, O., Ab Rahman, N.H. (eds) Cyber Security Intelligence and Analytics. CSIA 2024. Lecture Notes in Networks and Systems, Vol 1351. Springer, Cham. DOI: 10.1007/978-3-031-88287-6_53

Date, C. J. (2003), Introduction to database systems. Pearson Education, Limited. 1024 p.

Hector Garcia-Molina, Jennifer Widom, Jeffrey D. Ullman (2013), Database systems the complete book. Pearson India Education, 1139 p.

Sliusarenko, T., Filatov, V. (2023), "Relational vs non-relational databases". Grail of science. No. 23. P. 269–271. DOI: 10.36074/grail-of-science.23.12.2022.41

Filatov, V. O., Yerokhin, A. L., Zolotukhin, O. V., Kudryavtseva, M. S. (2019), "Information space model in tasks of distributed mobile objects managing". Information Extraction and Processing, 2019(47), Р. 80–86. DOI: 10.15407/vidbir2019.47.080

Downloads

Published

2025-12-28

How to Cite

Filatov, V., Zolotukhin, O., & Kudryavtseva, M. (2025). Intellectual data analysis in relational information and analytical systems. INNOVATIVE TECHNOLOGIES AND SCIENTIFIC SOLUTIONS FOR INDUSTRIES, (4(34), 101–111. https://doi.org/10.30837/2522-9818.2025.4.101