Development of intelligent electronic document management system model based on machine learning methods
DOI:
https://doi.org/10.15587/1729-4061.2022.251689Keywords:
electronic document management system, machine learning, multi-agent technologies, topic modelingAbstract
With the daily increase in document flow, as well as the transition to paperless document management around the world, the demand for electronic document management systems is increasing. This significantly requires optimization of these systems in terms of quality document information retrieval and document management. However, research based on statistical methods cannot effectively handle large amounts of data extracted from electronic documents. In this regard, machine learning methods can effectively solve this problem. This paper presents an approach to building a model of an intelligent document management system using machine learning techniques to ensure efficient employee performance in organizations. The authors have solved a number of problems to optimize each of the document management subsystems, resulting in the development of an intelligent document management system model, which can be effectively applied to enterprises, government and corporate institutions. The feasibility and effectiveness of the proposed model of intelligent document management system based on machine learning and multi-agent modeling of information retrieval processes provides maximum reliability and reduced time of work on documents. The obtained results show that with the help of the presented model it is possible to further develop an intelligent document management system that will allow an electronic document to qualitatively go through the whole life cycle of a document, starting from the moment of document registration and finishing with its closing, i.e. execution, which will greatly facilitate the daily work of users with large volumes of documents. At the same time, the paper considers the application of topic modeling methods and algorithms of text analysis based on a multi-agent approach, which can be used to build an intelligent document management system.
Supporting Agency
- «This research has been funded by the Science Committee of the Ministry of Education and Science of the Republic of Kazakhstan (Grant No AP08857179)»
References
- Lapshina, S. N. (2012). Architecture of Enterprise. Yekaterinburg: UrFU.
- Alpaidin, E. (2017). Machine learning: the new artificial intelligence. Moscow: Alpina Publisher, Publishing Group "Tochka", 208. Available at: https://cdn1.ozone.ru/multimedia/1017469342.pdf
- Deelman, E., Mandal, A., Jiang, M., Sakellariou, R. (2019). The role of machine learning in scientific workflows. The International Journal of High Performance Computing Applications, 33 (6), 1128–1139. doi: https://doi.org/10.1177/1094342019852127
- Obukhov, A., Krasnyanskiy, M., Nikolyukin, M. (2019). Implementation of Decision Support Subsystem in Electronic Document Systems Using Machine Learning Techniques. 2019 International Multi-Conference on Industrial Engineering and Modern Technologies (FarEastCon). doi: https://doi.org/10.1109/fareastcon.2019.8934879
- Obukhov, A., Krasnyanskiy, M., Nikolyukin, M. (2020). Algorithm of adaptation of electronic document management system based on machine learning technology. Progress in Artificial Intelligence, 9 (4), 287–303. doi: https://doi.org/10.1007/s13748-020-00214-2
- Levina, T., Rodionov, A., Farkhutdinov, R. (2020). Software module for extracting data from electronic documents. 2020 International Conference on Electrotechnical Complexes and Systems (ICOECS). doi: https://doi.org/10.1109/icoecs50468.2020.9278492
- Goodrum, H., Roberts, K., Bernstam, E. V. (2020). Automatic classification of scanned electronic health record documents. International Journal of Medical Informatics, 144, 104302. doi: https://doi.org/10.1016/j.ijmedinf.2020.104302
- Kostkina, A., Bodunkov, D., Klimov, V. (2018). Document Categorization Based on Usage of Features Reduction with Synonyms Clustering in Weak Semantic Map. Procedia Computer Science, 145, 288–292. doi: https://doi.org/10.1016/j.procs.2018.11.061
- Chemchem, A., Alin, F., Krajecki, M. (2018). Deep Learning and Data Mining Classification through the Intelligent Agent Reasoning. 2018 6th International Conference on Future Internet of Things and Cloud Workshops (FiCloudW). doi: https://doi.org/10.1109/w-ficloud.2018.00009
- Holzinger, A., Kieseberg, P., Tjoa, A. M., & Weippl, E. (Eds.) (2018). Machine Learning and Knowledge Extraction. Lecture Notes in Computer Science. Springer, 372. doi: https://doi.org/10.1007/978-3-319-99740-7
- Edinaya sistema elektronnogo dokumentooborota gosudarstvennyh organov (ESEDO). Available at: https://www.nitec.kz/index.php/post/edinaya-sistema-elektronnogo-dokumentooborota-gosudarstvennyih-organov-esedo
- Aliev, V. S., Chistov, D. V. (2011). Business planning using the Project Expert program (full course). Moscow: INFRA-M, 432.
- Eremeev, M., Vorontsov, K. (2019). Lexical quantile-based text complexity measure. Proceedings of Recent Advances in Natural Language Processing. Varna, 270–275. Available at: https://aclanthology.org/R19-1031.pdf
- Ataeva, O. M. (2016). An information model of LibMeta semantic library. Software & Systems, 4, 36–44. doi: https://doi.org/10.15827/0236-235x.116.036-044
- Semantic Web. Available at: https://www.w3.org/standards/semanticweb/
- Weitzel, D., Bockelman, B., Brown, D. A., Couvares, P., Würthwein, F., Hernandez, E. F. (2017). Data Access for LIGO on the OSG. Proceedings of the Practice and Experience in Advanced Research Computing 2017 on Sustainability, Success and Impact. doi: https://doi.org/10.1145/3093338.3093363
- Linev, A. A. (2014). Modern EDMS: From Document Management to Efficiency Management. Deloproizvodstvo, 1. Available at: https://www.top-personal.ru/officeworkissue.html?314
- Challenger, M., Tezel, B., Alaca, O., Tekinerdogan, B., Kardas, G. (2018). Development of Semantic Web-Enabled BDI Multi-Agent Systems Using SEA_ML: An Electronic Bartering Case Study. Applied Sciences, 8 (5), 688. doi: https://doi.org/10.3390/app8050688
- Jensen, A. B., Villadsen, J. (2020). GOAL-DTU: Development of Distributed Intelligence for the Multi-Agent Programming Contest. Lecture Notes in Computer Science, 79–105. doi: https://doi.org/10.1007/978-3-030-59299-8_4
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2022 Madina Sambetbayeva, Inkarzhan Kuspanova, Aigerim Yerimbetova, Sandugash Serikbayeva, Shynar Bauyrzhanova
This work is licensed under a Creative Commons Attribution 4.0 International License.
The consolidation and conditions for the transfer of copyright (identification of authorship) is carried out in the License Agreement. In particular, the authors reserve the right to the authorship of their manuscript and transfer the first publication of this work to the journal under the terms of the Creative Commons CC BY license. At the same time, they have the right to conclude on their own additional agreements concerning the non-exclusive distribution of the work in the form in which it was published by this journal, but provided that the link to the first publication of the article in this journal is preserved.
A license agreement is a document in which the author warrants that he/she owns all copyright for the work (manuscript, article, etc.).
The authors, signing the License Agreement with TECHNOLOGY CENTER PC, have all rights to the further use of their work, provided that they link to our edition in which the work was published.
According to the terms of the License Agreement, the Publisher TECHNOLOGY CENTER PC does not take away your copyrights and receives permission from the authors to use and dissemination of the publication through the world's scientific resources (own electronic resources, scientometric databases, repositories, libraries, etc.).
In the absence of a signed License Agreement or in the absence of this agreement of identifiers allowing to identify the identity of the author, the editors have no right to work with the manuscript.
It is important to remember that there is another type of agreement between authors and publishers – when copyright is transferred from the authors to the publisher. In this case, the authors lose ownership of their work and may not use it in any way.