ONTOLOGY-DRIVEN MEANS FOR PROCESSING AND PRESENTATION OF LARGE ARRAYS OF UNSTRUCTURED TEXTS
DOI:
https://doi.org/10.30837/ITSSI.2023.24.027
Keywords:
unstructured texts; ontology; recursive reduction; taxonomy; texts processing; presentation of information
Abstract
The subject of the research is methods of ontology-driven processing and presentation of unstructured texts in a global environment. The goal of the work is to improve the efficiency of automatic search, classification, and selection of information contained in electronic publications in unstructured form by developing a model, method, and means for the automated processing and transdisciplinary presentation of text documents written in Ukrainian, Russian, and English. In accordance with this goal, the following tasks were set: to review models for processing and presenting unstructured texts; to identify the advantages of information systems with an ontology-driven architecture; to develop a conceptual model and means for the automated processing and transdisciplinary presentation of text documents; to improve the method of recursive reduction; to use the developed method and means to process a large array of unstructured texts (in particular, the information base of knowledge of scientific products (IBK SP), educational programs, and educational and research projects of the National Center "Junior Academy of Sciences of Ukraine"); and to present these texts in a form that reveals the quality and completeness of the knowledge system available in the databases, thereby confirming experimentally the effectiveness of the developed means. The research is based on the following methods: systemological classification, algebraic-logical and axiomatic methods, and the method of recursive reduction.
The following results were obtained: existing models for processing and presenting unstructured texts were described; the technological aspects of the ontology-driven approach to processing and presenting large arrays of network documents were improved; and a technology was developed for forming ontologies of documents based on the representation of their taxonomies, the connections between their contexts, and multiple representations, which provides effective search for information in unstructured texts. Conclusions: the advantages of ontology-driven means for processing and presenting unstructured texts are substantiated. The method of recursive reduction was further developed by constructing a transformation that leads to the predicative form of ontologically given descriptions of information. The idea of effective search, processing, classification, and selection of the necessary information was also further developed.
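The abstract does not spell out the recursive reduction algorithm, so the following is only an illustrative sketch of the general idea it names: recursively reducing a flat, loosely structured text outline to a taxonomy, and then transforming that taxonomy into a predicative form. The `Node` type, the functions `reduce_level` and `to_predicates`, the `subclass_of` relation name, and the sample outline are all assumptions made for illustration and are not taken from the article.

```python
from dataclasses import dataclass, field
from typing import Iterator

@dataclass
class Node:
    """One concept in the taxonomy (hypothetical structure)."""
    name: str
    children: list["Node"] = field(default_factory=list)

def reduce_level(items, start=0, level=0):
    """Recursively reduce (level, title) pairs to a list of Nodes.

    Consumes items from index `start` while their level is >= `level`
    and returns the nodes built plus the index of the first unconsumed item.
    """
    nodes = []
    i = start
    while i < len(items):
        item_level, title = items[i]
        if item_level < level:
            break  # this item belongs to an ancestor; stop this reduction step
        node = Node(title)
        # Reduce everything nested under this item before moving on.
        node.children, i = reduce_level(items, i + 1, level + 1)
        nodes.append(node)
    return nodes, i

def to_predicates(nodes, parent="root") -> Iterator[tuple[str, str, str]]:
    """Flatten the taxonomy into (relation, subject, object) triples."""
    for node in nodes:
        yield ("subclass_of", node.name, parent)
        yield from to_predicates(node.children, node.name)

# Hypothetical outline extracted from a text: (nesting level, heading).
outline = [
    (0, "Programs"),
    (1, "Physics"),
    (2, "Optics"),
    (1, "Biology"),
    (0, "Projects"),
]

taxonomy, _ = reduce_level(outline)
facts = list(to_predicates(taxonomy))
```

Under these assumptions, `facts` holds one triple per concept, which could then be loaded into an ontology store or queried directly; a real pipeline would first need linguistic analysis to obtain the outline from raw text, which this sketch does not attempt.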
License
Copyright (c) 2023 Svitlana Haiko
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.