ONTOLOGY-DRIVEN MEANS FOR PROCESSING AND PRESENTATION OF LARGE ARRAYS OF UNSTRUCTURED TEXTS

Authors

DOI:

https://doi.org/10.30837/ITSSI.2023.24.027

Keywords:

unstructured texts; ontology; recursive reduction; taxonomy; texts processing; presentation of information

Abstract

The subject of the article's research is methods of ontology-driven processing and presentation of unstructured texts in a global environment. The goal of the work is to improve the efficiency of automatic search, classification and selection of the necessary information contained in electronic publications in an unstructured form by developing a model, method and means of automated processing and transdisciplinary presentation of text documents created in Ukrainian, Russian and English languages. In accordance with the goal, the following tasks were set: to conduct an overview of models of processing and presentation of unstructured texts, to identify the advantages of information systems with an ontology-driven architecture, to develop a conceptual model and means for automated processing and transdisciplinary presentation of text documents, improve the method of recursive reduction, with the help of developed method and means to process a large array of unstructured texts (in particular information base of knowledge of scientific products (IBK SP), educational programs, as well as educational and research projects of the National Center "Junior Academy of Sciences of Ukraine"), to present them in a form that allows to reveal the quality and completeness of the knowledge system available in the databases and, thus, experimentally to confirm the effectiveness of the developed means. The conducted research is based on the following methods: systemological classification, algebraic-logical and axiomatic methods, the method of recursive reduction. The following results were obtained: the existing models of processing and presentation of unstructured texts were described, the technological aspects of the ontology-driven approach to the processing and presentation of large arrays of network documents were improved, the technology of forming ontologies of documents based on the representation of their taxonomies, connections between their contexts and multiple representations was developed, which provides effective search for information in unstructured texts. Conclusions: The advantages of ontology-driven means for processing and presentation unstructured texts are substantiated. The method of recursive reduction by constructing a transformation, which leads to the predicative form of ontologically given descriptions of information, gained further development. The idea of effective search, processing, classification and selection of the necessary information has gained further development.

Author Biography

Svitlana Haiko, Institute of Telecommunications and Global Information Space of the National Academy of Sciences of Ukraine

Junior Researcher of Institute of Telecommunications and Global Information Space of the National Academy of Sciences of Ukraine

References

Список літератури

Павленко П. М., Філоненко С. Ф., Бабіч К. С. та ін. Інформаційні системи і технології. Київ : НАУ, 2013. 324 с.

Jones K. S. Natural language processing: a historical review. 1994. URL: https://aclanthology.org/www.mt-archive.info/Zampolli-1994-Sparck-Jones.pdf (дата звернення: 10.05.2021)

Liddy E. D. Natural Language Processing. Encyclopedia of Library and Information Science. NY : Marcel Decker, Inc. 2001.

Jurafsky D., Martin G. H. Speech and Language Processing. Prentice Hall. 2000. URL: https://web.stanford.edu/~jurafsky/slp3 (дата звернення: 10.05.2021)

Barkovska O., Khomych V., Nastenko O. Дослідження методів обробки та аналізу тексту при організації електронних сховищ інформаційних об’єктів. Сучасний стан наукових досліджень та технологій в промисловості. 2022. № 1 (19). С. 5–12. DOI: https://doi.org/https://doi.org/10.30837/ITSSI.2022.19.005

Praveena S., Justus S. A Study on Knowledge Representation Models. European Journal of Molecular and Clinical Medicine. 2020. вип. 7. № 4. С. 2446–2452.

Субботін С. О. Подання й обробка знань у системах штучного інтелекту та підтримки прийняття рішень. Запоріжжя: ЗНТУ, 2008. 341 с.

Yang L., Cormican K., Yu M. Ontology-based systems engineering: A state-of-the-art review. Computers in Industry. 2019. вип. 111. С. 148–171. DOI: https://doi.org/https://doi.org/10.1016/j.compind.2019.05.003

Басюк Т. М., Досин Д. Г., Литвин В. В. Онтологічний інжиніринг. Львів : Вид-во Львівської політехніки. 2017. 224 с.

Інструкція користувача КІТ "Поліедр". 2020. URL: https://storage.ulif.org.ua/storage/instructions/polyhedron_instruction.pdf (дата звернення: 10.05.2021)

Lawrence M., Williams S., Nanz P., Renn O. Characteristics, potentials, and challenges of transdisciplinary research. One Earth. 2022. вип. 5. № 1. С. 44–61. DOI: https://doi.org/https://doi.org/10.1016/j.oneear.2021.12.010

Renn O. Transdisciplinarity: Synthesis towards a modular approach. Futures. 2021. вип. 130. С. 1–18. DOI: https://doi.org/https://doi.org/10.1016/j.futures.2021.102744

Гончар А. В., Стрижак О. Є., Беркман Л. Н. Трансдисциплінарна консолідація інформаційних середовищ. Зв’язок. 2021. № 1(149). С. 3–10. DOI: https://doi.org/10.31673/2412-9070.2021.010310

Гайко С. І., Приходнюк В. В. Підхід до автоматизованої структуризації освітніх ресурсів на основі методу рекурсивної редукції. Наукові записки Малої академії наук України. 2021. № 1 (20). С. 28–38. DOI: https://doi.org/http://doi.org/10.51707/2618-0529-2021-20-03

Приходнюк В. В. Технологічні засоби трансдисциплінарного представлення геопросторової інформації: дис. канд. техн. наук. Інститут телекомунікацій і глобального інформаційного простору. 2017. 267 с.

Стрижак О. Є., Приходнюк В. В., Гайко С. І., Шаповалов В. Б. Відображення мережевої інформації у вигляді інтерактивних документів. Трансдисциплінарний підхід. Математичне моделювання в економіці. 2018. № 3. С. 87–100.

Гайко С., Приходнюк В. Средства трансдисциплинарного представления информационных ресурсов разных стилей. Information Models and Analysis. Sofia : ITHEA, 2020. вип. 9. № 1. С. 78–99.

Величко В. Ю. Алгоритм побудови зростаючих пірамідальних мереж у паралельному обчиcлювальному середовищі. Комп’ютерні засоби мережі та системи. 2011. № 10. С. 50–57.

Dovgyi S., Stryzhak O. Transdisciplinary Fundamentals of Information-Analytical Activity. Advances in Information and Communication Technology and Systems. MCT 2019. Lecture Notes in Networks and Systems. 2019. вип. 152. DOI: https://doi.org/https://doi.org/10.1007/978-3-030-58359-0_7

References

Pavlenko, P. M., Filonenko, S. F., Babich, K. S. and others. (2013), Information systems and technologies, Kyiv, NAU, 324 p.

Jones, K. S. (1994), "Natural language processing: a historical review", available at: https://aclanthology.org/www.mt-archive.info/Zampolli-1994-Sparck-Jones.pdf (last accessed: 10.05.2021)

Liddy, E. D. (2001), "Natural Language Processing", Library and Information Science, New York, P. 15–25. DOI: https://doi.org/10.1145/234173.234180

Jurafsky, D., Martin, G. H. (2000), "Speech and Language Processing", Prentice Hall, available at: https://web.stanford.edu/~jurafsky/slp3/ (last accessed: 11.01.2023).

Barkovska, O., Khomych, V., Nastenko, O. (2022), "Study of text processing and analysis methods in the organization of electronic storage of information objects", Innovative technologies and scientific solutions for industries, No 1 (19), P. 5–12. DOI: https://doi.org/https://doi.org/10.30837/ITSSI.2022.19.005

Praveena, S., Justus, S. (2020), "A Study on Knowledge Representation Models", European Journal of Molecular and Clinical Medicine, Vol 7, No. 4, P. 2446–2452.

Subbotin, S. O. (2008), Presentation and processing of knowledge in artificial intelligence and decision support systems, Zaporizhzhia, 108 p.

Yang, L., Cormican, K., Yu, M. (2019), "Ontology-based systems engineering: A state-of-the-art review", Computers in Industry, No. 111, P. 148–171. DOI: https://doi.org/https://doi.org/10.1016/j.compind.2019.05.003

Basyuk, T. M., Dosyn, D. G., Lytvyn, V. V. (2017), Ontological engineering, Lviv, 224 p.

"KIT "Polyhedron" user manual", (2020), available at: https://storage.ulif.org.ua/storage/instructions/polyhedron_instruction.pdf (last accessed: 12.08.2021)

Lawrence, M., Williams, S., Nanz, P., Renn, O. (2022), "Characteristics, potentials, and challenges of transdisciplinary research", One Earth, Vol. 5, No. 1, P. 44–61. DOI: https://doi.org/https://doi.org/10.1016/j.oneear.2021.12.010

Renn, O. (2021), "Transdisciplinarity: Synthesis towards a modular approach", Futures, Vol. 130, P. 1–18. DOI: https://doi.org/https://doi.org/10.1016/j.futures.2021.102744

Gonchar, A. V., Stryzhak, O. E., Berkman, L. N. (2021), "Transdisciplinary consolidation of information environments" ["Transdy`scy`plinarna konsolidaciya informacijny`x seredovy`shh"], Connection [Zv`yazok], No. 1(149), P. 3–10. DOI: https://doi.org/10.31673/2412-9070.2021.010310

Haiko, S. I., Prykhodnyuk, V. V. (2021), "Approach to the automated structuring of educational resources based on the method of recursive reduction", Scientific notes of the Junior Academy of Sciences of Ukraine, No. 1 (20), P. 28–38. DOI: https://doi.org/http://doi.org/10.51707/2618-0529-2021-20-03

Prikhodniuk, V. V. (2017), Technological means of transdisciplinary presentation of geospatial information [Texnologichni zasoby` transdy`scy`plinarnogo predstavlennya geoprostorovoyi informaciyi], Ph.D. thesis, Institute of Telecommunications and Global Information Space, 267 p.

Stryzhak, O. E., Prykhodnyuk, V. V., Haiko, S. I., Shapovalov, V. B. (2018), "Display of network information in the form of interactive documents. Transdisciplinary approach" ["Vidobrazhennya merezhevoyi informaciyi u vy`glyadi interakty`vny`x dokumentiv. Transdy`scy`plinarny`j pidxid"], Mathematical modeling in economics, No. 3, P. 87–100.

Gaiko, S., Prikhodnyuk, V. (2020), "Means of transdisciplinary presentation of information resources of different styles" ["Sredstva transdistsiplinarnogo predstavleniya informatsionnykh resursov raznykh stiley"], Information Models and Analysis, Sofia, ITHEA, Vol. 9, No. 1, P. 78–99.

Velichko, V. Yu. (2011), "Algorithm for building growing pyramidal networks in a parallel computing environment" ["Algory`tm pobudovy` zrostayuchy`x piramidal`ny`x merezh u paralel`nomu obchy`clyuval`nomu seredovy`shhi"], Network and system computer tools, No. 10. P. 50–57.

Dovgyi, S., Stryzhak, O. (2019), "Transdisciplinary Fundamentals of Information-Analytical Activity", Lecture Notes in Networks and Systems, Vol. 152. DOI: https://doi.org/https://doi.org/10.1007/978-3-030-58359-0_7

Published

2023-11-13

How to Cite

Haiko, S. (2023). ONTOLOGY-DRIVEN MEANS FOR PROCESSING AND PRESENTATION OF LARGE ARRAYS OF UNSTRUCTURED TEXTS. INNOVATIVE TECHNOLOGIES AND SCIENTIFIC SOLUTIONS FOR INDUSTRIES, (2 (24), 27–38. https://doi.org/10.30837/ITSSI.2023.24.027