Building a model for resolving referential relations in a multilingual system

Authors

DOI:

https://doi.org/10.15587/1729-4061.2022.255786

Keywords:

information extraction, proximity measure, referential factors, semantic text analysis, anaphora

Abstract

This paper considers an approach to resolving referential relations when extracting information from text. The approach attempts to integrate the multifactorial model of the activation coefficient with a method for resolving referential ambiguity during ontology population. The objects found in the text are compared by assessing the proximity of their attributes and relations. An ontological interpretation of relations and of attribute similarity measures, based on a multifactorial model, is proposed. The model makes it possible to introduce the concepts of "rhetorical distance", "linear distance", "animacy", "distance between paragraphs", and "syntactic and semantic role of the antecedent". The multifactorial model is presented as a necessary and sufficient component for explaining the measure of similarity between referents when choosing the best candidate. The scoring system and its modifications were found by trial and error: the numerical weights were adjusted until they explained all the available material. The study also examines the factors that govern the choice of referential devices, which makes it possible to work with complex sentences and texts. Examples of computing the proximity measure in a multilingual system are given for Kazakh, Russian, and English. The source texts for the practical tasks were news articles from Internet sites that publish translations of the same article in several languages, including these three.

The authors carried out extensive practical work, which supports the thesis under consideration.
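The additive scoring idea described in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the abstract names the factors but gives no numerical weights, so the weights, the linear decay with distance, the `Candidate` fields, and the function names are all hypothetical and are not the authors' values or implementation.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    """A candidate antecedent described by the factors named in the abstract."""
    name: str
    rhetorical_distance: int   # steps back in the rhetorical structure, counted from 1
    linear_distance: int       # clauses back in the linear text, counted from 1
    paragraph_distance: int    # paragraph boundaries between mention and candidate
    animate: bool              # animacy of the referent
    role: str                  # syntactic role of the antecedent


# Hypothetical role weights, chosen only to make the example concrete.
ROLE_WEIGHT = {"subject": 0.4, "object": 0.2, "other": 0.0}


def distance_weight(distance: int, start: float, step: float) -> float:
    """Contribution that shrinks linearly with distance and never drops below zero."""
    return max(0.0, start - step * (distance - 1))


def activation(c: Candidate) -> float:
    """Sum the per-factor contributions into one activation score (hypothetical weights)."""
    score = distance_weight(c.rhetorical_distance, start=0.7, step=0.3)
    score += distance_weight(c.linear_distance, start=0.3, step=0.1)
    score -= 0.2 * c.paragraph_distance
    score += 0.2 if c.animate else 0.0
    score += ROLE_WEIGHT.get(c.role, 0.0)
    return score


def best_candidate(candidates: list[Candidate]) -> Candidate:
    """Pick the candidate antecedent with the highest activation score."""
    return max(candidates, key=activation)


if __name__ == "__main__":
    minister = Candidate("the minister", 1, 1, 0, True, "subject")
    report = Candidate("the report", 2, 2, 0, False, "object")
    for cand in (minister, report):
        print(f"{cand.name}: {activation(cand):.2f}")
    print("chosen antecedent:", best_candidate([minister, report]).name)
```

In this sketch a nearby, animate subject outranks a more distant, inanimate object. Tuning such weights until they account for the annotated material corresponds to the trial-and-error procedure the abstract mentions.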

Author Biographies

Yerzhan Zhumabay, Astana International University

Doctoral Student

Department of IT-Management

Gulzhamal Kalman, L.N. Gumilyov Eurasian National University

Doctoral Student

Department of Information Systems

Madina Sambetbayeva, L.N. Gumilyov Eurasian National University; Committee of Science of the Ministry of Education and Science of the Republic of Kazakhstan

PhD, Associate Professor

Department of Information Systems

Leading Researcher

Institute of Information and Computational Technologies

Aigerim Yerimbetova, Committee of Science of the Ministry of Education and Science of the Republic of Kazakhstan; Satbayev University

PhD, Associate Professor, Leading Researcher

Institute of Information and Computational Technologies

Professor

Department of Software Engineering

Assem Ayapbergenova, Satbayev University

Master of Engineering and Technology, Senior Lecturer

Department of Software Engineering

Institute of Automation and Information Technology

Almagul Bizhanova, Almaty University of Power Engineering and Telecommunications named after Gumarbek Daukeyev

Senior Lecturer

Department of Information Systems and Cybersecurity

Institute of Information Technologies

Published

2022-04-30

How to Cite

Zhumabay, Y., Kalman, G., Sambetbayeva, M., Yerimbetova, A., Ayapbergenova, A., & Bizhanova, A. (2022). Building a model for resolving referential relations in a multilingual system. Eastern-European Journal of Enterprise Technologies, 2(2 (116)), 27–35. https://doi.org/10.15587/1729-4061.2022.255786