Parser as a tool for natural language understanding by machine
DOI:
https://doi.org/10.15587/1729-4061.2013.12353Keywords:
Understanding of natural language texts, primary analysis of natural language, natural language parser designAbstract
This article discusses the features of design and implementation of a tool for machine understanding of natural language texts and presents the results obtained. The main purpose of the study is a comprehensive analysis of applicability of modern approaches and paradigms of parsing to design Russian language text parse. Following the results of the study, we have developed a prototype, which is based on the proposed logical-linguistic model of textual information and which uses grammar of relationships for categories of elements of language structures. The prototype provides a formal representation of textual information in natural language in the form of the dependencies tree without using the parsing. The features of realization include the separation of a module of text processing to prepare it for parsing by pre-segmentation and tokenization, and the solution of the problem of morphological homonymy by choosing among possible grammatical forms the one, which provides the maximum composition of the dependencies tree. The results can be used to design a comprehensive system of the machine translationReferences
- Марчук, Ю.Н. Компьютерная лингвистика [Текст] / Ю.Н. Марчук. – М.: Изд-во АСТ, 2007. – 320 с.
- Компиляторы. Принципы, технологии и инструментарий / [Альфред В. Ахо, Моника С. Лам, Рави Сети, Джеффри Д. Ульман]; пер. с англ. И. Красиков. – М.: Издательство «Вильямс», 2008. – 1184 с.
- Foster, J.M. Automatic Syntactic Analysis [Текст] / J.M. Foster; general ed. Stanley Gill. – New York: MacDonald, London and American Elsevier Inc., 1970. – 70 p.
- Dependency Parsing: [Synthesis Lectures on Human Language Technologies] / [Sandra Kubler, Ryan McDonald, Joakim Nivre]; ser. еd. Graeme Hirst. – Morgan & Claypool Publishers, 2009. – 115 p.
- D. Grune Parsing Techniques – A Practical Guide [Текст] / D. Grune, Ceriel J.H. Jacobs. – [2-ond ed.]. – Amsterdam: Springer, 2008. – 662 p.
- David R. Dowty Natural Language Parsing: Psychological, Computational, and Theoretical Perspectives [Текст] / David R. Dowty, Lauri Karttunen, Arnold M. Zwicky. – Cambridge University Press, 2005. – 428 p.
- Оценка методов автоматического анализа текста 2011–2012: синтаксические парсеры русского языка / [Толдова С., Соколова Е., Астафьева И. и др.] // Компьютерная лингвистика и интеллектуальные технологии. – 2012. – Вып. 11 (18). – С. 797–810.
- Encyclopedia of Linguistics / ed. Philipp Strazny. – [2 vols.]. – New York, Oxon: Fitzroy Dearborn, 2005. – 1304 p.
- Тестелец, Я. Г. Введение в общий синтаксис [Текст] / Я. Г. Тестелец. – М.: РГГУ, 2001. – 798 с.
- Братко, И. Алгоритмы искусственного интеллекта на языке PROLOG / И. Братко; пер. с англ. – [3-е изд.]. – М.: Издательский дом “Вильямс”, 2004. – 640 с.
- Marchuk, Y.N. Computer Linguistics. (2007). – M.: AST Publishing House. – 320 p.
- Aho, Alfred V., Lam, Monica S., Seti, R., Ullman, Jeffrey D. Compilers. Principles, technologies and tools. (2008). Trans. from eng. Krasikov, I. – M.: Williams Publishing House. – 1184 p.
- Foster, J.M. Automatic Syntactic Analysis. (1970). General ed. Stanley Gill. – New York: MacDonald, London and American Elsevier Inc. – 70 p.
- Kubler, S., McDonald, R., Nivre, J. Dependency Parsing: [Synthesis Lectures on Human Language Technologies]. (2009). Ser. еd. Hirst, G. – Morgan & Claypool Publishers. – 115 p.
- Grune, D., Jacobs, Ceriel J.H. Parsing Techniques – A Practical Guide. (2008). 2-ond ed. – Amsterdam: Springer. – 662 p.
- Dowty, David R., Karttunen, L., Zwicky, Arnold M. Natural Language Parsing: Psychological, Computational, and Theoretical Perspectives. (2005). – Cambridge University Press. – 428 p.
- Toldova, S., Sokolova, E., Astafieva, I. Rating parsing methods 2011–2012: Syntactic parsers of the Russian language. (2012). Computer Linguistics and Intelligent Technologies, 11(18), 797–810.
- Encyclopedia of Linguistics. (2005). Ed. Strazny, Ph., 2 vols. – New York, Oxon: Fitzroy Dearborn. – 1304 p.
- Testelets, Y.G. Introduction to common syntax. (2001). – M.: RGGU University Press. – 798 p.
- Bratko, I. Artificial intelligence algorithms in the language PROLOG. (2004). Trans. from eng. 2-ond ed. – M.: Williams Publishing House. – 640 p.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2014 Ірина Анатоліївна Жирякова, Михайло Сергійович Симоненко
This work is licensed under a Creative Commons Attribution 4.0 International License.
The consolidation and conditions for the transfer of copyright (identification of authorship) is carried out in the License Agreement. In particular, the authors reserve the right to the authorship of their manuscript and transfer the first publication of this work to the journal under the terms of the Creative Commons CC BY license. At the same time, they have the right to conclude on their own additional agreements concerning the non-exclusive distribution of the work in the form in which it was published by this journal, but provided that the link to the first publication of the article in this journal is preserved.
A license agreement is a document in which the author warrants that he/she owns all copyright for the work (manuscript, article, etc.).
The authors, signing the License Agreement with TECHNOLOGY CENTER PC, have all rights to the further use of their work, provided that they link to our edition in which the work was published.
According to the terms of the License Agreement, the Publisher TECHNOLOGY CENTER PC does not take away your copyrights and receives permission from the authors to use and dissemination of the publication through the world's scientific resources (own electronic resources, scientometric databases, repositories, libraries, etc.).
In the absence of a signed License Agreement or in the absence of this agreement of identifiers allowing to identify the identity of the author, the editors have no right to work with the manuscript.
It is important to remember that there is another type of agreement between authors and publishers – when copyright is transferred from the authors to the publisher. In this case, the authors lose ownership of their work and may not use it in any way.