The mechanism of terminological analysis of performance indexes of the integrated information system
DOI:
https://doi.org/10.15587/2312-8372.2015.56824Keywords:
semantic network, terminological analysis, machine-readable documents, statistical criteria, integrated information systemAbstract
It was analyzed the problem of automatically determining time series element in the text flow for machine documents and selecting statistical and linguistic criteria, allowing precise aims of research, which is to create problems for parsing unit and formation of context rules.
Software was developed as a result of the practical implementation of the formed parsing block of context rules.
Testing of the software using text content control revealed that a mechanism of terminological analysis of performance indexes of the integrated information system performs all tasks assigned to it.
The proposed mechanism of terminological analysis is reliable in terms of getting time series of the integrated system. The resulting application performances are clear of noise. Test results of the developed mechanism are more accurate than the previously obtained by other methods.References
- Salton, G., Buckley, C. (1988, January). Term-weighting approaches in automatic text retrieval. Information Processing & Management, Vol. 24, № 5, 513–523. doi:10.1016/0306-4573(88)90021-0
- Jacquemin, C., Bourigault, D.; In: Mitkov, R. (2003). Term extraction and automatic indexing. Handbook of Computational Linguistics. Oxford University Press, 599‑615. doi:10.1093/oxfordhb/9780199276349.013.0033
- Dobrov, B. V., Lukashevych, N. V., Syromyatnykov, S. V. (2003). Formyrovanye bazy termynolohycheskykh slovosochetanyy po tekstam predmetnoy oblasty. Trudy pyatoy konferentsyy vserossyyskoy nauchnoy konferentsyy «Elektronnye byblyoteky: Perspektyvnie metody y tekhnolohyy, elektronnye kollektsyy», 201–210.
- Efremova, N. E., Bol'shakova, E. I., Noskov, A. A., Antonov, V. Yu. (2010). Analysis of text terminology based on lexicosyntactic patterns. Available: http://www.dialog-21.ru/digests/dialog2010/materials/pdf/20.pdf. Last accessed: 23.10.2015.
- Beuster, G. (2001). MIC – A System for Classification of Structured and Unstructured Texts. University Koblenz. Available: http://www/gb/papers/thesismic/mic.pdf. Last accessed: 10.10.2015.
- Metodika raboty s istochnikami informatsii. Razdel 2. Available: http://edu.dvgups.ru/METDOC/EKMEN/ETEOR/ORGANIZ_ISSLED_D/METOD/SIMONENKO/UP/frame/frame_tema5.htm. Last accessed: 27.10.2015.
- Leont'eva, N. N. (2002). K teorii avtomaticheskogo ponimaniia teksta. Part 3. Semanticheskii komponent. Lokal'nyi semanticheskii analiz. Moscow: Publishing House of the Moscow University, 49.
- Kuznetsov, I. P., Kozerenko, E. B. (2008). Linguistic Рrocessor «Semantix» for Knowledge extraction from natural texts in Russian and English. Proceeding of International Conference on Machine Learning, ISAT-2008, 14-18 July, 2008. Las Vegas, USA CSREA Press, 835–841.
- XML DTD – An Introduction to XML Document Type Definitions. Available: http://www.xmlfiles.com/dtd/. Last accessed: 01.11.2015.
- Boyer, J. (2001, March). Canonical XML Version 1.0. Available: http://dx.doi.org/10.17487/rfc3076
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2016 Валентина Іванівна Кунченко-Харченко
This work is licensed under a Creative Commons Attribution 4.0 International License.
The consolidation and conditions for the transfer of copyright (identification of authorship) is carried out in the License Agreement. In particular, the authors reserve the right to the authorship of their manuscript and transfer the first publication of this work to the journal under the terms of the Creative Commons CC BY license. At the same time, they have the right to conclude on their own additional agreements concerning the non-exclusive distribution of the work in the form in which it was published by this journal, but provided that the link to the first publication of the article in this journal is preserved.