Development of a knowledge-oriented system of machine translation based on analytic-synthetic text processing

Leonid Lytvynenko; Oleksandr Nikolaievskyi; Valeriy Lakhno; Elena Skliarenko

doi:10.15587/1729-4061.2017.92021

Authors

Leonid Lytvynenko, European University Academika Vernadskogo blvd., 16 V, Kyiv, Ukraine, 03115, https://orcid.org/0000-0002-0828-383X
Oleksandr Nikolaievskyi, European University Academika Vernadskogo blvd., 16 V, Kyiv, Ukraine, 03115, https://orcid.org/0000-0002-0786-5432
Valeriy Lakhno, European University Academika Vernadskogo blvd., 16 V, Kyiv, Ukraine, 03115, https://orcid.org/0000-0001-9695-4543
Elena Skliarenko, European University Academika Vernadskogo blvd., 16 V, Kyiv, Ukraine, 03115, https://orcid.org/0000-0001-6555-1223

DOI:

https://doi.org/10.15587/1729-4061.2017.92021

Keywords:

machine translation system, automated text analysis, analytical-synthetic text processing

Abstract

A method for automated syntactic text analysis based on a declarative representation of the rules of syntactic combinability was developed. Unlike existing methods, it uses tables of syntactic rules not only for context analysis but also for identifying the subject, the predicate, and the secondary parts of the sentence, as well as super-phrase syntactic combinations.
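
As a rough illustration of what a declarative rule table for syntactic combinability might look like, the following minimal sketch matches adjacent part-of-speech tags against a small table and assigns a relation to each matching pair. The tag names, the relation labels, the `Rule` type, and the `analyze` function are all hypothetical; the actual rule tables and the handling of super-phrase combinations are not described on this page.

```python
from typing import NamedTuple


class Rule(NamedTuple):
    left: str      # part-of-speech tag of the left word (assumed tag set)
    right: str     # part-of-speech tag of the right word (assumed tag set)
    relation: str  # relation assigned when the pair matches


# Hypothetical rule table: the rules are data, not control flow,
# so new combinations can be added without changing the analyzer.
RULES = [
    Rule("NOUN", "VERB", "subject-predicate"),
    Rule("ADJ", "NOUN", "attribute"),
    Rule("VERB", "NOUN", "object"),
]


def analyze(tagged):
    """Return (left_word, right_word, relation) for each adjacent pair that matches a rule."""
    links = []
    for (w1, t1), (w2, t2) in zip(tagged, tagged[1:]):
        for rule in RULES:
            if (t1, t2) == (rule.left, rule.right):
                links.append((w1, w2, rule.relation))
                break  # first matching rule wins
    return links


sentence = [("heavy", "ADJ"), ("tanks", "NOUN"), ("crossed", "VERB"), ("rivers", "NOUN")]
links = analyze(sentence)
```

Here the pair ("tanks", "crossed") is labelled as subject and predicate purely by table lookup, which is the sense in which the same table serves both context analysis and the identification of sentence parts.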

A method for software-based distribution of the analytical-synthetic processing of natural-language text in machine translation systems was developed. Unlike known methods, it takes into account the conditions for switching to parallel data processing, both at the level of processing tasks and depending on the data type.
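
The idea of switching to parallel processing only when certain conditions hold can be sketched as follows. This is an assumption-laden illustration, not the authors' method: the task names, the data kinds, the `MIN_UNITS` threshold, and the functions `should_parallelize` and `process` are all hypothetical, and the example only shows the decision logic (task type plus data type) rather than any real speed-up.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical conditions: which tasks and data kinds have independent units.
PARALLEL_TASKS = {"morphological", "syntactic"}
PARALLEL_DATA_KINDS = {"sentence"}  # assumed: super-phrase units need context
MIN_UNITS = 4                       # assumed threshold below which threads are not worth it


def should_parallelize(task, data_kind, units):
    """Decide on parallel processing from the task, the data type, and the workload."""
    return (
        task in PARALLEL_TASKS
        and data_kind in PARALLEL_DATA_KINDS
        and len(units) >= MIN_UNITS
    )


def process(task, data_kind, units, worker):
    """Apply worker to every unit, in parallel only when the conditions hold."""
    if should_parallelize(task, data_kind, units):
        with ThreadPoolExecutor() as pool:
            return list(pool.map(worker, units))  # map preserves input order
    return [worker(u) for u in units]


def count_tokens(sentence):
    return len(sentence.split())


sentences = ["a b c", "d e", "f", "g h i j"]
result = process("morphological", "sentence", sentences, count_tokens)
```

Both branches return results in input order, so the caller does not need to know which path was taken.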

The developed methods for analytical-synthetic processing of multilingual texts in Russian, Ukrainian, and English were implemented in C# applications. Experiments showed that, for texts in the military subject area, the developed software reduces the number of semantic errors by 14–16 % compared with existing machine translation systems, owing to automated text processing at the level of the sign system and the introduction of super-phrase synthesis.

Author Biographies

Leonid Lytvynenko, European University Academika Vernadskogo blvd., 16 V, Kyiv, Ukraine, 03115

Postgraduate student

Department of Information Systems and Mathematical Sciences

Oleksandr Nikolaievskyi, European University Academika Vernadskogo blvd., 16 V, Kyiv, Ukraine, 03115

Postgraduate student

Department of Information Systems and Mathematical Sciences

Valeriy Lakhno, European University Academika Vernadskogo blvd., 16 V, Kyiv, Ukraine, 03115

Doctor of Technical Sciences, Associate Professor

Department of Managing Information Security

Elena Skliarenko, European University Academika Vernadskogo blvd., 16 V, Kyiv, Ukraine, 03115

PhD, Associate Professor

Department of Information Systems and Mathematical Sciences

