Simulation of search and classification of serv ice documents in automated information retrieval systems
DOI:
https://doi.org/10.15587/2312-8372.2013.14904Keywords:
Classification of documents, latent semantic analysisAbstract
The article discusses the use of methods of search and classification of service documents. This task is particularly relevant for bodies of state administration, where the quality of the decision depends on the timeliness of the required information. Tangible assistance in this matter can perform automated information retrieval systems. Despite the wide variety of automated document processing systems, a number of questions remain open. A promising direction for further development of these systems is the application of semantic analysis. In this regard, this study investigated the possibility of applying the methods of the latent semantic analysis to simulate the processes of search and classification of customs documents. Adequacy of this method was shown. Further development of the proposed methods is performed in the development of information technology of search of texts, their automatic classification and definition of the related documents.
References
- Деркач, Л. Українська митниця: вчора, сьогодні, завтра [Текст] / Л. В. Деркач. – К.: Державна митна служба України, 2000. – 542 с.
- Ульяновська, Ю. В. Автоматизація діловодства в митній справі [Текст] / Ю.В. Ульяновська, В.О. Яковенко, В.М. Ганжа // Вісник Академії митної служби України. – 2006. – №1(29). – С. 77-80.
- Величкевич, М. Б. Електронний документообіг, тенденції та перспективи [Текст] / М. Б. Величкевич, Н. В. Мітрофан, Н. Е. Кунанець // Вісник Національного університету «Львівська політехніка». Інформаційні системи та мережі. – 2010. – № 689. – С. 44–54.
- Матвієнко, О.В. Основи організації електронного документообігу. [Текст] / О.В. Матвієнко, М.Н. Цивін. – К.: Центр учбової літератури, 2008. – 112 с.
- Belkin, N. Evaluating Interactive Information Retrieval Systems: Opportunities and Challenges N. Belkin, J. Scholtz, S. Dumais, R. Wilkinson [Text] / N. Belkin, J. Scholtz, S. Dumais, R. Wilkinson. – CHI 2004. – April 24–29, 2004. – Vienna, Austria.
- Корнеев, В.В. Базы данных. Интеллектуальная обработка информации [Текст] / В.В. Корнеев, А.Ф. Гареев, С.В. Васютин, В.В. Райх. – М.: «Нолидж», 2000. – 352 с.
- Marcus, A. Recovering documentation-to-source-code traceability links using latent semantic indexing [Text] / A. Marcus, J.I. Maletic // Software Engineering, 2003. Proceedings. 25th International Conference on. – pp. 125 - 135
- Deerwester, S. Indexing by Latent Semantic Analysis [Text] / S. Deerwester, S.T. Dumais, G.W. Furnas, T.K. Landauer and R.A. Harshman // Journal of the American Society for Information Science. – 1990. – №41. – pp. 391-407.
- Круковский, М. Ю. Критерии эффективности систем электронного документооборота [Текст] / М. Ю. Круковский // Системи підтримки прийняття рішень. Теорія і практика. – 2005. – С. 107–111.
- Кураленок, И. Автоматическая классификация документов на основе латентно-семантического анализа [Текст] / И. Кураленок, И. Некрестьянов // Научные труды Донецкого национального технического университета. Серия: Информатика, кибернетика и вычислительная техника (ИКВТ-2006). – Вып. 25. – Донецк: ДонНТУ, 2006. – С. 324-335.
- Callan, J. Learning while filtering documents. In Proc. of SIGIR'98 [Text] / J. Callan. – Melbourne, Australia, 1998. – pp. 224-231.
- Derkach, L. (2000). Ukrainian Customs: yesterday, today, tomorrow. Kiyv: State Customs Service of Ukraine. 542p.
- Ulianovska, Yu., Yakovenko, V., Ganzha, V. (2006). Office-work automation in customs business. The bulletin of Ukrainian Academy of Customs, №1(29), 77-80
- Velichkevich, M.B., Mitrophan, N.V., Kunanec, N.E. (2010). Electronic document circulation, tendencies and prospects. The bulletin of National University “Lviv Polytechnic“.Information systems and networks, № 689, 44–54.
- Matvienko, О., Cyvin, M. (2008). Bases of the organization of electronic document circulation. Kiyv: The centre of the educational literature. 112p.
- Belkin, N., Scholtz, J., Dumais, S., Wilkinson, R. (2004). Evaluating Interactive Information Retrieval Systems: Opportunities and Challenges, CHI 2004, April 24–29, Vienna, Austria.
- Korneev, V.V., Gareev, A.F., Vasjutin, S.V., Reich, V.V. (2000). Database. Intelligent processing of information. M: Knowledge. 352p.
- Marcus, A., Maletic, J.I. (2003). Recovering documentation-to-source-code traceability links using latent semantic indexing. Software Engineering, Proceedings. 25th International Conference on, 125 – 135.
- Deerwester, S., Dumais, S.T., Furnas, G.W., Landauer, T.K. & Harshman, R.A. (1990). Indexing by Latent Semantic Analysis. Journal of the American Society for Information Science, 41, 391-407.
- Krukovskiy, M.Yu. (2005). Criteria of efficiency of systems of electronic document circulation. Systems of support of decision-making. The theory and practice, 107–111.
- Кuralenok, I., Nekrestianov, I. (2006). Automatic classification of documents on the basis of the latentno-semantic analysis. Proceedings of Donetsk national technical university. A series: Computer science, cybernetics and computer facilities, № 25, 324-335.
- Callan, J. (1998). Learning while filtering documents. In Proc. of SIGIR'98, Melbourne, Australia, 224-231.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2016 Юлія Вікторівна Ульяновська
This work is licensed under a Creative Commons Attribution 4.0 International License.
The consolidation and conditions for the transfer of copyright (identification of authorship) is carried out in the License Agreement. In particular, the authors reserve the right to the authorship of their manuscript and transfer the first publication of this work to the journal under the terms of the Creative Commons CC BY license. At the same time, they have the right to conclude on their own additional agreements concerning the non-exclusive distribution of the work in the form in which it was published by this journal, but provided that the link to the first publication of the article in this journal is preserved.