Research of text information search methods using the capabilities of the Elastic platform
DOI:
https://doi.org/10.31498/2225-6733.47.2023.299979Keywords:
search, data, platform, databases, service, application, C#, Elastic, software, Logstash, IndexAbstract
This work is devoted to the analysis and optimization of the application search process using the ElasticSearch software tool. The subject of research is the Elastic platform in the context of data retrieval and analysis. The object is to optimize the search and data analysis process based on this platform. The purpose of this work is to study the possibilities and features of using ElasticSearch to create an effective application search mechanism in large application stores. The main tasks for achieving the goal of the work are defined. Conducted analysis of scientific literature on methods and technologies of data search and analysis based on the Elastic platform. Covers the core components and capabilities of the Elastic platform, including ElasticSearch, Kibana, and LogStash. A comparative analysis of search performance on the Elastic platform and alternative solutions was performed. A test scenario was developed and implemented to evaluate the speed and accuracy of the search using the platform. Implemented service that provides search functionality for applications. The possibilities of integrating the developed service with existing applications and systems for optimal use of its functionality are considered. The developed service was tested using various data sets to confirm its effectiveness and accuracy. The possibilities of scaling and optimization of the developed service for optimal performance when using large volumes of data are determined. The results of using the developed service are compared with similar solutions and conclusions are given regarding its competitiveness. Instructions for using the developed service have been developed and recommendations for its effective implementation in practical scenarios have been provided. The analysis of the obtained results was carried out and conclusions were drawn regarding the effectiveness and practical value of the developed service for solving specific tasks of processing and analyzing large volumes of data using the Elastic platform. Empirical research was conducted using real data sets to evaluate the performance of search on the platform. Explored opportunities to optimize and improve search performance on the platform through configuration and customization. Conclusions are made regarding the effectiveness of search using the Elastic platform and recommendations are provided for its use in specific scenarios
References
Обзор решений для полнотекстового поиска в веб-проектах: Sphinx, Apache Lucene, Xapian. URL: https://dou.ua/lenta/articles/full-text-search-engines-overview-sphinx-apache-lucene-xapian/ (дата звернення 28.07.2023).
Why Full Text’s CONTAINS Queries Are So Slow. URL: https://www.brentozar.com/archive/2020/11/why-full-texts-contains-queries-are-so-slow (дата звернення 28.07.2023).
Apache Solr. URL: https://solr.apache.org (дата звернення 28.07.2023).
Elastic Stack. URL: https://www.elastic.co/elastic-stack (дата звернення 13.08.2023).
Croft W.B., Lafferty J. Language modeling for information retrieval. Springer Science & Business Media. 2003. 246 p. DOI: https://doi.org/10.1007/978-94-017-0171-6.
Different ways to model your data in ElasticSearch. URL: https://medium.com/@zhaoyi0113/different-ways-to-model-your-data-in-elasticsearch-bbc719f3d4fc (дата звернення 10.08.2023).
Downloads
Published
How to Cite
Issue
Section
License
The journal «Reporter of the Priazovskyi State Technical University. Section: Technical sciences» is published under the CC BY license (Attribution License).
This license allows for the distribution, editing, modification, and use of the work as a basis for derivative works, even for commercial purposes, provided that proper attribution is given. It is the most flexible of all available licenses and is recommended for maximum dissemination and use of non-restricted materials.
Authors who publish in this journal agree to the following terms:
1. Authors retain the copyright of their work and grant the journal the right of first publication under the terms of the Creative Commons Attribution License (CC BY). This license allows others to freely distribute the published work, provided that proper attribution is given to the original authors and the first publication of the work in this journal is acknowledged.
2. Authors are allowed to enter into separate, additional agreements for non-exclusive distribution of the work in the same form as published in this journal (e.g., depositing it in an institutional repository or including it in a monograph), provided that a reference to the first publication in this journal is maintained.







