Research of text information search methods using the capabilities of the Elastic platform

Authors

  • A.V. Krasnoperov State Higher Education Institution "Priazovskyi state technical university", Dnipro, Ukraine
  • O.V. Kryvenko State Higher Education Institution "Priazovskyi state technical university", Dnipro, Ukraine https://orcid.org/0009-0006-2860-6575
  • T.O. Levytska State Higher Education Institution "Priazovskyi state technical university", Dnipro, Ukraine https://orcid.org/0000-0003-3359-1313

DOI:

https://doi.org/10.31498/2225-6733.47.2023.299979

Keywords:

search, data, platform, databases, service, application, C#, Elastic, software, Logstash, Index

Abstract

This work is devoted to the analysis and optimization of the application search process using the ElasticSearch software tool. The subject of research is the Elastic platform in the context of data retrieval and analysis. The object is to optimize the search and data analysis process based on this platform. The purpose of this work is to study the possibilities and features of using ElasticSearch to create an effective application search mechanism in large application stores. The main tasks for achieving the goal of the work are defined. Conducted analysis of scientific literature on methods and technologies of data search and analysis based on the Elastic platform. Covers the core components and capabilities of the Elastic platform, including ElasticSearch, Kibana, and LogStash. A comparative analysis of search performance on the Elastic platform and alternative solutions was performed. A test scenario was developed and implemented to evaluate the speed and accuracy of the search using the platform. Implemented service that provides search functionality for applications. The possibilities of integrating the developed service with existing applications and systems for optimal use of its functionality are considered. The developed service was tested using various data sets to confirm its effectiveness and accuracy. The possibilities of scaling and optimization of the developed service for optimal performance when using large volumes of data are determined. The results of using the developed service are compared with similar solutions and conclusions are given regarding its competitiveness. Instructions for using the developed service have been developed and recommendations for its effective implementation in practical scenarios have been provided. The analysis of the obtained results was carried out and conclusions were drawn regarding the effectiveness and practical value of the developed service for solving specific tasks of processing and analyzing large volumes of data using the Elastic platform. Empirical research was conducted using real data sets to evaluate the performance of search on the platform. Explored opportunities to optimize and improve search performance on the platform through configuration and customization. Conclusions are made regarding the effectiveness of search using the Elastic platform and recommendations are provided for its use in specific scenarios

Author Biographies

A.V. Krasnoperov, State Higher Education Institution "Priazovskyi state technical university", Dnipro

Master's student

O.V. Kryvenko, State Higher Education Institution "Priazovskyi state technical university", Dnipro

PhD (Engineering), associate professor

T.O. Levytska, State Higher Education Institution "Priazovskyi state technical university", Dnipro

PhD (Engineering), associate professor

References

Обзор решений для полнотекстового поиска в веб-проектах: Sphinx, Apache Lucene, Xapian. URL: https://dou.ua/lenta/articles/full-text-search-engines-overview-sphinx-apache-lucene-xapian/ (дата звернення 28.07.2023).

Why Full Text’s CONTAINS Queries Are So Slow. URL: https://www.brentozar.com/archive/2020/11/why-full-texts-contains-queries-are-so-slow (дата звернення 28.07.2023).

Apache Solr. URL: https://solr.apache.org (дата звернення 28.07.2023).

Elastic Stack. URL: https://www.elastic.co/elastic-stack (дата звернення 13.08.2023).

Croft W.B., Lafferty J. Language modeling for information retrieval. Springer Science & Business Media. 2003. 246 p. DOI: https://doi.org/10.1007/978-94-017-0171-6.

Different ways to model your data in ElasticSearch. URL: https://medium.com/@zhaoyi0113/different-ways-to-model-your-data-in-elasticsearch-bbc719f3d4fc (дата звернення 10.08.2023).

Published

2023-12-28

How to Cite

Krasnoperov, A. ., Kryvenko, O. ., & Levytska, T. . (2023). Research of text information search methods using the capabilities of the Elastic platform. Reporter of the Priazovskyi State Technical University. Section: Technical Sciences, (47), 49–57. https://doi.org/10.31498/2225-6733.47.2023.299979