Development of adaptive reconfiguration method for stream data processing systems using system metrics
DOI:
https://doi.org/10.15587/2706-5448.2025.344185Keywords:
distributed systems, stream processing, Kafka Streams, adaptivity, adaptive, dynamic, RocksDBAbstract
The object of research is the process of adaptive configuration changes for stream processing applications which is focused on improving specific performance properties. The absence of the generalized automated approach for dynamic reconfiguration of state-store in limited hardware environment is the research problem addressed in this paper. The proposed solution helps to avoid a need for manual application reconfiguration from engineers. The implementation is based on Kafka Streams but designed to be portable across other frameworks that use RocksDB as a state store. Static configuration of modern stream processing systems limits efficiency under variable workloads. In this study, an adaptive module is proposed that monitors system metrics in real-time and automatically updates state-store configurations. The module performs deterministic check to derive new configuration based on predefined thresholds or utilizes a fine-tuned Large Language Model (LLM) to select new configuration values when decisions are vague. The method dynamically applies updates to the affected instance. High-load experimental results reveal the fact that adaptive executions eliminated write stalls, increased memtable hit ratio from 2% to 40% and block-cache hit ratio from 15% to 80%, reduced disk I/O by approximately 50%, and improved throughput by around 5%, at the cost of higher memory consumption. To avoid redundant adaptive updates and outlier-based bias a 10-minute observation frequency was selected. The approach is suitable for systems with fixed resources, state-intensive workloads with high key cardinality. Additionally, if covers the need for safe configuration change under operational constraints. The architecture is framework agnostic for the RocksDB-based based stream processing with state stores.
References
- Fragkoulis, M., Carbone, P., Kalavri, V., Katsifodimos, A. (2023). A survey on the evolution of stream processing systems. The VLDB Journal, 33 (2), 507–541. https://doi.org/10.1007/s00778-023-00819-8
- Checkpointing. Apache Flink. Available at: https://nightlies.apache.org/flink/flink-docs-master/docs/dev/datastream/fault-tolerance/checkpointing/ Last accessed: 27.10.2025
- Bashtovyi, A., Fechan, A. (2023). Change Data capture for migration to event-driven microservices Case Study. 2023 IEEE 18th International Conference on Computer Science and Information Technologies (CSIT). IEEE, 1–4. https://doi.org/10.1109/csit61576.2023.10324262
- Vyas, S., Tyagi, R. K., Sahu, S. (2023). Fault Tolerance and Error Handling Techniques in Apache Kafka. Proceedings of the 5th International Conference on Information Management & Machine Intelligence. Association for Computing Machinery, 1–5. https://doi.org/10.1145/3647444.3647844
- A persistent key-value store for fast storage environments. RocksDB. Available at: https://rocksdb.org/ Last accessed: 27.10.2025
- Cardellini, V., Lo Presti, F., Nardelli, M., Russo, G. R. (2022). Runtime Adaptation of Data Stream Processing Systems: The State of the Art. ACM Computing Surveys, 54 (11s), 1–36. https://doi.org/10.1145/3514496
- Herodotou, H., Odysseos, L., Chen, Y., Lu, J. (2022). Automatic Performance Tuning for Distributed Data Stream Processing Systems. 2022 IEEE 38th International Conference on Data Engineering (ICDE). IEEE, 3194–3197. https://doi.org/10.1109/icde53745.2022.00296
- Venkataraman, S., Panda, A., Ousterhout, K., Armbrust, M., Ghodsi, A., Franklin, M. J. et al. (2017). Drizzle. Proceedings of the 26th Symposium on Operating Systems Principles. Association for Computing Machinery, 374–389. https://doi.org/10.1145/3132747.3132750
- Geldenhuys, M., Pfister, B., Scheinert, D., Thamsen, L., Kao, O. (2022). Khaos: Dynamically Optimizing Checkpointing for Dependable Distributed Stream Processing. Proceedings of the 17th Conference on Computer Science and Intelligence Systems, 30, 553–561. https://doi.org/10.15439/2022f225
- Sun, D., Peng, J., Zhu, T., Kua, J., Gao, S., Buyya, R. (2025). Toward High‐Availability Distributed Stream Computing Systems via Checkpoint Adaptation. Concurrency and Computation: Practice and Experience, 37 (15-17). https://doi.org/10.1002/cpe.70171
- Liu, J., Gulisano, V. (2025). On-demand Memory Compression of Stream Aggregates through Reinforcement Learning. Proceedings of the 16th ACM/SPEC International Conference on Performance Engineering. Association for Computing Machinery, 240–252. https://doi.org/10.1145/3676151.3719369
- Wladdimiro, D., Arantes, L., Sens, P., Hidalgo, N. (2024). PA-SPS: A predictive adaptive approach for an elastic stream processing system. Journal of Parallel and Distributed Computing, 192, 104940. https://doi.org/10.1016/j.jpdc.2024.104940
- Hovorushchenko, T., Medzatyi, D., Voichur, Y., Lebiga, M. (2023). Method for forecasting the level of software quality based on quality attributes. Journal of Intelligent & Fuzzy Systems, 44 (3), 3891–3905. https://doi.org/10.3233/jifs-222394
- How to Tune RocksDB for Your Kafka Streams Application (2021). Confluent. Available at: https://www.confluent.io/blog/how-to-tune-rocksdb-kafka-streams-state-stores-performance/ Last accessed: 27.10.2025
- Oh, S., Moon, G. E., Park, S. (2024). ML-Based Dynamic Operator-Level Query Mapping for Stream Processing Systems in Heterogeneous Computing Environments. 2024 IEEE International Conference on Cluster Computing (CLUSTER). IEEE, 226–237. https://doi.org/10.1109/cluster59578.2024.00027
- Vysotska, V., Kyrychenko, I., Demchuk, V., Gruzdo, I. (2024). Holistic Adaptive Optimization Techniques for Distributed Data Streaming Systems. Proceedings of the 8th International Conference on Computational Linguistics and Intelligent Systems. Volume II: Modeling, Optimization, and Controlling in Information and Technology Systems Workshop (MOCITSW-CoLInS 2024). https://doi.org/10.31110/colins/2024-2/009
- Dong, S., Kryczka, A., Jin, Y., Stumm, M. (2021). RocksDB: Evolution of Development Priorities in a Key-value Store Serving Large-scale Applications. ACM Transactions on Storage, 17 (4), 1–32. https://doi.org/10.1145/3483840
- Bashtovyi, A. V., Fechan, A. V. (2025). Evaluating fault recovery in distributed applications for stream processing applications: business insights based on metrics. Radio Electronics, Computer Science, Control, 3, 17–27. https://doi.org/10.15588/1607-3274-2025-3-2
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Artem Bashtovyi, Andrii Fechan

This work is licensed under a Creative Commons Attribution 4.0 International License.
The consolidation and conditions for the transfer of copyright (identification of authorship) is carried out in the License Agreement. In particular, the authors reserve the right to the authorship of their manuscript and transfer the first publication of this work to the journal under the terms of the Creative Commons CC BY license. At the same time, they have the right to conclude on their own additional agreements concerning the non-exclusive distribution of the work in the form in which it was published by this journal, but provided that the link to the first publication of the article in this journal is preserved.



