Development of adaptive reconfiguration method for stream data processing systems using system metrics

Authors

DOI:

https://doi.org/10.15587/2706-5448.2025.344185

Keywords:

distributed systems, stream processing, Kafka Streams, adaptivity, adaptive, dynamic, RocksDB

Abstract

The object of research is the process of adaptive configuration changes for stream processing applications which is focused on improving specific performance properties. The absence of the generalized automated approach for dynamic reconfiguration of state-store in limited hardware environment is the research problem addressed in this paper. The proposed solution helps to avoid a need for manual application reconfiguration from engineers. The implementation is based on Kafka Streams but designed to be portable across other frameworks that use RocksDB as a state store. Static configuration of modern stream processing systems limits efficiency under variable workloads. In this study, an adaptive module is proposed that monitors system metrics in real-time and automatically updates state-store configurations. The module performs deterministic check to derive new configuration based on predefined thresholds or utilizes a fine-tuned Large Language Model (LLM) to select new configuration values when decisions are vague. The method dynamically applies updates to the affected instance. High-load experimental results reveal the fact that adaptive executions eliminated write stalls, increased memtable hit ratio from 2% to 40% and block-cache hit ratio from 15% to 80%, reduced disk I/O by approximately 50%, and improved throughput by around 5%, at the cost of higher memory consumption. To avoid redundant adaptive updates and outlier-based bias a 10-minute observation frequency was selected. The approach is suitable for systems with fixed resources, state-intensive workloads with high key cardinality. Additionally, if covers the need for safe configuration change under operational constraints. The architecture is framework agnostic for the RocksDB-based based stream processing with state stores.

Author Biographies

Artem Bashtovyi, Lviv Polytechnic National University

PhD, Assistant

Department of Software

Andrii Fechan, Lviv Polytechnic National University

Doctor of Technical Sciences, Professor

Department of Software

References

  1. Fragkoulis, M., Carbone, P., Kalavri, V., Katsifodimos, A. (2023). A survey on the evolution of stream processing systems. The VLDB Journal, 33 (2), 507–541. https://doi.org/10.1007/s00778-023-00819-8
  2. Checkpointing. Apache Flink. Available at: https://nightlies.apache.org/flink/flink-docs-master/docs/dev/datastream/fault-tolerance/checkpointing/ Last accessed: 27.10.2025
  3. Bashtovyi, A., Fechan, A. (2023). Change Data capture for migration to event-driven microservices Case Study. 2023 IEEE 18th International Conference on Computer Science and Information Technologies (CSIT). IEEE, 1–4. https://doi.org/10.1109/csit61576.2023.10324262
  4. Vyas, S., Tyagi, R. K., Sahu, S. (2023). Fault Tolerance and Error Handling Techniques in Apache Kafka. Proceedings of the 5th International Conference on Information Management & Machine Intelligence. Association for Computing Machinery, 1–5. https://doi.org/10.1145/3647444.3647844
  5. A persistent key-value store for fast storage environments. RocksDB. Available at: https://rocksdb.org/ Last accessed: 27.10.2025
  6. Cardellini, V., Lo Presti, F., Nardelli, M., Russo, G. R. (2022). Runtime Adaptation of Data Stream Processing Systems: The State of the Art. ACM Computing Surveys, 54 (11s), 1–36. https://doi.org/10.1145/3514496
  7. Herodotou, H., Odysseos, L., Chen, Y., Lu, J. (2022). Automatic Performance Tuning for Distributed Data Stream Processing Systems. 2022 IEEE 38th International Conference on Data Engineering (ICDE). IEEE, 3194–3197. https://doi.org/10.1109/icde53745.2022.00296
  8. Venkataraman, S., Panda, A., Ousterhout, K., Armbrust, M., Ghodsi, A., Franklin, M. J. et al. (2017). Drizzle. Proceedings of the 26th Symposium on Operating Systems Principles. Association for Computing Machinery, 374–389. https://doi.org/10.1145/3132747.3132750
  9. Geldenhuys, M., Pfister, B., Scheinert, D., Thamsen, L., Kao, O. (2022). Khaos: Dynamically Optimizing Checkpointing for Dependable Distributed Stream Processing. Proceedings of the 17th Conference on Computer Science and Intelligence Systems, 30, 553–561. https://doi.org/10.15439/2022f225
  10. Sun, D., Peng, J., Zhu, T., Kua, J., Gao, S., Buyya, R. (2025). Toward High‐Availability Distributed Stream Computing Systems via Checkpoint Adaptation. Concurrency and Computation: Practice and Experience, 37 (15-17). https://doi.org/10.1002/cpe.70171
  11. Liu, J., Gulisano, V. (2025). On-demand Memory Compression of Stream Aggregates through Reinforcement Learning. Proceedings of the 16th ACM/SPEC International Conference on Performance Engineering. Association for Computing Machinery, 240–252. https://doi.org/10.1145/3676151.3719369
  12. Wladdimiro, D., Arantes, L., Sens, P., Hidalgo, N. (2024). PA-SPS: A predictive adaptive approach for an elastic stream processing system. Journal of Parallel and Distributed Computing, 192, 104940. https://doi.org/10.1016/j.jpdc.2024.104940
  13. Hovorushchenko, T., Medzatyi, D., Voichur, Y., Lebiga, M. (2023). Method for forecasting the level of software quality based on quality attributes. Journal of Intelligent & Fuzzy Systems, 44 (3), 3891–3905. https://doi.org/10.3233/jifs-222394
  14. How to Tune RocksDB for Your Kafka Streams Application (2021). Confluent. Available at: https://www.confluent.io/blog/how-to-tune-rocksdb-kafka-streams-state-stores-performance/ Last accessed: 27.10.2025
  15. Oh, S., Moon, G. E., Park, S. (2024). ML-Based Dynamic Operator-Level Query Mapping for Stream Processing Systems in Heterogeneous Computing Environments. 2024 IEEE International Conference on Cluster Computing (CLUSTER). IEEE, 226–237. https://doi.org/10.1109/cluster59578.2024.00027
  16. Vysotska, V., Kyrychenko, I., Demchuk, V., Gruzdo, I. (2024). Holistic Adaptive Optimization Techniques for Distributed Data Streaming Systems. Proceedings of the 8th International Conference on Computational Linguistics and Intelligent Systems. Volume II: Modeling, Optimization, and Controlling in Information and Technology Systems Workshop (MOCITSW-CoLInS 2024). https://doi.org/10.31110/colins/2024-2/009
  17. Dong, S., Kryczka, A., Jin, Y., Stumm, M. (2021). RocksDB: Evolution of Development Priorities in a Key-value Store Serving Large-scale Applications. ACM Transactions on Storage, 17 (4), 1–32. https://doi.org/10.1145/3483840
  18. Bashtovyi, A. V., Fechan, A. V. (2025). Evaluating fault recovery in distributed applications for stream processing applications: business insights based on metrics. Radio Electronics, Computer Science, Control, 3, 17–27. https://doi.org/10.15588/1607-3274-2025-3-2
Development of adaptive reconfiguration method for stream data processing systems using system metrics

Downloads

Published

2025-12-29

How to Cite

Bashtovyi, A., & Fechan, A. (2025). Development of adaptive reconfiguration method for stream data processing systems using system metrics. Technology Audit and Production Reserves, 6(2(86), 15–22. https://doi.org/10.15587/2706-5448.2025.344185

Issue

Section

Information Technologies