Development and verification of an orchestration architecture of AI agents for automated API testing with a unified representation of requirements

Maksym Moskalenko

doi:10.15587/2706-5448.2026.360928

Authors

Maksym Moskalenko Educational and Scientific Institute “Ukrainian State University of Chemical Technology”, Ukraine https://orcid.org/0009-0009-0767-2233

DOI:

https://doi.org/10.15587/2706-5448.2026.360928

Keywords:

API, testing, orchestration, API requirements, LLM, DevOps, CI/CD, architecture, automation, determinism

Abstract

The object of this research is the process of automated testing of application programming interfaces (API) in systems developed following Agile and DevOps approaches, where requirements are heterogeneous, frequently changing, and represented in various formats: from formal OpenAPI specifications to textual documentation (Confluence). The problem addressed is the absence of an architectural mechanism for systematic integration of heterogeneous requirements sources and for decoupling requirements interpretation from test generation. Specification-oriented approaches fail to incorporate business rules from textual sources, covering only 70–80% of specification content. LLM-based approaches are unstable: repeated runs with identical prompts yield test sets differing by 20–40%, with coverage standard deviation reaching ±12%.

AI-driven orchestration architecture is proposed, comprising a coordination layer O, requirements-processing agents A, and a protocol-independent unified requirements representation R. The test generation process is formalized as T = G(O(A(S))), where S denotes the set of requirements sources and G the deterministic test generation algorithm. The key property of the architecture is isolation of the stochastic LLM component at the agent level, guaranteeing reproducibility of the test set T for any fixed R.

Verification was conducted through a comparative experiment on a REST API service with 5 endpoints and 12 business rules. API coverage: 88–92% vs. 72–78% (specification-based) and 55–82% (LLM-based). Standard deviation: ±2% vs. ±3% and ±12%. Reproducibility: 0.97 vs. 0.95 and 0.62. Maintenance: 15–20% modified tests vs. 60–70% and 40–55%.

The proposed architecture targets software development teams practising API-First Development and Documentation-Driven Development. Results are applicable to Agile/DevOps environments with frequently changing requirements.

Author Biography

Maksym Moskalenko, Educational and Scientific Institute “Ukrainian State University of Chemical Technology”

PhD Student

Department of Information Systems

References

Harman, M., Mansouri, S. A., Zhang, Y. (2012). Search-based software engineering. ACM Computing Surveys, 45 (1), 1–61. https://doi.org/10.1145/2379776.2379787
McIntosh, S., Kamei, Y., Adams, B., Hassan, A. E. (2015). An empirical study of the impact of modern code review practices on software quality. Empirical Software Engineering, 21 (5), 2146–2189. https://doi.org/10.1007/s10664-015-9381-9
Ammann, P., Offutt, J. (2016). Introduction to Software Testing. Cambridge University Press, 768. Available at: https://www.cambridge.org/highereducation/books/introduction-to-software-testing/95E57CCADEA697EC8594F03729F47311
Amalfitano, D., Faralli, S., Hauck, J. C. R., Matalonga, S., Distante, D. (2023). Artificial Intelligence Applied to Software Testing: A Tertiary Study. ACM Computing Surveys, 56 (3), 1–38. https://doi.org/10.1145/3616372
Richardson, L., Amundsen, M. (2013). RESTful Web APIs. O’Reilly Media, 406. Available at: https://www.oreilly.com/library/view/restful-web-apis/9781449359713/
Cerny, T., Donahoo, M. J., Trnka, M. (2018). Contextual understanding of microservice architecture. ACM SIGAPP Applied Computing Review, 17 (4), 29–45. https://doi.org/10.1145/3183628.3183631
ISTQB Certified Tester Foundation Level Syllabus. Version 4.0 (2023). International Software Testing Qualifications Board (ISTQB). Available at: https://istqb.org/istqb-releases-certified-tester-foundation-level-v4-0-ctfl/
Garousi, V., Felderer, M., Hacaloğlu, T. (2017). Software test maturity assessment and test process improvement: A multivocal literature review. Information and Software Technology, 85, 16–42. https://doi.org/10.1016/j.infsof.2017.01.001
Kim, M., Xin, Q., Sinha, S., Orso, A. (2022). Automated test generation for REST APIs: no time to rest yet. Proceedings of the 31st ACM SIGSOFT International Symposium on Software Testing and Analysis, 289–301. https://doi.org/10.1145/3533767.3534401
Corradini, D., Zampieri, A., Pasqua, M., Viglianisi, E., Dallago, M., Ceccato, M. (2022). Automated black‐box testing of nominal and error scenarios in RESTful APIs. Software Testing, Verification and Reliability, 32 (5). https://doi.org/10.1002/stvr.1808
Utting, M., Pretschner, A., Legeard, B. (2011). A taxonomy of model‐based testing approaches. Software Testing, Verification and Reliability, 22 (5), 297–312. https://doi.org/10.1002/stvr.456
Utting, M., Legeard, B. (2010). Practical Model-Based Testing: A Tools Approach. Morgan Kaufmann, 424. Available at: https://www.sciencedirect.com/book/9780123725011/practical-model-based-testing
Arcuri, A., Briand, L. (2012). Formal Analysis of the Probability of Interaction Fault Detection Using Random Testing. IEEE Transactions on Software Engineering, 38 (5), 1088–1099. https://doi.org/10.1109/tse.2011.85
Arcuri, A., Briand, L. (2011). Adaptive random testing. Proceedings of the 2011 International Symposium on Software Testing and Analysis, 265–275. https://doi.org/10.1145/2001420.2001452
Golmohammadi, A., Zhang, M., Arcuri, A. (2023). Testing RESTful APIs: A Survey. ACM Transactions on Software Engineering and Methodology, 33 (1), 1–41. https://doi.org/10.1145/3617175
Chen, Y., Hu, Z., Zhi, C., Han, J., Deng, S., Yin, J. (2024). ChatUniTest: A Framework for LLM-Based Test Generation. Companion Proceedings of the 32nd ACM International Conference on the Foundations of Software Engineering, 572–576. https://doi.org/10.1145/3663529.3663801
Tang, Y., Liu, Z., Zhou, Z., Luo, X. (2024). ChatGPT vs SBST: A Comparative Assessment of Unit Test Suite Generation. IEEE Transactions on Software Engineering, 50 (6), 1340–1359. https://doi.org/10.1109/tse.2024.3382365
Kim, M., Corradini, D., Sinha, S., Orso, A., Pasqua, M., Tzoref-Brill, R., Ceccato, M. (2023). Enhancing REST API Testing with NLP Techniques. Proceedings of the 32nd ACM SIGSOFT International Symposium on Software Testing and Analysis, 1232–1243. https://doi.org/10.1145/3597926.3598131
Atlidakis, V., Godefroid, P., Polishchuk, M. (2019). RESTler: Stateful REST API Fuzzing. 2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE), 748–758. https://doi.org/10.1109/icse.2019.00083
Arcuri, A. (2019). RESTful API Automated Test Case Generation with EvoMaster. ACM Transactions on Software Engineering and Methodology, 28 (1), 1–37. https://doi.org/10.1145/3293455

Development and verification of an orchestration architecture of AI agents for automated API testing with a unified representation of requirements

Authors

DOI:

Keywords:

Abstract

Author Biography

Maksym Moskalenko, Educational and Scientific Institute “Ukrainian State University of Chemical Technology”

References

Downloads

Published

How to Cite

Issue

Section

License

Information site

Language

Information

Developed By

Current Issue