Open Access Open Access  Restricted Access Subscription Access

AI-Agent–Based Autonomous Cloud Orchestration for Multi-Tenant and Scalable SaaS Platforms

Bijal Tayade

Abstract


The fast development of multi-tenant Software-as-a-Service (SaaS) environments has increased the pressure on developing smart, scalable, and autonomous cloud infrastructure management. Classical rule-based orchestration and reactive auto-scaling systems cannot support dynamic workloads of heterogeneous resources and complex service-level objectives (SLOs). This paper introduces a proposed artificial intelligence agent (AI) framework of autonomous cloud orchestration created to provide the best optimization of resource requesting, workload placement, fault tolerance, and service scaling in multi-tenant SaaS settings.

 

The suggested approach combines reinforcement learning, using generative AI models, and self- adaptive control attempted within various layers of the cloud, Infrastructure-as-a-Service (IaaS), Platform-as-a-Service (PaaS), and that of container orchestration systems such as Kubernetes. The intelligent agents are constantly checking the telemetry of the systems and then forecasting changes in workload, identifying the anomaly, and automatically scaling the computing, storage, and network resources to guarantee performance isolation and cost-effectiveness among tenants.

 

Moreover, the framework has explainable decision modules, which will increase transparency in orchestration policy and boost trust in automated cloud operation. The model of autonomous orchestration based on AI-agents has been experimentally proven to yield better scalability, fewer SLA violations, lower power usage, and quicker fault recovery than traditional threshold- based and heuristic orchestration models, as well as to enable highly scalable and resilient SaaS systems.

 


Full Text:

PDF

References


Chen, Y., Liu, Z., & Wang, H. (2023). Autonomous cloud resource management using deep reinforcement learning. IEEE Transactions on Cloud Computing, 11(2), 1456–1468.

Verma, A., & Shenoy, P. (2022). AI-driven elastic scaling for multi-tenant SaaS applications. In Proceedings of the IEEE International Conference on Cloud Computing (CLOUD) (pp. 112–119).

Kumar, S., Buyya, R., & Beloglazov, A. (2022). Self-adaptive resource provisioning in cloud data centers using machine learning. IEEE Transactions on Services Computing, 15(4), 1890–1903.

Ghosh, M., et al. (2024). Generative AI for automated cloud configuration and optimization. In Proceedings of the IEEE International Conference on Autonomic Computing (ICAC) (pp. 55–64).

Li, T., & Xu, X. (2024). LLM-based infrastructure-as-code generation for cloud automation. In Proceedings of the IEEE International Conference on Big Data (pp. 2310–2318).

Patel, J., & Kant, K. (2023). Predictive auto-scaling in SaaS platforms using deep learning models. IEEE Access, 11, 84221–84234.

Singh, R., & Wang, L. (2023). Autonomous fault detection and recovery in cloud systems. IEEE Transactions on Network and Service Management, 20(3), 2104–2116.

Garlan, D., et al. (2022). Self-adaptive software systems: A decade of progress. ACM Transactions on Autonomous and Adaptive Systems, 15(4). Springer Nature.

Zhang, H., & Satyanarayanan, M. (2023). Edge–cloud synergy for scalable SaaS applications. Cluster Computing, 26, 1981–1995. Springer Nature.

Chen, F., et al. (2024). AI-driven observability and anomaly detection in cloud-native environments. IEEE Transactions on Cloud Computing, 12(1), 77–90.


Refbacks

  • There are currently no refbacks.