

AIOPS for Predictive Infrastructure Scaling
Abstract
Cloud-based applications rely on auto- scaling mechanisms to maintain performance and availability. However, traditional reactive scaling methods often lead to inefficient resource utilization, increased costs, and potential downtime. To address these challenges, this project proposes an AI-driven predictive infrastructure scaling system that utilizes Kubernetes, Prometheus, Python (ML), and hpa auto scaling.
This system leverages machine learning models to analyze historical and real-time infrastructure metrics to forecast demand spikes and failures before they occur. By proactively adjusting cloud resources, itensures optimized performance, minimal latency, and cost efficiency. The predictive model is built using time-series forecasting and anomaly detection techniques, enabling preemptive scaling of resources rather than reactive responses.
By implementing AIOps (Artificial Intelligence for IT Operations), this approach significantly reduces downtime, over-provisioning, and manual intervention, making cloud infrastructure management more autonomous and intelligent. The result is a highly resilient, self-scaling system capable of maintaining service continuity under dynamic workload.
References
Murthy, P., & Bobba, S. (2021, October). AI-powered predictive scaling in cloud computing: Enhancing efficiency through real-time workload forecasting. IRE Journals, 5(4), 143–151.
Díaz-de-Arcaya, J., Torre-Bastida, A. I., Miñón, R., & Almeida, A. (2022, October). Orfeon: An AIOps framework for the goal-driven operationalization of distributed analytical pipelines. Future Generation Computer Systems, 136, 348–365. https://doi.org/10.1016/j.future.2022.06.017 (Add DOI if available)
Dong, W. (2022, July). AIOps architecture in data center site infrastructure monitoring. Computational Intelligence and Neuroscience, 2022, 1–12. https://doi.org/10.1155/2022/xxxxxxx
Sivakumar, S. (2024, November). Agentic AI in predictive AIOps: Enhancing IT autonomy and performance. International Journal of Scientific Research and Management (IJSRM), 12(11), 1631–1638.
Chittala, S. (2024, September–October). AIOps in action: Automating AI deployment and management of large language models for scalable and ethical operations. International Journal for Multidisciplinary Research (IJFMR), 6(5), 1–8.
Refbacks
- There are currently no refbacks.