|
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
|
| Volume 187 - Issue 87 |
| Published: March 2026 |
| Authors: Shraddhaben R. Gajjar |
10.5120/ijca2026926515
|
Shraddhaben R. Gajjar . AI-Driven Predictive Resource Management for Scalable and Resilient Cloud Infrastructure. International Journal of Computer Applications. 187, 87 (March 2026), 45-49. DOI=10.5120/ijca2026926515
@article{ 10.5120/ijca2026926515,
author = { Shraddhaben R. Gajjar },
title = { AI-Driven Predictive Resource Management for Scalable and Resilient Cloud Infrastructure },
journal = { International Journal of Computer Applications },
year = { 2026 },
volume = { 187 },
number = { 87 },
pages = { 45-49 },
doi = { 10.5120/ijca2026926515 },
publisher = { Foundation of Computer Science (FCS), NY, USA }
}
%0 Journal Article
%D 2026
%A Shraddhaben R. Gajjar
%T AI-Driven Predictive Resource Management for Scalable and Resilient Cloud Infrastructure%T
%J International Journal of Computer Applications
%V 187
%N 87
%P 45-49
%R 10.5120/ijca2026926515
%I Foundation of Computer Science (FCS), NY, USA
Cloud infrastructure underpins modern healthcare systems, financial platforms, artificial-intelligence services, and public-sector applications. In these environments, infrastructure managers must continuously balance service-level objectives against rising compute cost. Reactive autoscaling remains the dominant operational mechanism in practice, but threshold-based policies expand capacity only after congestion has already appeared, which can produce latency spikes, brief SLA violations, and persistent over-provisioning. This paper presents a practical AI-driven predictive resource management framework for scalable and resilient cloud infrastructure. The framework combines workload forecasting, multivariate anomaly detection, policy-aware decision logic, and cloud-native orchestration to allocate resources before demand peaks occur. A prototype evaluation using representative cyclical and bursty workloads compares static provisioning, reactive autoscaling, and predictive scaling. The predictive approach reduces total compute-hours by 22.2% versus reactive autoscaling and 36.4% versus static provisioning, while improving SLA compliance to 99.1% and increasing average utilization to 76%. The paper also discusses design trade-offs, deployment constraints, and portability across Kubernetes-based environments. The results suggest that predictive resource management can materially improve both resilience and cost efficiency when integrated with disciplined observability and automated control loops.