Open Access Open Access  Restricted Access Subscription Access

A Machine Learning Ensemble–Based Data-Driven Model for Software Reliability Growth

Saurabh Pandey, Priyanka Sahani

Abstract


Software reliability growth modelling is an essential aspect of evaluating the quality and dependability of software systems throughout both testing and operational phases [1][2]. Conventional and statistical Software Reliability Growth Models (SRGMs), while well established in theory, often face limitations when dealing with the complex, nonlinear, and data-driven behaviour observed in real software failure patterns. To address these challenges, this study proposes a purely data-driven software reliability growth model using machine learning ensemble methods. The proposed framework combines bagging- and boosting-based algorithms, including Random Forest, AdaBoost, and Gradient Boosting, to effectively learn failure trends from historical inter-failure time and cumulative failure data.

Comprehensive experiments are performed using widely used public software reliability datasets. The experimental results indicate that ensemble-based models consistently achieve higher prediction accuracy and better generalisation performance compared to traditional machine learning approaches and classical regression-based models. These findings highlight the strength of ensemble learning in capturing complex reliability growth behaviours, especially when dealing with limited or noisy datasets. Overall, the proposed model will provide a reliable and scalable solution for software reliability assessment and support informed decision-making in practical software quality assurance environments.


Full Text:

PDF

References


J. D. Musa, Software Reliability Engineering, McGraw-Hill, 2004.

A. L. Goel and K. Okumoto, “Time-Dependent Error Detection Rate Model for Software Reliability,” IEEE Transactions on Reliability, 1979.

B. Littlewood and J. L. Verrall, “A Bayesian Reliability Growth Model,” Applied Statistics, 1973.

R. Malhotra, “A Systematic Review of Machine Learning Techniques for Software Fault Prediction,” Applied Soft Computing, 2015.

R. Malhotra and Y. Singh, “On the Applicability of Machine Learning Techniques for Object-Oriented Software Fault Prediction,” Software Engineering Journal, 2011.

T. Gyimóthy, R. Ferenc, and I. Siket, “Empirical Validation of Object-Oriented Metrics on Open Source Software for Fault Prediction,” IEEE Transactions on Software Engineering, 2005.

L. Breiman, “Random Forests,” Machine Learning, 2001.

Y. Freund and R. Schapire, “A Decision-Theoretic Generalization of Online Learning and an Application to Boosting,” Journal of Computer and System Sciences, 1997.

J. H. Friedman, “Greedy Function Approximation: A Gradient Boosting Machine,” Annals of Statistics, 2001.

V. Vapnik, Statistical Learning Theory, Wiley, 1998.


Refbacks

  • There are currently no refbacks.