• KSII Transactions on Internet and Information Systems
    Monthly Online Journal (eISSN: 1976-7277)

Impact of Resample and Ensembles on Software Maintainability Prediction

Vol. 19, No. 8, August 31, 2025
10.3837/tiis.2025.08.008, Download Paper (Free):

Abstract

Software maintainability prediction (SMP) ensures long-term software sustainability and minimizes development costs. However, data imbalance often leads to biased classifications and inaccurate predictions. This research proposes a comprehensive approach to address SMP data imbalance and accuracy issues. The method utilizes Resample, Class Balancer, and Spread Subsample techniques to balance a large, unbalanced dataset comprising seven software packages: LUCENE, JDT, PDE, EQUINOX, MYLENE, UIMS, and QUES. We evaluated six machine learning models for the training of SMP models: Support Vector Machine, Decision Tree, Naïve Bayes, k-nearest Neighbour, and two ensemble methods (Bagged Tree and Boosted Tree). TOPSIS analysis identifies the Bagged Tree with Resample as the best-performing model, achieving a maximum accuracy of 95.7%. Shapley Additive Explanations and Local Interpretable Model-Agnostic Explanations techniques analysed the interpretability of the best SMP model, revealing key predictors influencing software maintainability. Finally, the Mean Magnitude of Relative Error (MMRE) demonstrates that the proposed model outperforms existing studies. This work confirms its effectiveness, reliability, and practical use in software engineering.


Statistics

Show / Hide Statistics

Statistics (Cumulative Counts from December 1st, 2015)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article

[IEEE Style]
R. Yadav and R. Singh, "Impact of Resample and Ensembles on Software Maintainability Prediction," KSII Transactions on Internet and Information Systems, vol. 19, no. 8, pp. 2548-2568, 2025. DOI: 10.3837/tiis.2025.08.008.

[ACM Style]
Rohit Yadav and Raghuraj Singh. 2025. Impact of Resample and Ensembles on Software Maintainability Prediction. KSII Transactions on Internet and Information Systems, 19, 8, (2025), 2548-2568. DOI: 10.3837/tiis.2025.08.008.

[BibTeX Style]
@article{tiis:103077, title="Impact of Resample and Ensembles on Software Maintainability Prediction", author="Rohit Yadav and Raghuraj Singh and ", journal="KSII Transactions on Internet and Information Systems", DOI={10.3837/tiis.2025.08.008}, volume={19}, number={8}, year="2025", month={August}, pages={2548-2568}}