TY - GEN
T1 - Empirical investigation of fault predictors in context of class membership probability estimation
AU - Hussain, Shahid
AU - Khan, Arif Ali
AU - Bennin, Kwabena Ebo
N1 - Publisher Copyright:
© 2016 ACM.
PY - 2016/4/4
Y1 - 2016/4/4
N2 - In the domain of software fault prediction, class membership probability of a selected classifier and the factors related to its estimation can be considered as necessary information for tester to take informed decisions about software quality issues. The objective of this study is to empirically investigate the class membership probability estimation capability of 15 classifiers/fault predictors on 12 datasets of open source projects retrieved from PROMISE repository. We empirically validate the effect of dataset characteristics and set of metrics on the performance of classifiers in estimating the class membership probability. We used Receiver Operating Characteristics-Area under Curve (ROC-AUC) value and overall accuracy as benchmarks to evaluate and compare the performance of classifiers. We apply Friedman's, post-hoc Nemenyi and Analysis of Means (ANOM) test to compare the significant performance of classifiers. We conclude that ADTree and RandomForest outperform, while ZeroR classifier cannot show significant performance for estimation of class membership probability.
AB - In the domain of software fault prediction, class membership probability of a selected classifier and the factors related to its estimation can be considered as necessary information for tester to take informed decisions about software quality issues. The objective of this study is to empirically investigate the class membership probability estimation capability of 15 classifiers/fault predictors on 12 datasets of open source projects retrieved from PROMISE repository. We empirically validate the effect of dataset characteristics and set of metrics on the performance of classifiers in estimating the class membership probability. We used Receiver Operating Characteristics-Area under Curve (ROC-AUC) value and overall accuracy as benchmarks to evaluate and compare the performance of classifiers. We apply Friedman's, post-hoc Nemenyi and Analysis of Means (ANOM) test to compare the significant performance of classifiers. We conclude that ADTree and RandomForest outperform, while ZeroR classifier cannot show significant performance for estimation of class membership probability.
UR - http://www.scopus.com/inward/record.url?scp=84975853420&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84975853420&partnerID=8YFLogxK
U2 - 10.1145/2851613.2851973
DO - 10.1145/2851613.2851973
M3 - Conference contribution
AN - SCOPUS:84975853420
T3 - Proceedings of the ACM Symposium on Applied Computing
SP - 1550
EP - 1553
BT - 2016 Symposium on Applied Computing, SAC 2016
PB - Association for Computing Machinery
T2 - 31st Annual ACM Symposium on Applied Computing, SAC 2016
Y2 - 4 April 2016 through 8 April 2016
ER -