TY - GEN
T1 - Detection of Fault-Prone Classes Using Logistic Regression Based Object-Oriented Metrics Thresholds
AU - Hussain, Shahid
AU - Keung, Jacky
AU - Khan, Arif Ali
AU - Bennin, Kwabena Ebo
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2016/9/21
Y1 - 2016/9/21
N2 - Background: In the plethora of studies, the objectorientedmetrics have been empirically validated to assess thedesign properties and quantify the high-level quality attributessuch as fault-proneness, either at the method or class granularitylevels of software. Motivation: A more precise value of an objectorientedmetric can be used as an indicator for the developers tomake the informed decisions regarding the detection of designflaws and classify the fault-proneness classes. Method: Benderused an approach in the domain of epidemiology studies to derivethe threshold values for the risk factors. In our study, we followthe Bender's approach and propose a model to derive thethresholds for a set of software design metrics via non-linearfunctions, which are described through logistic regressioncoefficients. Subsequently, we perform four types of analysis andthree experiments in order to evaluate and compare theeffectiveness of derived thresholds in the domain of classificationof fault proneness classes. We use the Precision, Recall, Fmeasureand classification accuracy performance measures toassess the effectiveness of derived metrics thresholds. Results: Wecompare the derive threshold values of DIT, CA, LCOM andNPM metrics with their existing data distribution basedthreshold values, and observed the significant increase in theclassification accuracy of fault-prone classes. For example, DIT(27%), Ca (2%), NPM (2%) and LCOM (15%) for the Ant-1.5project. Conclusion: The analysis results suggest that theproposed model can be applied to derive the thresholds of otherobject-oriented metrics which present either with or withoutheavy-tailed distribution, however, the proposed model to derivethresholds cannot generalize for all the systems due to variationin data characteristics.
AB - Background: In the plethora of studies, the objectorientedmetrics have been empirically validated to assess thedesign properties and quantify the high-level quality attributessuch as fault-proneness, either at the method or class granularitylevels of software. Motivation: A more precise value of an objectorientedmetric can be used as an indicator for the developers tomake the informed decisions regarding the detection of designflaws and classify the fault-proneness classes. Method: Benderused an approach in the domain of epidemiology studies to derivethe threshold values for the risk factors. In our study, we followthe Bender's approach and propose a model to derive thethresholds for a set of software design metrics via non-linearfunctions, which are described through logistic regressioncoefficients. Subsequently, we perform four types of analysis andthree experiments in order to evaluate and compare theeffectiveness of derived thresholds in the domain of classificationof fault proneness classes. We use the Precision, Recall, Fmeasureand classification accuracy performance measures toassess the effectiveness of derived metrics thresholds. Results: Wecompare the derive threshold values of DIT, CA, LCOM andNPM metrics with their existing data distribution basedthreshold values, and observed the significant increase in theclassification accuracy of fault-prone classes. For example, DIT(27%), Ca (2%), NPM (2%) and LCOM (15%) for the Ant-1.5project. Conclusion: The analysis results suggest that theproposed model can be applied to derive the thresholds of otherobject-oriented metrics which present either with or withoutheavy-tailed distribution, however, the proposed model to derivethresholds cannot generalize for all the systems due to variationin data characteristics.
UR - http://www.scopus.com/inward/record.url?scp=84991781027&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84991781027&partnerID=8YFLogxK
U2 - 10.1109/QRS-C.2016.16
DO - 10.1109/QRS-C.2016.16
M3 - Conference contribution
AN - SCOPUS:84991781027
T3 - Proceedings - 2016 IEEE International Conference on Software Quality, Reliability and Security-Companion, QRS-C 2016
SP - 93
EP - 100
BT - Proceedings - 2016 IEEE International Conference on Software Quality, Reliability and Security-Companion, QRS-C 2016
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2nd IEEE International Conference on Software Quality, Reliability and Security-Companion, QRS-C 2016
Y2 - 1 August 2016 through 3 August 2016
ER -