TY - GEN
T1 - Predicting bugs in software code changes using isolation forest
AU - He, Yueyang
AU - Zhu, Xiaoyan
AU - Wang, Guangtao
AU - Sun, Heli
AU - Wang, Yong
N1 - Publisher Copyright:
© 2017 IEEE.
PY - 2017/8/11
Y1 - 2017/8/11
N2 - Identifying bug immediately when it is introduced can help improve the validity and effectiveness of bug fixing. Predicting bugs in software code changes makes such identification possible. Buggy changes, changes that introduce bugs into source code, can be viewed as anomalies relative to clean changes for that they are rare and irregular. Thus, anomaly detection techniques can be applied to buggy change prediction. Isolation Forest, which detects anomalies based on the hypothesis that the anomalies have the shortest average path length on the constructed random forest, has exhibited its good performance on anomaly detection compared to other anomaly detection methods. In this paper, we adopt it in predicting bugs in software code changes. Empirical study with eight practical open source projects are conducted to validate the effective of Isolation Forest in bug prediction in software code changes. Results of the empirical study show that compared to traditional classification methods used in literature, Isolation Forest can achieve better clean precision, buggy recall, buggy F-measure, AUC and Gmean.
AB - Identifying bug immediately when it is introduced can help improve the validity and effectiveness of bug fixing. Predicting bugs in software code changes makes such identification possible. Buggy changes, changes that introduce bugs into source code, can be viewed as anomalies relative to clean changes for that they are rare and irregular. Thus, anomaly detection techniques can be applied to buggy change prediction. Isolation Forest, which detects anomalies based on the hypothesis that the anomalies have the shortest average path length on the constructed random forest, has exhibited its good performance on anomaly detection compared to other anomaly detection methods. In this paper, we adopt it in predicting bugs in software code changes. Empirical study with eight practical open source projects are conducted to validate the effective of Isolation Forest in bug prediction in software code changes. Results of the empirical study show that compared to traditional classification methods used in literature, Isolation Forest can achieve better clean precision, buggy recall, buggy F-measure, AUC and Gmean.
UR - https://www.scopus.com/pages/publications/85029435110
U2 - 10.1109/QRS.2017.40
DO - 10.1109/QRS.2017.40
M3 - 会议稿件
AN - SCOPUS:85029435110
T3 - Proceedings - 2017 IEEE International Conference on Software Quality, Reliability and Security, QRS 2017
SP - 296
EP - 305
BT - Proceedings - 2017 IEEE International Conference on Software Quality, Reliability and Security, QRS 2017
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 17th IEEE International Conference on Software Quality, Reliability and Security, QRS 2017
Y2 - 25 July 2017 through 29 July 2017
ER -