Identify Severity Bug Report with Distribution Imbalance by CR-SMOTE and ELM

被引:122
作者
Guo, Shikai [1 ]
Chen, Rong [1 ]
Li, Hui [1 ]
Zhang, Tianlun [1 ]
Liu, Yaqing [1 ,2 ]
机构
[1] Dalian Maritime Univ, Coll Informat Sci & Technol, Dalian 116026, Peoples R China
[2] Jilin Univ, Minist Educ, Key Lab Symbol Computat & Knowledge Engn, Changchun 130012, Jilin, Peoples R China
基金
中国国家自然科学基金;
关键词
Bug reports; extreme learning machine; imbalanced distribution; severity; software application testing; EXTREME LEARNING-MACHINE; CLASSIFICATION; ALGORITHMS; PRIORITY;
D O I
10.1142/S0218194019500074
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Manually inspecting bugs to determine their severity is often an enormous but essential software development task, especially when many participants generate a large number of bug reports in a crowdsourced software testing context. Therefore, boosting the capabilities of methods of predicting bug report severity is critically important for determining the priority of fixing bugs. However, typical classification techniques may be adversely affected when the severity distribution of the bug reports is imbalanced, leading to performance degradation in a crowdsourcing environment. In this study, we propose an enhanced oversampling approach called CR-SMOTE to enhance the classification of bug reports with a realistically imbalanced severity distribution. The main idea is to interpolate new instances into the minority category that are near the center of existing samples in that category. Then, we use an extreme learning machine (ELM) - a feedforward neural network with a single layer of hidden nodes - to predict the bug severity. Several experiments were conducted on three datasets from real bug repositories, and the results statistically indicate that the presented approach is robust against real data imbalance when predicting the severity of bug reports. The average accuracies achieved by the ELM in predicting the severity of Eclipse, Mozilla, and GNOME bug reports were 0.780, 0.871, and 0.861, which are higher than those of classifiers by 4.36%, 6.73%, and 2.71%, respectively.
引用
收藏
页码:139 / 175
页数:37
相关论文
共 53 条
  • [1] [Anonymous], 2014, INT J ARTIF INTELL A
  • [2] [Anonymous], 2014, P 11 WORKING C MININ, DOI 10.1145/2597073.2597099
  • [3] [Anonymous], PATTERN RECOGN LETT
  • [4] [Anonymous], P ACM IEEE INT S EMP
  • [5] [Anonymous], 2014, PROC 6 INT C SIGNAL
  • [6] Antoniol G., 2008, C CTR ADV STUD MINDS, P304, DOI 10.1145/1463788.1463819
  • [7] Reducing the Effort of Bug Report Triage: Recommenders for Development-Oriented Decisions
    Anvik, John
    Murphy, Gail C.
    [J]. ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2011, 20 (03)
  • [8] An experimental comparison of classification algorithms for imbalanced credit scoring data sets
    Brown, Iain
    Mues, Christophe
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (03) : 3446 - 3453
  • [9] SMOTE: Synthetic minority over-sampling technique
    Chawla, Nitesh V.
    Bowyer, Kevin W.
    Hall, Lawrence O.
    Kegelmeyer, W. Philip
    [J]. 2002, American Association for Artificial Intelligence (16)
  • [10] Chen N, 2012, IEEE INT CONF AUTOM, P140, DOI 10.1145/2351676.2351697