An Analysis of Software Bug Reports Using Random Forest

被引:1
|
作者
Ha Manh Tran [1 ]
Sinh Van Nguyen [1 ]
Synh Viet Uyen Ha [1 ]
Thanh Quoc Le [1 ]
机构
[1] Vietnam Natl Univ, Int Univ, Comp Sci & Engn, Ho Chi Minh City, Vietnam
来源
FUTURE DATA AND SECURITY ENGINEERING, FDSE 2018 | 2018年 / 11251卷
关键词
Random forest; Decision tree; Software bug report; Network fault detection; Fault management; FAULT-TREE ANALYSIS; SEARCH;
D O I
10.1007/978-3-030-03192-3_21
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Bug tracking systems manage bug reports for assuring the quality of software products. A bug report also referred as trouble, problem, ticket or defect contains several features for problem management and resolution purposes. Severity and priority are two essential features of a bug report that define the effect level and fixing order of the bug. Determining these features is challenging and depends heavily on human being, e.g., software developers or system operators, especially for assessing a large number of error and warning events occurring on software products or network services. This study proposes an approach of using random forest for assessing severity and priority for software bug reports automatically. This approach aims at constructing multiple decision trees based on the subsets of the existing bug dataset and features, and then selecting the best decision trees to assess the severity and priority of new bugs. The approach can be applied for detecting and forecasting faults in large, complex communication networks and distributed systems today. We have presented the applicability of random forest for bug report analysis and performed several experiments on software bug datasets obtained from open source bug tracking systems. Random forest yields an average accuracy score of 0.75 that can be sufficient for assisting system operators in determining these features. We have provided some analysis of the experimental results.
引用
收藏
页码:273 / 285
页数:13
相关论文
共 50 条
  • [31] PCirc: random forest-based plant circRNA identification software
    Yin, Shuwei
    Tian, Xiao
    Zhang, Jingjing
    Sun, Peisen
    Li, Guanglin
    BMC BIOINFORMATICS, 2021, 22 (01)
  • [32] Analyzing Key Features of Open Source Software Survivability with Random Forest
    Park, Sohee
    Kwon, Gihwon
    APPLIED SCIENCES-BASEL, 2025, 15 (02):
  • [33] Trend analysis and forecasting of streamflow using random forest in the Punarbhaba River basin
    Swapan Talukdar
    Swades Pal
    Mohd Waseem Shahfahad
    Ayesha Naikoo
    Atiqur Parvez
    Environmental Monitoring and Assessment, 2023, 195
  • [34] Season wise bike sharing demand analysis using random forest algorithm
    Sathishkumar, V. E.
    Cho, Yongyun
    COMPUTATIONAL INTELLIGENCE, 2024, 40 (01)
  • [35] Study on the Effect of Training Data Sampling Strategy on the Accuracy of the Landslide Susceptibility Analysis using Random Forest Method
    Kang, Kyoung-Hee
    Park, Hyuck-Jin
    ECONOMIC AND ENVIRONMENTAL GEOLOGY, 2019, 52 (02): : 199 - 212
  • [36] Analysis of NBA Players and Shot Prediction Using Random Forest and XGBoost Models
    Oughali, Maram Shikh
    Bahloul, Mariah
    El Rahman, Sahar A.
    2019 INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCES (ICCIS), 2019, : 157 - 161
  • [37] Susceptibility Analysis of Earthquake-Induced Landslide Using Random Forest Method
    Miao, T. Y.
    Wang, M.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTER INFORMATION SYSTEMS AND INDUSTRIAL APPLICATIONS (CISIA 2015), 2015, 18 : 771 - 775
  • [38] Trend analysis and forecasting of streamflow using random forest in the Punarbhaba River basin
    Talukdar, Swapan
    Pal, Swades
    Shahfahad
    Naikoo, Mohd Waseem
    Parvez, Ayesha
    Rahman, Atiqur
    ENVIRONMENTAL MONITORING AND ASSESSMENT, 2023, 195 (01)
  • [39] Analysis of Significant Factors for Dengue Infection Prognosis Using the Random Forest Classifier
    Fathima, A. Shameem
    Manimeglai, D.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2015, 6 (02) : 240 - 245
  • [40] ECG Biometric Identification Using Wavelet Analysis Coupled with Probabilistic Random Forest
    Tan, Robin
    Perkowski, Marek
    2016 15TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2016), 2016, : 182 - 187