An Analysis of Software Bug Reports Using Random Forest

Cited by: 1
Authors
Ha Manh Tran [1]
Sinh Van Nguyen [1]
Synh Viet Uyen Ha [1]
Thanh Quoc Le [1]
Affiliation
[1] Vietnam Natl Univ, Int Univ, Comp Sci & Engn, Ho Chi Minh City, Vietnam
Source
FUTURE DATA AND SECURITY ENGINEERING, FDSE 2018 | 2018 / Vol. 11251
Keywords
Random forest; Decision tree; Software bug report; Network fault detection; Fault management; FAULT-TREE ANALYSIS; SEARCH
DOI
10.1007/978-3-030-03192-3_21
Chinese Library Classification number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
Bug tracking systems manage bug reports to assure the quality of software products. A bug report, also referred to as a trouble, problem, ticket, or defect report, contains several features for problem management and resolution purposes. Severity and priority are two essential features of a bug report that define the effect level and fixing order of the bug. Determining these features is challenging and depends heavily on human judgment, e.g., that of software developers or system operators, especially when assessing a large number of error and warning events occurring in software products or network services. This study proposes an approach that uses random forest to assess the severity and priority of software bug reports automatically. The approach constructs multiple decision trees from subsets of the existing bug dataset and features, and then selects the best decision trees to assess the severity and priority of new bugs. The approach can also be applied to detecting and forecasting faults in today's large, complex communication networks and distributed systems. We present the applicability of random forest to bug report analysis and report several experiments on software bug datasets obtained from open source bug tracking systems. Random forest yields an average accuracy score of 0.75, which can be sufficient for assisting system operators in determining these features. We also provide some analysis of the experimental results.
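
The idea summarized in the abstract, building many decision trees on random subsets of the bug data and features and then combining them to label a new bug, can be illustrated with a minimal sketch. This is not the authors' implementation: it uses the textbook random forest recipe (bootstrap samples, a random feature subset at each split, majority voting) rather than the tree selection step the abstract mentions, and the feature names (`component`, `os`, `has_crash_log`), the severity labels, and the toy records are all hypothetical.

```python
import random
from collections import Counter

def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def majority(labels):
    """Most frequent label in the list."""
    return Counter(labels).most_common(1)[0][0]

def build_tree(rows, labels, features, rng, depth=0, max_depth=3):
    """Grow one tree; each split only looks at a random subset of features."""
    if len(set(labels)) == 1 or depth == max_depth:
        return majority(labels)                      # leaf node
    k = max(1, int(len(features) ** 0.5))            # features tried per split
    best = None
    for f in rng.sample(features, k):
        for v in sorted({r[f] for r in rows}):       # binary test: r[f] == v
            left = [i for i, r in enumerate(rows) if r[f] == v]
            right = [i for i, r in enumerate(rows) if r[f] != v]
            if not left or not right:
                continue
            score = sum(len(side) * gini([labels[i] for i in side])
                        for side in (left, right))
            if best is None or score < best[0]:
                best = (score, f, v, left, right)
    if best is None:                                 # no useful split found
        return majority(labels)
    _, f, v, left, right = best
    grow = lambda idx: build_tree([rows[i] for i in idx],
                                  [labels[i] for i in idx],
                                  features, rng, depth + 1, max_depth)
    return (f, v, grow(left), grow(right))

def train_forest(rows, labels, n_trees=25, seed=0):
    """Train n_trees trees, each on a bootstrap sample of the bug reports."""
    rng = random.Random(seed)
    features = sorted(rows[0])
    forest = []
    for _ in range(n_trees):
        idx = [rng.randrange(len(rows)) for _ in rows]   # sample with replacement
        forest.append(build_tree([rows[i] for i in idx],
                                 [labels[i] for i in idx], features, rng))
    return forest

def predict(forest, row):
    """Majority vote over the predictions of all trees."""
    votes = []
    for node in forest:
        while isinstance(node, tuple):               # descend until a leaf
            f, v, left, right = node
            node = left if row[f] == v else right
        votes.append(node)
    return majority(votes)

# Hypothetical toy bug reports (not taken from the paper's datasets).
bugs = [
    {"component": "kernel", "os": "linux",   "has_crash_log": "yes"},
    {"component": "kernel", "os": "windows", "has_crash_log": "yes"},
    {"component": "ui",     "os": "linux",   "has_crash_log": "no"},
    {"component": "ui",     "os": "windows", "has_crash_log": "no"},
    {"component": "net",    "os": "linux",   "has_crash_log": "yes"},
    {"component": "net",    "os": "windows", "has_crash_log": "no"},
    {"component": "docs",   "os": "linux",   "has_crash_log": "no"},
    {"component": "kernel", "os": "linux",   "has_crash_log": "no"},
]
severity = ["critical", "critical", "minor", "minor",
            "major", "major", "minor", "major"]

forest = train_forest(bugs, severity, n_trees=25, seed=0)
new_bug = {"component": "kernel", "os": "linux", "has_crash_log": "yes"}
print(predict(forest, new_bug))
```

The same structure would apply to priority by swapping the label list. A real experiment would also need text features from the report summary and a held-out test set to measure accuracy, both of which this sketch leaves out.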
Pages: 273-285
Page count: 13