Prioritization of Regression Test Cases Based on Machine Learning Methods

Cited by: 0
Authors
Kiran, Selcuk [1]
Emre, Ilkim Ecem [1]
Tasdelen, Selen [1]
Affiliations
[1] Marmara Univ, Fac Business Adm, Dept Management Informat Syst, Istanbul, Turkiye
Source
GAZI UNIVERSITY JOURNAL OF SCIENCE | 2025 / Vol. 38 / Issue 01
Keywords
Machine learning; Natural language processing; Software testing; Regression testing; Test case prioritization
DOI
10.35378/gujs.1446469
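Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]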
Discipline classification code
07 ; 0710 ; 09 ;
Abstract
Because of the resource and time constraints of the software testing process, it is not possible to execute every test scenario for each release. Test scenarios can be prioritized according to criteria defined by the developers so that testing runs effectively and errors are detected. This study investigated whether machine learning based models can be used to prioritize the test scenarios created for regression testing, and sought to determine which tests should be prioritized for execution based on different independent variables. Each of the 964 test scenarios in the dataset was labelled as minor (482) or major (482) by two experts. In the models, the number of related requirements, the number of related errors, and the age of the scenario were used as independent variables, and the minor/major class label was the target variable. The scenario texts were pre-processed using natural language processing techniques, and different machine learning algorithms were used for model development. In the classification based on the test scenario texts, the random forest algorithm performed best, with an F1-score of 81%. In the classification based on the number of related requirements, the number of related errors, and the age of the test scenarios, the random forest model again achieved the highest success rate, at 79%. The study demonstrates that machine learning techniques offer a variety of models for test case prioritization.
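The evaluation and prioritization step described in the abstract can be sketched in plain Python. This is only an illustration under stated assumptions: the helper names `f1_for_class` and `prioritize`, the scenario IDs, and the labels below are hypothetical and invented for the example. They are not data, code, or results from the study, and no classifier is trained here.

```python
def f1_for_class(y_true, y_pred, positive="major"):
    """Return (precision, recall, F1) for the `positive` class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


def prioritize(scenario_ids, predicted):
    """Order scenarios so those predicted 'major' run first.

    sorted() is stable, so the original order is kept within each class.
    """
    ranked = sorted(zip(scenario_ids, predicted), key=lambda x: x[1] != "major")
    return [scenario_id for scenario_id, _ in ranked]


# Hypothetical expert labels and model predictions (illustration only).
scenario_ids = ["T1", "T2", "T3", "T4", "T5", "T6"]
y_true = ["major", "major", "major", "minor", "minor", "minor"]
y_pred = ["major", "major", "minor", "minor", "minor", "major"]

precision, recall, f1 = f1_for_class(y_true, y_pred)
execution_order = prioritize(scenario_ids, y_pred)
```

In a real pipeline, `y_pred` would come from a trained classifier such as a random forest, and the F1-score would be computed on a held-out test split rather than on a toy list.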
Pages: 131-144
Page count: 14