Combining Spreadsheet Smells for Improved Fault Prediction

被引:5
作者
Koch, Patrick [1 ]
Schekotihin, Konstantin [1 ]
Jannach, Dietmar [1 ]
Hofer, Birgit [2 ]
Wotawa, Franz [2 ]
Schmitz, Thomas [3 ]
机构
[1] AAU Klagenfurt, Klagenfurt, Austria
[2] Graz Univ Technol, Graz, Austria
[3] TU Dortmund, Dortmund, Germany
来源
2018 IEEE/ACM 40TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: NEW IDEAS AND EMERGING TECHNOLOGIES RESULTS (ICSE-NIER) | 2018年
基金
奥地利科学基金会;
关键词
Spreadsheet Smells; Spreadsheet QA; Fault Prediction;
D O I
10.1145/3183399.3183402
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Spreadsheets are commonly used in organizations as a programming tool for business-related calculations and decision making. Since faults in spreadsheets can have severe business impacts, a number of approaches from general software engineering have been applied to spreadsheets in recent years, among them the concept of code smells. Smells can in particular be used for the task of fault prediction. An analysis of existing spreadsheet smells, however, revealed that the predictive power of individual smells can be limited. In this work we therefore propose a machine learning based approach which combines the predictions of individual smells by using an AdaBoost ensemble classifier. Experiments on two public datasets containing real-world spreadsheet faults show significant improvements in terms of fault prediction accuracy.
引用
收藏
页码:25 / 28
页数:4
相关论文
共 18 条
  • [1] Smelling Faults in Spreadsheets
    Abreu, Rui
    Cunha, Jacome
    Fernandes, Joao Paulo
    Martins, Pedro
    Perez, Alexandre
    Saraiva, Joao
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME), 2014, : 111 - 120
  • [2] Large-Scale Machine Learning with Stochastic Gradient Descent
    Bottou, Leon
    [J]. COMPSTAT'2010: 19TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL STATISTICS, 2010, : 177 - 186
  • [3] Cunha J, 2012, LECT NOTES COMPUT SC, V7336, P202, DOI 10.1007/978-3-642-31128-4_15
  • [4] Ensemble methods in machine learning
    Dietterich, TG
    [J]. MULTIPLE CLASSIFIER SYSTEMS, 2000, 1857 : 1 - 15
  • [5] Comparing and experimenting machine learning techniques for code smell detection
    Fontana, Francesca Arcelli
    Mantyla, Mika V.
    Zanoni, Marco
    Marino, Alessandro
    [J]. EMPIRICAL SOFTWARE ENGINEERING, 2016, 21 (03) : 1143 - 1191
  • [6] Fowler M., 1999, Refactoring: Improving the design of existing code
  • [7] Freund Y., 1999, Journal of Japanese Society for Artificial Intelligence, V14, P771
  • [8] Improving Spectrum-based Fault Localization for Spreadsheet Debugging
    Getzner, Elisabeth
    Hofer, Birgit
    Wotawa, Franz
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY (QRS), 2017, : 102 - 113
  • [9] Enron's Spreadsheets and Related Emails: A Dataset and Analysis
    Hermans, Felienne
    Murphy-Hill, Emerson
    [J]. 2015 IEEE/ACM 37TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, VOL 2, 2015, : 7 - 16
  • [10] Hermans F, 2012, 2012 28TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE (ICSM), P409, DOI 10.1109/ICSM.2012.6405300