Manifesting Bugs in Machine Learning Code: An Explorative Study with Mutation Testing

被引:24
作者
Cheng, Dawei [1 ]
Cao, Chun [1 ]
Xu, Chang [1 ]
Ma, Xiaoxing [1 ]
机构
[1] Nanjing Univ, Inst Comp Software, State Key Lab Novel Software Technol, Nanjing, Peoples R China
来源
2018 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY (QRS 2018) | 2018年
基金
国家重点研发计划;
关键词
machine learning programs; mutation testing; explorative study;
D O I
10.1109/QRS.2018.00044
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Nowadays statistical machine learning is widely adopted in various domains such as data mining, image recognition and automated driving. However, software quality assurance for machine learning is still in its infancy. While recent efforts have been put into improving the quality of training data and trained models, this paper focuses on code-level bugs in the implementations of machine learning algorithms. In this explorative study we simulated program bugs by mutating Weka implementations of several classification algorithms. We observed that 8%-40% of the logically non-equivalent executable mutants were statistically indistinguishable from their golden versions. Moreover, other 15%-36% of the mutants were stubborn, as they performed not significantly worse than a reference classifier on at least one natural data set. We also experimented with several approaches to killing those stubborn mutants. Preliminary results indicate that bugs in machine learning code may have negative impacts on statistical properties such as robustness and learning curves, but they could be very difficult to detect, due to the lack of effective oracles.
引用
收藏
页码:313 / 324
页数:12
相关论文
共 49 条
  • [21] Predicting higher order mutation score based on machine learning
    Do, Van-Nho
    Nguyen, Quang-Vu
    Nguyen, Thanh-Binh
    JOURNAL OF INFORMATION AND TELECOMMUNICATION, 2024, 8 (01) : 57 - 70
  • [22] Locating and Correcting Software Faults in Executable Code Slices via Evolutionary Mutation Testing
    Yiasemis, Pantelis Stylianos
    Andreou, Andreas S.
    ENTERPRISE INFORMATION SYSTEMS, ICEIS 2012, 2013, 141 : 207 - 227
  • [23] Mutation Testing of Deep Reinforcement Learning Based on Real Faults
    Tambon, Florian
    Majdinasab, Vahid
    Nikanjam, Amin
    Khomh, Foutse
    Antoniol, Giuliano
    2023 IEEE CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION, ICST, 2023, : 188 - 198
  • [24] Boundary sampling to boost mutation testing for deep learning models
    Shen, Weijun
    Li, Yanhui
    Han, Yuanlei
    Chen, Lin
    Wu, Di
    Zhou, Yuming
    Xu, Baowen
    INFORMATION AND SOFTWARE TECHNOLOGY, 2021, 130 (130)
  • [25] A fine-grained evaluation of mutation operators to boost mutation testing for deep learning systems
    Zhang, Zhiyi
    Wang, Yichun
    Yao, Yongming
    Wang, Ziyuan
    Huang, Zhiqiu
    EMPIRICAL SOFTWARE ENGINEERING, 2025, 30 (03)
  • [26] A (Reverse) Mutation Testing Approach to Automatically generate parallel C/C plus plus Code
    Calanna, Salvatore
    Calvagna, Andrea
    Campisi, Salvatore
    Fornaia, Andrea
    Pappalardo, Giuseppe
    Torrisi, Riccardo
    Tramontana, Emiliano
    2021 IEEE 30TH INTERNATIONAL CONFERENCE ON ENABLING TECHNOLOGIES: INFRASTRUCTURE FOR COLLABORATIVE ENTERPRISES (WETICE 2021), 2021, : 159 - 164
  • [27] DeepCrime: Mutation Testing of Deep Learning Systems Based on Real Faults
    Humbatova, Nargiz
    Jahangirova, Gunel
    Tonella, Paolo
    ISSTA '21: PROCEEDINGS OF THE 30TH ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS, 2021, : 67 - 78
  • [28] DeepCrime: from Real Faults to Mutation Testing Tool for Deep Learning
    Humbatova, Nargiz
    Jahangirova, Gunel
    Tonella, Paolo
    2023 IEEE/ACM 45TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: COMPANION PROCEEDINGS, ICSE-COMPANION, 2023, : 68 - 72
  • [29] A Comparative Study of Whole Issues and Challenges in Mutation Testing
    Bokaei, Negar Nazem
    Keyvanpour, Mohammad Reza
    2019 IEEE 5TH CONFERENCE ON KNOWLEDGE BASED ENGINEERING AND INNOVATION (KBEI 2019), 2019, : 745 - 754
  • [30] Evaluation of Mutation Testing in a Nuclear Industry Case Study
    Delgado-Perez, Pedro
    Habli, Ibrahim
    Gregory, Steve
    Alexander, Rob
    Clark, John
    Medina-Bulo, Inmaculada
    IEEE TRANSACTIONS ON RELIABILITY, 2018, 67 (04) : 1406 - 1419