Mutation-Based White Box Testing of Deep Neural Networks

Cited by: 1
Authors
Cetiner, Gokhan [1]
Yayan, Ugur [2]
Yazici, Ahmet [1]
Affiliations
[1] Univ Eskisehir Osmangazi, Comp Engn Dept, TR-26040 Eskisehir, Turkiye
[2] Univ Eskisehir Osmangazi, Software Engn Dept, TR-26040 Eskisehir, Turkiye
Source
IEEE ACCESS | 2024 / Vol. 12
Keywords
Testing; Artificial neural networks; Robustness; Software testing; Long short term memory; Accuracy; Transformers; Predictive models; Libraries; Convolutional neural networks; Reinforcement learning; Convolutional neural network; deep neural networks; long short-term memory; machine learning; mutation-based testing; reinforcement learning; transformers;
DOI
10.1109/ACCESS.2024.3482114
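Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812 ;
Abstract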
Deep Neural Networks (DNNs) are used in many critical areas, such as autonomous vehicles and generative AI systems. Testing DNNs is therefore vital, especially for models deployed in such areas. Mutation-based testing is a successful technique for testing DNNs by mutating their complex structures. The Deep Mutation Module was developed to address mutation-based testing and the robustness challenges of DNNs. It analyses the structures of DNNs in detail and tests models by applying mutations to their parameters and structures using its fault library. Testing DNN structures and detecting faults is a highly complex and open-ended challenge. The method proposed in this study applies mutations to DNN parameters to expose faults and weaknesses in the models, thereby testing their robustness. The paper focuses on mutation-based tests of a Reinforcement Learning (RL) model developed for electric vehicle routing, a Long Short-Term Memory (LSTM) model developed for prognostic predictions, and a Transformer-based neural network model for electric vehicle routing tasks. The best mutation scores for the LSTM model were 96%, 91.02%, 71.19%, and 68.77%. The tests of the RL model yielded mutation scores of 93.20%, 72.13%, 77.47%, 79.28%, and 55.74%. The mutation scores of the Transformer model were 75.87%, 76.36%, and 74.93%. These results show that the module can successfully test the targeted models and generate mutants, classified as "survived mutants", that outperform the original models. In this way, it gives researchers critical information for improving the overall performance of the models. Conducting these tests before the models are used in real-world applications minimizes faults and maximizes model success.
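The abstract reports mutation scores but does not define how they are computed. As a rough illustration only, the sketch below assumes the common definition (killed mutants divided by total mutants, as a percentage) and applies it to a toy linear model. The Gaussian weight perturbation, the `tolerance` threshold for "killing" a mutant, and all function names are hypothetical choices made for this example; they are not taken from the paper or from the Deep Mutation Module.

```python
import numpy as np


def mse(weights, x, y):
    """Mean squared error of a toy linear model y ~ x @ weights."""
    return float(np.mean((x @ weights - y) ** 2))


def gaussian_mutants(weights, n_mutants, sigma, rng):
    """Yield perturbed copies of `weights` (a hypothetical mutation operator)."""
    for _ in range(n_mutants):
        yield weights + rng.normal(0.0, sigma, size=weights.shape)


def mutation_score(weights, x, y, n_mutants=100, sigma=0.5, tolerance=0.05, seed=0):
    """Return (score_percent, killed, survived).

    Assumption: a mutant is "killed" when its error exceeds the original
    model's error by more than `tolerance`; otherwise it "survives".
    """
    rng = np.random.default_rng(seed)
    baseline = mse(weights, x, y)
    killed = 0
    for mutant in gaussian_mutants(weights, n_mutants, sigma, rng):
        if mse(mutant, x, y) > baseline + tolerance:
            killed += 1
    survived = n_mutants - killed
    return 100.0 * killed / n_mutants, killed, survived


if __name__ == "__main__":
    data_rng = np.random.default_rng(42)
    x = data_rng.normal(size=(200, 3))
    true_w = np.array([1.0, -2.0, 0.5])
    y = x @ true_w  # noiseless targets, so the original model has zero error
    score, killed, survived = mutation_score(true_w, x, y)
    print(f"mutation score: {score:.2f}% ({killed} killed, {survived} survived)")
```

Under this assumed criterion, a survived mutant is one the test data cannot distinguish from the original model. The abstract additionally notes that some survived mutants outperformed the original models, which a comparison of each mutant's error against the baseline would reveal.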
Pages: 160156-160174
Page count: 19
Related papers
50 in total
  • [1] A White-Box Testing for Deep Neural Networks Based on Neuron Coverage
    Yu, Jing
    Duan, Shukai
    Ye, Xiaojun
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (11) : 9185 - 9197
  • [2] DeepCNP: An efficient white-box testing of deep neural networks by aligning critical neuron paths
    Liu, Weiguang
    Luo, Senlin
    Pan, Limin
    Zhang, Zhao
    INFORMATION AND SOFTWARE TECHNOLOGY, 2025, 179
  • [3] Testing for Multiple Faults in Deep Neural Networks
    Moussa, Dina A.
    Hefenbrock, Michael
    Tahoori, Mehdi
    IEEE DESIGN & TEST, 2024, 41 (03) : 47 - 53
  • [4] A FORTRAN LANGUAGE SYSTEM FOR MUTATION-BASED SOFTWARE TESTING
    KING, KN
    OFFUTT, AJ
    SOFTWARE-PRACTICE & EXPERIENCE, 1991, 21 (07) : 685 - 718
  • [5] Deep Drug Synergy Prediction Network Using Modified Triangular Mutation-Based Differential Evolution
    Singh, Dilbag
    Alzubi, Ahmad Ali
    Kaur, Manjit
    Kumar, Vijay
    Lee, Heung-No
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2025, 29 (01) : 669 - 678
  • [6] A New Approach Based on Deep Features of Convolutional Neural Networks for Partial Discharge Detection in Power Systems
    Eristi, Belkis
    IEEE ACCESS, 2024, 12 : 117026 - 117039
  • [7] Black-Box Testing of Deep Neural Networks through Test Case Diversity
    Aghababaeyan, Zohreh
    Abdellatif, Manel
    Briand, Lionel
    Ramesh, S.
    Bagherzadeh, Mojtaba
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2023, 49 (05) : 3182 - 3204
  • [8] DI-AA: An interpretable white-box attack for fooling deep neural networks
    Wang, Yixiang
    Liu, Jiqiang
    Chang, Xiaolin
    Rodriguez, Ricardo J.
    Wang, Jianhua
    INFORMATION SCIENCES, 2022, 610 : 14 - 32
  • [9] Short-Term Load Forecasting for Electrical Power Distribution Systems Using Enhanced Deep Neural Networks
    Tsegaye, Shewit
    Padmanaban, Sanjeevikumar
    Tjernberg, Lina Bertling
    Fante, Kinde Anlay
    IEEE ACCESS, 2024, 12 : 186856 - 186871
  • [10] Optimized Mutation of Grey-box Fuzzing: A Deep RL-based Approach
    Shao, Jiawei
    Zhou, Yan
    Liu, Guohua
    Zheng, Dezhi
    2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023 : 1296 - 1300