A Hierarchical Feature Ensemble Deep Learning Approach for Software Defect Prediction

被引:2
|
作者
Zhang, Shenggang [1 ]
Jiang, Shujuan [1 ]
Yan, Yue [1 ]
机构
[1] China Univ Min & Technol, Sch Comp Sci & Technol, Xuzhou 221116, Peoples R China
基金
中国国家自然科学基金;
关键词
Software defect prediction; deep learning; abstract syntax tree; class dependency network; ensemble learning;
D O I
10.1142/S0218194023500079
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Software defect prediction can detect modules that may have defects in advance and optimize resource allocation to improve test efficiency and reduce development costs. Traditional features cannot capture deep semantic and grammatical information, which limits the further development of software defect prediction. Therefore, it has gradually become a trend to use deep learning technology to automatically learn valuable deep features from source code or relevant data. However, most software defect prediction methods based on deep learning extraction features from a single information source or only use a single deep learning model, which leads to the fact that the extracted features are not comprehensive enough to affect the final prediction performance. In view of this, this paper proposes a Hierarchical Feature Ensemble Deep Learning (HFEDL) Approach for software defect prediction. Firstly, the HFEDL approach needs to obtain three types of information sources: abstract syntax tree (AST), class dependency network (CDN) and traditional features. Then, the Convolutional Neural Network (CNN) and the Bidirectional Long Short-Term Memory based on Attention mechanism (BiLSTM+Attention) are used to extract different valuable features from the three information sources and multiple prediction sub-models are constructed. Next, all the extracted features are fused by a filter mechanism to obtain more comprehensive features and construct a fusion prediction sub-model. Finally, all the sub-models are integrated by an ensemble learning method to obtain the final prediction model. We use 11 projects in the PROMISE defect repository and evaluate our approach in both non-effort-aware and effort-aware scenarios. The experimental results show that the prediction performance of our approach is superior to state-of-the-art methods in both scenarios.
引用
收藏
页码:543 / 573
页数:31
相关论文
共 50 条
  • [1] Deep Semantic Feature Learning for Software Defect Prediction
    Wang, Song
    Liu, Taiyue
    Nam, Jaechang
    Tan, Lin
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2020, 46 (12) : 1267 - 1293
  • [2] Ensemble learning based software defect prediction
    Dong, Xin
    Liang, Yan
    Miyamoto, Shoichiro
    Yamaguchi, Shingo
    JOURNAL OF ENGINEERING RESEARCH, 2023, 11 (04): : 377 - 391
  • [3] LDFR: Learning deep feature representation for software defect prediction
    Xu, Zhou
    Li, Shuai
    Xu, Jun
    Liu, Jin
    Luo, Xiapu
    Zhang, Yifeng
    Zhang, Tao
    Keung, Jacky
    Tang, Yutian
    JOURNAL OF SYSTEMS AND SOFTWARE, 2019, 158
  • [4] Software Defect Prediction: A Machine Learning Approach with Voting Ensemble
    Mosquera, Marcela
    Hurtado, Remigio
    PROCEEDINGS OF NINTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, ICICT 2024, VOL 3, 2024, 1013 : 585 - 595
  • [5] Bootstrap aggregation ensemble learning-based reliable approach for software defect prediction by using characterized code feature
    Suresh Kumar, P.
    Behera, H. S.
    Nayak, Janmenjoy
    Naik, Bighnaraj
    INNOVATIONS IN SYSTEMS AND SOFTWARE ENGINEERING, 2021, 17 (04) : 355 - 379
  • [6] Bootstrap aggregation ensemble learning-based reliable approach for software defect prediction by using characterized code feature
    P. Suresh Kumar
    H. S. Behera
    Janmenjoy Nayak
    Bighnaraj Naik
    Innovations in Systems and Software Engineering, 2021, 17 : 355 - 379
  • [7] Multiple kernel ensemble learning for software defect prediction
    Wang, Tiejian
    Zhang, Zhiwu
    Jing, Xiaoyuan
    Zhang, Liqiang
    AUTOMATED SOFTWARE ENGINEERING, 2016, 23 (04) : 569 - 590
  • [8] Multiple kernel ensemble learning for software defect prediction
    Tiejian Wang
    Zhiwu Zhang
    Xiaoyuan Jing
    Liqiang Zhang
    Automated Software Engineering, 2016, 23 : 569 - 590
  • [9] An Ensemble Learning Approach for Software Defect Prediction in Developing Quality Software Product
    Saheed, Yakub Kayode
    Longe, Olumide
    Baba, Usman Ahmad
    Rakshit, Sandip
    Vajjhala, Narasimha Rao
    ADVANCES IN COMPUTING AND DATA SCIENCES, PT I, 2021, 1440 : 317 - 326
  • [10] On the use of deep learning in software defect prediction
    Giray, Gorkem
    Bennin, Kwabena Ebo
    Koksal, Omer
    Babur, Onder
    Tekinerdogan, Bedir
    JOURNAL OF SYSTEMS AND SOFTWARE, 2023, 195