Survey of Open-Source Software Defect Prediction Method

被引:0
作者
Tian X. [1 ,2 ]
Chang J. [3 ]
Zhang C. [2 ]
Rong J. [2 ,6 ]
Wang Z. [3 ]
Zhang G. [3 ]
Wang H. [1 ,2 ]
Wu G. [1 ,2 ,4 ]
Hu J. [5 ]
Zhang Y. [1 ,2 ,6 ,7 ]
机构
[1] School of Cyber Engineering, Xidian University, Xi’an
[2] National Computer Network Intrusion Protection Center, University of Chinese Academy of Sciences, Beijing
[3] School of Information Science and Engineering, Hebei University of Science and Technology, Shijiazhuang
[4] Guangxi Key Laboratory of Cryptography and Information Security, Guilin University of Electronic Technology, Guangxi, Guilin
[5] Graduate School of Information, Production and Systems, Waseda University
[6] College of Cyberspace Security, Hainan University, Haikou
[7] Zhongguancun Laboratory, Beijing
来源
Jisuanji Yanjiu yu Fazhan/Computer Research and Development | 2023年 / 60卷 / 07期
基金
中国国家自然科学基金;
关键词
deep learning; machine learning; metric; semantic and syntactic analysis; software defect prediction; vulnerability prediction;
D O I
10.7544/issn1000-1239.202221046
中图分类号
学科分类号
摘要
Open-source software defect prediction reduces software repair costs and improves product quality by mining data from software history warehouses, using the syntactic semantic features of metrics related to software defects or the source code itself, and utilizing machine learning or deep learning methods to find software defects in advance. Vulnerability prediction extracts and tags code modules by mining software instance repositories to predict whether new code instances contain vulnerabilities in order to reduce the cost of vulnerability discovery and fixing. We investigate and analyze the relevant literatures in the field of software defect prediction from 2000 to December 2022. Taking machine learning and deep learning as the starting point, we sort out two types of prediction models which are based on software metrics and grammatical semantics. Based on the two types of models, the difference and connection between software defect prediction and vulnerability prediction are analyzed. Moreover, six frontier hot issues such as dataset source and processing, code vector representation method, pre-training model improvement, deep learning model exploration, fine-grained prediction technology, software defect prediction and vulnerability prediction model migration are analyzed in detail. Finally, the future development direction of software defect prediction is pointed out. © 2023 Science Press. All rights reserved.
引用
收藏
页码:1467 / 1488
页数:21
相关论文
共 124 条
[1]  
Pachouly J, Ahirrao S, Kotecha K, Et al., A systematic literature review on software defect prediction using artificial intelligence: Datasets, data validation methods, approaches, and tools[J], Engineering Applications of Artificial Intelligence, 111, pp. 1-33, (2022)
[2]  
Xiang Chen, Qing Gu, Wangshu Liu, Et al., Survey of static software defect prediction[J], Journal of Software, 27, 1, (2016)
[3]  
Mianxue Gu, Hongyu Sun, Dan Han, Et al., Software security vulnerability mining based on deep learning[J], Journal of Computer Research and Development, 58, 10, (2021)
[4]  
Trachtenberg M., Discovering how to ensure software reliability[J], Radio Corporation of America Engineer, 27, 1, pp. 53-57, (1982)
[5]  
Lianfen Qian, Qingchuan Yao, Khoshgoftaar T M., Dynamic two-phase truncated Rayleigh model for release date prediction of software[J], Journal of Software Engineering and Applications, 3, pp. 603-609, (2010)
[6]  
Bustamante A, Bustamante B., Multinomial-exponential reliability function: A software reliability model[J], Reliability Engineering & System Safety, 79, 3, pp. 281-288, (2003)
[7]  
Zheng Yanyan, Xu Renzuo, An adaptive exponential smoothing approach for software reliability prediction, Proc of 2008 4th Int Conf on Wireless Communications, Networking and Mobile Computing, pp. 1-4, (2008)
[8]  
Yamada S, Ohba M, Osaki S., S-shaped reliability growth modeling for software error detection[J], IEEE Transactions on Reliability, 32, 5, (1983)
[9]  
Kececioglu D, Jiang S, Vassiliou P., The modified Gompertz reliability growth model[C], Proc of Annual Reliability and Maintainability Symp (RAMS), pp. 160-165, (1994)
[10]  
Ahmad N, Imam M Z., Software reliability growth models with log-logistic testing-effort function: A comparative study[J], International Journal of Computer Applications, 75, 12, (2014)