Cost-sensitive Dictionary Learning for Software Defect Prediction

被引:13
作者
Niu, Liang [1 ]
Wan, Jianwu [1 ,2 ]
Wang, Hongyuan [1 ]
Zhou, Kaiwei [1 ]
机构
[1] Changzhou Univ, Sch Informat Sci & Engn, Changzhou 213164, Jiangsu, Peoples R China
[2] Nanyang Technol Univ, Sch Civil & Environm Engn, Singapore 639798, Singapore
基金
中国国家自然科学基金;
关键词
Software defect prediction; Cost-sensitive; Dictionary learning; Discrimination; LABEL PROPAGATION; NEURAL-NETWORKS; RECOGNITION; INFORMATION; MACHINE; QUALITY;
D O I
10.1007/s11063-020-10355-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, software defect prediction has been recognized as a cost-sensitive learning problem. To deal with the unequal misclassification losses resulted by different classification errors, some cost-sensitive dictionary learning methods have been proposed recently. Generally speaking, these methods usually define the misclassification costs to measure the unequal losses and then propose to minimize the cost-sensitive reconstruction loss by embedding the cost information into the reconstruction function of dictionary learning. Although promising performance has been achieved, their cost-sensitive reconstruction functions are not well-designed. In addition, no sufficient attentions are paid to the coding coefficients which can also be helpful to reduce the reconstruction loss. To address these issues, this paper proposes a new cost-sensitive reconstruction loss function and introduces an additional cost-sensitive discrimination regularization for the coding coefficients. Both the two terms are jointly optimized in a unified cost-sensitive dictionary learning framework. By doing so, we can achieve the minimum reconstruction loss and thus obtain a more cost-sensitive dictionary for feature encoding of test data. In the experimental part, we have conducted extensive experiments ontwenty-fivesoftware projects from four benchmark datasets of NASA, AEEEM, ReLink and Jureczko. The results, in comparison withtenstate-of-the-art software defect prediction methods, demonstrate the effectiveness of learned cost-sensitive dictionary for software defect prediction.
引用
收藏
页码:2415 / 2449
页数:35
相关论文
共 50 条
  • [41] Semi-supervised Software Defect Prediction Using Task-Driven Dictionary Learning
    CHENG Ming
    WU Guoqing
    YUAN Mengting
    WAN Hongyan
    ChineseJournalofElectronics, 2016, 25 (06) : 1089 - 1096
  • [42] Cost-sensitive Fuzzy Multiple Kernel Learning for imbalanced problem
    Wang, Zhe
    Wang, Bolu
    Cheng, Yang
    Li, Dongdong
    Zhang, Jing
    NEUROCOMPUTING, 2019, 366 : 178 - 193
  • [43] Cost-Sensitive Metaheuristic Optimization-Based Neural Network with Ensemble Learning for Financial Distress Prediction
    Safi, Salah Al-Deen
    Castillo, Pedro A.
    Faris, Hossam
    APPLIED SCIENCES-BASEL, 2022, 12 (14):
  • [44] Cost-sensitive learning for imbalanced data streams
    Loezer, Lucas
    Enembreck, Fabricio
    Barddal, Jean Paul
    Britto Jr, Alceu de Souza
    PROCEEDINGS OF THE 35TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING (SAC'20), 2020, : 498 - 504
  • [45] Speech Separation By Cost-Sensitive Deep Learning
    Zhang, Xiao-Lei
    2017 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC 2017), 2017, : 159 - 162
  • [46] A cost-sensitive semi-supervised learning modelbased on uncertainty
    Zhu, Hongyu
    Wang, Xizhao
    NEUROCOMPUTING, 2017, 251 : 106 - 114
  • [47] Cost-Sensitive Active Learning for Incomplete Data
    Wang, Min
    Yang, Chunyu
    Zhao, Fei
    Min, Fan
    Wang, Xizhao
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (01): : 405 - 416
  • [48] Heterogeneous fault prediction with cost-sensitive domain adaptation
    Li, Zhiqiang
    Jing, Xiao-Yuan
    Zhu, Xiaoke
    SOFTWARE TESTING VERIFICATION & RELIABILITY, 2018, 28 (02)
  • [49] Cost-Sensitive LVQ for Bankruptcy Prediction: An Empirical Study
    Chen, Ning
    Vieira, Armando
    Duarte, Joao
    2009 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, VOL 5, 2009, : 115 - 119
  • [50] A hybrid cost-sensitive ensemble for heart disease prediction
    Qi Zhenya
    Zuoru Zhang
    BMC Medical Informatics and Decision Making, 21