Heterogeneous Defect Prediction Based on Federated Prototype Learning

Cited by: 3
Authors
Wang, Aili [1]
Yang, Linlin [1]
Wu, Haibin [1]
Iwahori, Yuji [2]
Affiliations
[1] Harbin Univ Sci & Technol, Heilongjiang Prov Key Lab Laser Spect Technol & Ap, Harbin 150080, Peoples R China
[2] Chubu Univ, Dept Comp Sci, Kasugai, Aichi 4878501, Japan
Keywords
Heterogeneous defect prediction; federated learning; prototype learning; data islands
DOI
10.1109/ACCESS.2023.3313001
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Software defect prediction identifies modules in a software project that are likely to contain defects. Heterogeneous defect prediction (HDP) builds a cross-project defect prediction model from different software defect datasets. However, because multi-source data are heterogeneous, model performance is usually unsatisfactory. In addition, project data holders are often unwilling to disclose their data because of privacy regulations and other concerns, which creates data islands. This paper presents federated prototype learning based on prototype averaging (FPLPA), which combines federated learning (FL) with prototype learning for heterogeneous defect prediction. First, each client uses the one-sided selection (OSS) algorithm to remove noise from its local training data and applies the chi-square test to select the optimal feature subset. Second, each client constructs a convolutional prototype network (CPN) to generate its own local prototypes. CPNs are more robust to heterogeneous data than convolutional neural networks (CNNs) and avoid the bias caused by class imbalance in software data. The prototypes serve as the information exchanged between the clients and the server. Because local prototypes are generated in an irreversible way, they help protect privacy during communication. Finally, each local CPN is updated with the loss between the local and global prototypes as a regularization term. We evaluated FPLPA on 10 projects from three public datasets (AEEEM, NASA, and Relink), and the experimental results show that FPLPA outperforms other HDP solutions.
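The prototype exchange described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how prototypes are computed or averaged, so the sketch assumes that a local prototype is the per-class mean of a client's embedding vectors, that the server averages prototypes weighted by per-class sample counts, and that the regularizer is a squared Euclidean distance. The function names `local_prototypes`, `average_prototypes`, and `prototype_regularizer` are hypothetical.

```python
from collections import defaultdict
from typing import Dict, List, Sequence, Tuple

Vector = List[float]
Prototypes = Dict[int, Vector]  # class label -> prototype vector


def local_prototypes(
    embeddings: Sequence[Vector], labels: Sequence[int]
) -> Tuple[Prototypes, Dict[int, int]]:
    """Client side (assumption): one prototype per class, the mean embedding."""
    sums: Dict[int, Vector] = {}
    counts: Dict[int, int] = defaultdict(int)
    for vec, label in zip(embeddings, labels):
        if label not in sums:
            sums[label] = [0.0] * len(vec)
        sums[label] = [s + v for s, v in zip(sums[label], vec)]
        counts[label] += 1
    protos = {c: [s / counts[c] for s in sums[c]] for c in sums}
    return protos, dict(counts)


def average_prototypes(
    client_results: Sequence[Tuple[Prototypes, Dict[int, int]]]
) -> Prototypes:
    """Server side (assumption): sample-count-weighted mean per class."""
    weighted: Dict[int, Vector] = {}
    totals: Dict[int, int] = defaultdict(int)
    for protos, counts in client_results:
        for c, p in protos.items():
            n = counts[c]
            if c not in weighted:
                weighted[c] = [0.0] * len(p)
            weighted[c] = [w + n * x for w, x in zip(weighted[c], p)]
            totals[c] += n
    return {c: [w / totals[c] for w in weighted[c]] for c in weighted}


def prototype_regularizer(local: Prototypes, global_protos: Prototypes) -> float:
    """Squared Euclidean distance between matching local and global prototypes."""
    return sum(
        sum((a - b) ** 2 for a, b in zip(local[c], global_protos[c]))
        for c in local
        if c in global_protos
    )


# Toy data: two clients, two classes (0 = clean, 1 = defective).
client_a = local_prototypes([[0.0, 0.0], [2.0, 2.0], [4.0, 0.0]], [0, 0, 1])
client_b = local_prototypes([[3.0, 3.0], [5.0, 5.0], [6.0, 2.0]], [0, 0, 1])
global_protos = average_prototypes([client_a, client_b])
penalty_a = prototype_regularizer(client_a[0], global_protos)
```

Only the prototype vectors and per-class counts leave a client in this sketch; the raw module metrics stay local. In a full training loop the penalty would be added, with some weight, to the client's classification loss.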
Pages: 98618-98632
Page count: 15
Related papers
58 in total
[1]  
Batista G.E.A.P.A., 2004, SIGKDD Explorations, V6, P20, DOI 10.1145/1007730.1007735
[2]   MAHAKIL: Diversity based Oversampling Approach to Alleviate the Class Imbalance Issue in Software Defect Prediction Extended Abstract [J].
Bennin, Kwabena E. ;
Keung, Jacky ;
Phannachitta, Passakorn ;
Monden, Akito ;
Mensah, Solomon .
PROCEEDINGS 2018 IEEE/ACM 40TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE), 2018, :699-699
[3]   An under-sampled software defect prediction method based on hybrid multi-objective cuckoo search [J].
Cai, Xingjuan ;
Niu, Yun ;
Geng, Shaojin ;
Zhang, Jiangjiang ;
Cui, Zhihua ;
Li, Jianwei ;
Chen, Jinjun .
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (05)
[4]   Multi-Objective Cross-Project Defect Prediction [J].
Canfora, Gerardo ;
De Lucia, Andrea ;
Di Penta, Massimiliano ;
Oliveto, Rocco ;
Panichella, Annibale ;
Panichella, Sebastiano .
2013 IEEE SIXTH INTERNATIONAL CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION (ICST 2013), 2013, :252-261
[5]   Heterogeneous Defect Prediction through Joint Metric Selection and Matching [J].
Chen, Haowen ;
Jing, Xiao-Yuan ;
Xu, Baowen .
2021 IEEE 21ST INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY (QRS 2021), 2021, :367-377
[6]   Multiview Transfer Learning for Software Defect Prediction [J].
Chen, Jinyin ;
Yang, Yitao ;
Hu, Keke ;
Xuan, Qi ;
Liu, Yi ;
Yang, Chao .
IEEE ACCESS, 2019, 7 :8901-8916
[7]   Evaluating defect prediction approaches: a benchmark and an extensive comparison [J].
D'Ambros, Marco ;
Lanza, Michele ;
Robbes, Romain .
EMPIRICAL SOFTWARE ENGINEERING, 2012, 17 (4-5) :531-577
[8]  
Devi K. Jyoshna, 2017, 2017 International Conference on Energy, Communication, Data Analytics and Soft Computing (ICECDS), P388, DOI 10.1109/ICECDS.2017.8390162
[9]  
Elhamifar E, 2012, PROC CVPR IEEE, P1600, DOI 10.1109/CVPR.2012.6247852
[10]   Unsupervised Deep Domain Adaptation for Heterogeneous Defect Prediction [J].
Gong, Lina ;
Jiang, Shujuan ;
Yu, Qiao ;
Jiang, Li .
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2019, E102D (03) :537-549