MLChain: a privacy-preserving model learning framework using blockchain

被引:0
作者
Bansal, Vidhi [1 ]
Baliyan, Niyati [2 ]
Ghosh, Mohona [1 ]
机构
[1] Indira Gandhi Delhi Tech Univ Women, Dept Informat Technol, New Delhi, India
[2] Natl Inst Technol Kurukshetra, Dept Comp Engn, Kurukshetra, India
关键词
Machine learning; Blockchain; Classification; Privacy-preserving; Predictive modeling; PROPAGATION LOGISTIC-REGRESSION;
D O I
10.1007/s10207-023-00754-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this work, we present a blockchain-based secure and flexible distributed privacy-preserving online model that helps in sharing key features of datasets across multiple organizations without violating the privacy of data. In our model, all members are encouraged to participate, discouraged to write fake data. Learning is carried out without sharing of raw data, and data sharing is immutable that improves prediction results of the data held by each member of an industry. We also propose a new consensus algorithm-Proof of Share for adding a valid transaction to the blockchain, thus preventing non participating members from reading any of the data shared by the peer and discouraging fake writes. We evaluated our model on 3, 5, and 10 members setup by applying decision tree, logistic regression, Gaussian naive Bayes, and support vector machine classifiers. The maximum increase of 26.9231% was observed in accuracy where results of a member's data were taken as baseline. F-beta(beta = 0.5) score increased by 0.4533 and F-1 score by 0.0800. The proposed model to the best of our knowledge is the only one that encourages all members to participate, rather than being passive listeners and discourages a member from forging results thus rendering it suitable for utilization in domains like health care, finance, education, etc. where data are unevenly split and secrecy of data and peers is required.
引用
收藏
页码:649 / 677
页数:29
相关论文
共 48 条
[1]  
Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
[2]   Vulnerabilities in Federated Learning [J].
Bouacida, Nader ;
Mohapatra, Prasant .
IEEE ACCESS, 2021, 9 :63229-63249
[3]  
Breiman L, 1984, Classification and Regression Trees, V1st, DOI DOI 10.1201/9781315139470
[4]   Federated learning of predictive models from federated Electronic Health Records [J].
Brisimi, Theodora S. ;
Chen, Ruidi ;
Mela, Theofanie ;
Olshevsky, Alex ;
Paschalidis, Ioannis Ch. ;
Shi, Wei .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2018, 112 :59-67
[5]  
Choudhury O., 2019, arXiv
[6]  
Daemen J., 1999, Aes proposal
[7]  
Dheeru Dua and Casey Graff, 2017, UCI machine learning repository
[8]  
Du MX, 2017, IEEE SYS MAN CYBERN, P2567, DOI 10.1109/SMC.2017.8123011
[9]   Astraea: Self-balancing Federated Learning for Improving Classification Accuracy of Mobile Deep Learning Applications [J].
Duan, Moming ;
Liu, Duo ;
Chen, Xianzhang ;
Tan, Yujuan ;
Ren, Jinting ;
Qiao, Lei ;
Liang, Liang .
2019 IEEE 37TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD 2019), 2019, :246-254
[10]   Patient clustering improves efficiency of federated machine learning to predict mortality and hospital stay time using distributed electronic medical records [J].
Huang, Li ;
Shea, Andrew L. ;
Qian, Huining ;
Masurkar, Aditya ;
Deng, Hao ;
Liu, Dianbo .
JOURNAL OF BIOMEDICAL INFORMATICS, 2019, 99