MLChain: a privacy-preserving model learning framework using blockchain

被引:0
作者
Bansal, Vidhi [1 ]
Baliyan, Niyati [2 ]
Ghosh, Mohona [1 ]
机构
[1] Indira Gandhi Delhi Tech Univ Women, Dept Informat Technol, New Delhi, India
[2] Natl Inst Technol Kurukshetra, Dept Comp Engn, Kurukshetra, India
关键词
Machine learning; Blockchain; Classification; Privacy-preserving; Predictive modeling; PROPAGATION LOGISTIC-REGRESSION;
D O I
10.1007/s10207-023-00754-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this work, we present a blockchain-based secure and flexible distributed privacy-preserving online model that helps in sharing key features of datasets across multiple organizations without violating the privacy of data. In our model, all members are encouraged to participate, discouraged to write fake data. Learning is carried out without sharing of raw data, and data sharing is immutable that improves prediction results of the data held by each member of an industry. We also propose a new consensus algorithm-Proof of Share for adding a valid transaction to the blockchain, thus preventing non participating members from reading any of the data shared by the peer and discouraging fake writes. We evaluated our model on 3, 5, and 10 members setup by applying decision tree, logistic regression, Gaussian naive Bayes, and support vector machine classifiers. The maximum increase of 26.9231% was observed in accuracy where results of a member's data were taken as baseline. F-beta(beta = 0.5) score increased by 0.4533 and F-1 score by 0.0800. The proposed model to the best of our knowledge is the only one that encourages all members to participate, rather than being passive listeners and discourages a member from forging results thus rendering it suitable for utilization in domains like health care, finance, education, etc. where data are unevenly split and secrecy of data and peers is required.
引用
收藏
页码:649 / 677
页数:29
相关论文
共 48 条
  • [1] SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation
    Blewitt, Marnie E.
    Gendrel, Anne-Valerie
    Pang, Zhenyi
    Sparrow, Duncan B.
    Whitelaw, Nadia
    Craig, Jeffrey M.
    Apedaile, Anwyn
    Hilton, Douglas J.
    Dunwoodie, Sally L.
    Brockdorff, Neil
    Kay, Graham F.
    Whitelaw, Emma
    [J]. NATURE GENETICS, 2008, 40 (05) : 663 - 669
  • [2] Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
  • [3] Vulnerabilities in Federated Learning
    Bouacida, Nader
    Mohapatra, Prasant
    [J]. IEEE ACCESS, 2021, 9 : 63229 - 63249
  • [4] Federated learning of predictive models from federated Electronic Health Records
    Brisimi, Theodora S.
    Chen, Ruidi
    Mela, Theofanie
    Olshevsky, Alex
    Paschalidis, Ioannis Ch.
    Shi, Wei
    [J]. INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2018, 112 : 59 - 67
  • [5] Choudhury O, 2019, ARXIV
  • [6] Daemen J., 1999, AES PROPOSAL
  • [7] Du MX, 2017, IEEE SYS MAN CYBERN, P2567, DOI 10.1109/SMC.2017.8123011
  • [8] Astraea: Self-balancing Federated Learning for Improving Classification Accuracy of Mobile Deep Learning Applications
    Duan, Moming
    Liu, Duo
    Chen, Xianzhang
    Tan, Yujuan
    Ren, Jinting
    Qiao, Lei
    Liang, Liang
    [J]. 2019 IEEE 37TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD 2019), 2019, : 246 - 254
  • [9] Graff C, 2017, UCI MACHINE LEARNING
  • [10] Patient clustering improves efficiency of federated machine learning to predict mortality and hospital stay time using distributed electronic medical records
    Huang, Li
    Shea, Andrew L.
    Qian, Huining
    Masurkar, Aditya
    Deng, Hao
    Liu, Dianbo
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2019, 99