Predicting the Prognosis of Stroke Patients Based on Personalized Federated Learning

被引:0
作者
Yang, Jie [1 ]
Xie, Haoyu [2 ]
Huang, Lianfen [2 ]
Gao, Zhibin [3 ]
Shen, Shaowei [2 ]
机构
[1] Xiamen Univ, Natl Inst Data Sci Hlth & Med, Xiamen, Peoples R China
[2] Xiamen Univ, Sch Informat, Xiamen, Peoples R China
[3] Jimei Univ, Nav Inst, Xiamen, Peoples R China
来源
JOURNAL OF INTERNET TECHNOLOGY | 2024年 / 25卷 / 06期
基金
中国国家自然科学基金;
关键词
Stroke; Non-IID data; Personalized federated learning; Ensemble learning; Machine learning;
D O I
10.70003/160792642024112506002
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, the incidence of stroke has been increasing and showing a trend of younger people. Based on the distributed stroke risk assessment modeling scenario, this paper solves the problems of insufficient data and difficult data sharing in medical institutions through federated learning. Considering the features of structured data, the proposed algorithm takes the non-neural network model as the base model, and combines bagging and gradient boosting algorithms to achieve model updating and aggregation. This paper also proposes the model pruning method to realize the personalization of each participant's model and reduces the data transmission cost of the algorithms by separating the weight matrix of the model and the model parameters. Experiments show that the proposed method greatly outperforms existing baseline approaches according to the predictive results, and the accuracy of the personalized model and the global model in the International Stroke Trial (IST) dataset reaches 78.20% and 76.85%, respectively, which has broader application scenarios.
引用
收藏
页码:815 / 824
页数:10
相关论文
共 26 条
  • [1] Deep Learning and Machine Learning for Early Detection of Stroke and Haemorrhage
    Al-Mekhlafi, Zeyad Ghaleb
    Senan, Ebrahim Mohammed
    Rassem, Taha H.
    Mohammed, Badiea Abdulkarem
    Makbol, Nasrin M.
    Alanazi, Adwan Alownie
    Almurayziq, Tariq S.
    Ghaleb, Fuad A.
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 72 (01): : 775 - 796
  • [2] Alwan A, 2011, GLOBAL STATUS REPORT ON NONCOMMUNICABLE DISEASES 2010, P1
  • [3] Ang Li, 2021, SenSys '21: Proceedings of the 19th ACM Conference on Embedded Networked Sensor Systems, P42, DOI 10.1145/3485730.3485929
  • [4] XGBoost: A Scalable Tree Boosting System
    Chen, Tianqi
    Guestrin, Carlos
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 785 - 794
  • [5] Health Insurance Portability and Accountability Act (HIPAA) Compliant Access Control Model for Web Services
    Cheng, Vivying S. Y.
    Hung, Patrick C. K.
    [J]. INTERNATIONAL JOURNAL OF HEALTHCARE INFORMATION SYSTEMS AND INFORMATICS, 2006, 1 (01) : 22 - 39
  • [6] Fuglede B, 2004, 2004 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY, PROCEEDINGS, P31
  • [7] Towards Fair and Decentralized Federated Learning System for Gradient Boosting Decision Trees
    Gao, Shiqi
    Li, Xianxian
    Shi, Zhenkui
    Liu, Peng
    Li, Chunpei
    [J]. SECURITY AND COMMUNICATION NETWORKS, 2022, 2022
  • [8] Epidemiology of Ischemic and Hemorrhagic Stroke: Incidence, Prevalence, Mortality, and Risk Factors
    Grysiewicz, Rebbeca A.
    Thomas, Kurian
    Pandey, Dillp K.
    [J]. NEUROLOGIC CLINICS, 2008, 26 (04) : 871 - 895
  • [9] COMMUTE: Communication-efficient transfer learning for multi-site risk prediction
    Gu, Tian
    Lee, Phil H.
    Duan, Rui
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2023, 137
  • [10] The coming era of artificial intelligence in biological data science
    Han, Henry
    Liu, Wenbin
    [J]. BMC BIOINFORMATICS, 2019, 20 (01)