Non-IID Medical Imaging Data on COVID-19 in the Federated Learning Framework: Impact and Directions

被引:2
作者
Alhafiz, Fatimah Saeed [1 ]
Basuhail, Abdullah Ahmad [1 ]
机构
[1] King Abdulaziz Univ, Fac Comp & Informat Technol, Jeddah 21589, Saudi Arabia
来源
COVID | 2024年 / 4卷 / 12期
关键词
COVID-19 lung medical image; federated learning; data heterogenity; non-IID type; generalization; personalization; MODELS;
D O I
10.3390/covid4120140
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
After first appearing in December 2019, coronavirus disease 2019 (COVID-19) spread rapidly, leading to global effects and significant risks to health systems. The virus's high replication competence in the human lung accelerated the severity of lung pneumonia cases, resulting in a catastrophic death rate. Variable observations in the clinical testing of virus-related and patient-related cases across different populations led to ambiguous results. Medical and epidemiological studies on the virus effectively use imaging and scanning devices to help explain the virus's behavior and its impact on the lungs. Varying equipment resources and a lack of uniformity in medical imaging acquisition led to disorganized and widely dispersed data collection worldwide, while high heterogeneity in datasets caused a poor understanding of the virus and related strains, consequently leading to unstable results that could not be generalized. Hospitals and medical institutions, therefore, urgently need to collaborate to share and extract useful knowledge from these COVID-19 datasets while preserving the privacy of medical records. Researchers are turning to an emerging technology that enhances the reliability and accessibility of information without sharing actual patient data. Federated learning (FL) is a technique that learns distributed data locally, sharing only the weights of each local model to compute a global model, and has the potential to improve the generalization of diagnosis and treatment decisions. This study investigates the applicability of FL for COVID-19 under the impact of data heterogeneity, defining the lung imaging characteristics and identifying the practical constraints of FL in medical fields. It describes the challenges of implementation from a technical perspective, with reference to valuable research directions, and highlights the research challenges that present opportunities for further efforts to overcome the pitfalls of distributed learning performance. The primary objective of this literature review is to provide valuable insights that will aid in the formulation of effective technical strategies to mitigate the impact of data heterogeneity on the generalization of FL results, particularly in light of the ongoing and evolving COVID-19 pandemic.
引用
收藏
页码:1985 / 2016
页数:32
相关论文
共 90 条
[1]  
Adhikari R, 2024, Arxiv, DOI arXiv:2401.12438
[2]   How does DICOM support big data management? Investigating its use in medical imaging community [J].
Aiello, Marco ;
Esposito, Giuseppina ;
Pagliari, Giulio ;
Borrelli, Pasquale ;
Brancato, Valentina ;
Salvatore, Marco .
INSIGHTS INTO IMAGING, 2021, 12 (01)
[3]   Diagnostic Value of Imaging Modalities for COVID-19: Scoping Review [J].
Aljondi, Rowa ;
Alghamdi, Salem .
JOURNAL OF MEDICAL INTERNET RESEARCH, 2020, 22 (08)
[4]  
[Anonymous], 2022, Timeline: WHO's COVID-19 response [Internet]
[5]   Advancing COVID-19 diagnosis with privacy-preserving collaboration in artificial intelligence [J].
Bai, Xiang ;
Wang, Hanchen ;
Ma, Liya ;
Xu, Yongchao ;
Gan, Jiefeng ;
Fan, Ziwei ;
Yang, Fan ;
Ma, Ke ;
Yang, Jiehua ;
Bai, Song ;
Shu, Chang ;
Zou, Xinyu ;
Huang, Renhao ;
Zhang, Changzheng ;
Liu, Xiaowu ;
Tu, Dandan ;
Xu, Chuou ;
Zhang, Wenqing ;
Wang, Xi ;
Chen, Anguo ;
Zeng, Yu ;
Yang, Dehua ;
Wang, Ming-Wei ;
Holalkere, Nagaraj ;
Halin, Neil J. ;
Kamel, Ihab R. ;
Wu, Jia ;
Peng, Xuehua ;
Wang, Xiang ;
Shao, Jianbo ;
Mongkolwat, Pattanasak ;
Zhang, Jianjun ;
Liu, Weiyang ;
Roberts, Michael ;
Teng, Zhongzhao ;
Beer, Lucian ;
Sanchez, Lorena E. ;
Sala, Evis ;
Rubin, Daniel L. ;
Weller, Adrian ;
Lasenby, Joan ;
Zheng, Chuangsheng ;
Wang, Jianming ;
Li, Zhen ;
Schonlieb, Carola ;
Xia, Tian .
NATURE MACHINE INTELLIGENCE, 2021, 3 (12) :1081-1089
[6]   Accounting for data variability in multi-institutional distributed deep learning for medical imaging [J].
Balachandar, Niranjan ;
Chang, Ken ;
Kalpathy-Cramer, Jayashree ;
Rubin, Daniel L. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2020, 27 (05) :700-708
[7]   Federated learning review: Fundamentals, enabling technologies, and future applications [J].
Banabilah, Syreen ;
Aloqaily, Moayad ;
Alsayed, Eitaa ;
Malik, Nida ;
Jararweh, Yaser .
INFORMATION PROCESSING & MANAGEMENT, 2022, 59 (06)
[8]   A Large-Scale COVID-19 Twitter Chatter Dataset for Open Scientific Research-An International Collaboration [J].
Banda, Juan M. ;
Tekumalla, Ramya ;
Wang, Guanyu ;
Yu, Jingyuan ;
Liu, Tuo ;
Ding, Yuning ;
Artemova, Ekaterina ;
Tutubalina, Elena ;
Chowell, Gerardo .
EPIDEMIOLOGIA, 2021, 2 (03) :315-324
[9]  
Bhattacharya A, 2022, arXiv
[10]   Coronavirus disease (COVID-19) detection in Chest X-Ray images using majority voting based classifier ensemble [J].
Chandra, Tej Bahadur ;
Verma, Kesari ;
Singh, Bikesh Kumar ;
Jain, Deepak ;
Netam, Satyabhuwan Singh .
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 165 (165)