Health Synthetic Data to Enable Health Learning System and Innovation: A Scoping Review

被引:3
作者
Tsao, Shu-Feng [1 ]
Sharma, Kam [1 ]
Noor, Hateem [1 ]
Forster, Alan [2 ]
Chen, Helen [1 ]
机构
[1] Univ Waterloo, Sch Publ Hlth Sci, Waterloo, ON, Canada
[2] Ottawa Hosp, Ottawa, ON, Canada
来源
CARING IS SHARING-EXPLOITING THE VALUE IN DATA FOR HEALTH AND INNOVATION-PROCEEDINGS OF MIE 2023 | 2023年 / 302卷
关键词
Synthetic data; data governance; data sharing; FAIR; CARE;
D O I
10.3233/SHTI230063
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
With the recent advancement in the field of machine learning, health synthetic data has become a promising technique to address difficulties with time consumption when accessing and using electronic medical records for research and innovations. However, health synthetic data utility and governance have not been extensively studied. A scoping review was conducted to understand the status of evaluations and governance of health synthetic data following the PRISMA guidelines. The results showed that if synthetic health data are generated via proper methods, the risk of privacy leaks has been low and data quality is comparative to real data. However, the generation of health synthetic data has been generated on a case-by-case basis instead of being scaled up. Furthermore, regulations, ethics, and data sharing of health synthetic data have primarily been inexplicit, although common principles for sharing such data do exist.
引用
收藏
页码:53 / 57
页数:5
相关论文
共 25 条
[1]  
ahrq, Synthetic healthcare database for research (SyH-DR)
[2]   Federated Learning for Healthcare: Systematic Review and Architecture Proposal [J].
Antunes, Rodolfo Stoffel ;
da Costa, Cristiano Andre ;
Kuederle, Arne ;
Yari, Imrana Abdullahi ;
Eskofier, Bjoern .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2022, 13 (04)
[3]   Analyzing Medical Research Results Based on Synthetic Data and Their Relation to Real Data Results: Systematic Comparison From Five Observational Studies [J].
Benaim, Anat Reiner ;
Almog, Ronit ;
Gorelik, Yuri ;
Hochberg, Irit ;
Nassar, Laila ;
Mashiach, Tanya ;
Khamaisi, Mogher ;
Lurie, Yael ;
Azzam, Zaher S. ;
Khoury, Johad ;
Kurnik, Daniel ;
Beyar, Rafael .
JMIR MEDICAL INFORMATICS, 2020, 8 (02)
[4]   The Problem of Fairness in Synthetic Healthcare Data [J].
Bhanot, Karan ;
Qi, Miao ;
Erickson, John S. ;
Guyon, Isabelle ;
Bennett, Kristin P. .
ENTROPY, 2021, 23 (09)
[5]   Federated learning and Indigenous genomic data sovereignty [J].
Boscarino, Nima ;
Cartwright, Reed A. ;
Fox, Keolu ;
Tsosie, Krystal S. .
NATURE MACHINE INTELLIGENCE, 2022, 4 (11) :909-911
[6]   Federated learning of predictive models from federated Electronic Health Records [J].
Brisimi, Theodora S. ;
Chen, Ruidi ;
Mela, Theofanie ;
Olshevsky, Alex ;
Paschalidis, Ioannis Ch. ;
Shi, Wei .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2018, 112 :59-67
[7]   Operationalizing the CARE and FAIR Principles for Indigenous data futures [J].
Carroll, Stephanie Russo ;
Herczog, Edit ;
Hudson, Maui ;
Russell, Keith ;
Stall, Shelley .
SCIENTIFIC DATA, 2021, 8 (01)
[8]  
Carroll Stephanie Russo, 2020, DATA SCI J, V19, P1, DOI [10.5334/dsj-2020-043, DOI 10.5334/DSJ-2020-043, 10.5334/DSJ-2020-043]
[9]  
cineca-project, CINECA-common infrastructure for national cohorts in Europe, Canada, and Africa
[10]  
cprd, Synthetic data