Machine learning for administrative health records: A systematic review of techniques and applications

Cited by: 9
Authors
Caruana, Adrian [1]
Bandara, Madhushi [1]
Musial, Katarzyna [2]
Catchpoole, Daniel [1, 3]
Kennedy, Paul J. [1, 4, 5]
Affiliations
[1] Univ Technol Sydney, Fac Engn & IT, Australian Artificial Intelligence Inst, Ultimo, Australia
[2] Univ Technol Sydney, Data Sci Inst, Complex Adapt Syst Lab, Fac Engn & IT, Ultimo, Australia
[3] Childrens Hosp Westmead, Biospecimen Res Serv, Childrens Canc Res Unit, Westmead, NSW, Australia
[4] Univ Technol Sydney, Joint Res Ctr AI Hlth & Wellness, Sydney, NSW, Australia
[5] Ontario Tech Univ, Oshawa, ON, Canada
Keywords
Machine learning; Administrative Health Record; Health informatics; Systematic review; Pattern mining; Population health; MEDICAL-RECORDS; BAYESIAN NETWORK; CLINICAL PATHWAY; CARE; RISK; PREDICTION; EXTRACTION
DOI
10.1016/j.artmed.2023.102642
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Machine learning provides many powerful and effective techniques for analysing heterogeneous electronic health records (EHR). Administrative Health Records (AHR) are a subset of EHR collected for administrative purposes, and the use of machine learning on AHRs is a growing subfield of EHR analytics. Existing reviews of EHR analytics emphasise that the data modality of the EHR limits the breadth of suitable machine learning techniques and pursuable healthcare applications. Despite emphasising the importance of data modality, the literature fails to analyse which techniques and applications are relevant to AHRs. AHRs contain uniquely well-structured, categorically encoded records which are distinct from other data modalities captured by EHRs, and they can provide valuable information pertaining to how patients interact with the healthcare system. This paper systematically reviews AHR-based research, analysing 70 relevant studies and spanning multiple databases. We identify and analyse which machine learning techniques are applied to AHRs and which health informatics applications are pursued in AHR-based research. We also analyse how these techniques are applied in pursuit of each application, and identify the limitations of these approaches. We find that while AHR-based studies are disconnected from each other, the use of AHRs in health informatics research is substantial and accelerating. Our synthesis of these studies highlights the utility of AHRs for pursuing increasingly complex and diverse research objectives despite a number of pervading data- and technique-based limitations. Finally, through our findings, we propose a set of future research directions that can enhance the utility of AHR data and machine learning techniques for health informatics research.
Pages: 17