Detecting MRSA Infections by Fusing Structured and Unstructured Electronic Health Record Data

被引:2
作者
Hartvigsen, Thomas [1 ]
Sen, Cansu [1 ]
Rundensteiner, Elke A. [1 ]
机构
[1] Worcester Polytech Inst, Worcester, MA 01609 USA
来源
BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES, BIOSTEC 2018 | 2019年 / 1024卷
关键词
MRSA; Machine learning; Early prediction; Feature fusion; BIG DATA; PREDICTION; DISEASE;
D O I
10.1007/978-3-030-29196-9_21
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Methicillin-resistant Staphylococcus aureus (MRSA), an antibiotic resistant bacteria, is a common cause of one of the more devastating hospital-acquired infections (HAI) in the United States. In this work, we study the practicality of leveraging machine learning methods for early detection of MRSA infections based on a rich variety of patient information commonly available in modern Electronic Health Records (EHR). We explore heterogeneous types of data in EHRs including on-admission demographics, throughout-stay time series and free-form clinical notes. On-admission data capture non-clinical information (e.g., age, marital status) while Throughout-stay data include vital signs, medications, laboratory studies, and other clinical assessments. Clinical notes, free-from text documents created by medical professionals, contain expert observations about patients. Our proposed system generates dense patient-level representations for each data type, extracting features from each of our data types. It then generates scores for each patient, indicating their risk of acquiring MRSA. We evaluate prediction performance achieved by core Machine Learning methods, namely Logistic Regression, Support Vector Machine, and Random Forest, when mining these different types of EHR data retrospectively to detect patterns predictive of MRSA infection. We evaluate classification performance using MIMIC III a critical care data set comprised of 12 years of patient records from the Beth Israel Deaconess Medical Center Intensive Care Unit in Boston, MA. Our experiments show that while all types of data contain predictive signals, the fusion of all sources of data leads to the most effective prediction accuracy.
引用
收藏
页码:399 / 419
页数:21
相关论文
共 31 条
  • [1] [Anonymous], 2011, EL HLTH REC EHR INC
  • [2] AUREDEN Kathy., 2010, Guide to the elimination of Methicilin-resistant Staphylococcus aureus (MRSA) transmission in hospital settings, V2nd
  • [3] Distributed solar photovoltaic array location and extent dataset for remote sensing object identification
    Bradbury, Kyle
    Saboo, Raghav
    Johnson, Timothy L.
    Malof, Jordan M.
    Devarajan, Arjun
    Zhang, Wuming
    Collins, Leslie M.
    Newell, Richard G.
    [J]. SCIENTIFIC DATA, 2016, 3
  • [4] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [5] "Big Data" in the Intensive Care Unit Closing the Data Loop
    Celi, Leo Anthony
    Mark, Roger G.
    Stone, David J.
    Montgomery, Robert A.
    [J]. AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2013, 187 (11) : 1157 - 1160
  • [6] Secretome-Based Identification of ULBP2 as a Novel Serum Marker for Pancreatic Cancer Detection
    Chang, Ya-Ting
    Wu, Chih-Ching
    Shyr, Yi-Ming
    Chen, Tse-Ching
    Hwang, Tsann-Long
    Yeh, Ta-Sen
    Chang, Kai-Ping
    Liu, Hao-Ping
    Liu, Yu-Ling
    Tsai, Ming-Hung
    Chang, Yu-Sun
    Yu, Jau-Song
    [J]. PLOS ONE, 2011, 6 (05):
  • [7] CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
  • [8] National Burden of Invasive Methicillin-Resistant Staphylococcus aureus Infections, United States, 2011
    Dantes, Raymund
    Mu, Yi
    Belflower, Ruth
    Aragon, Deborah
    Dumyati, Ghinwa
    Harrison, Lee H.
    Lessa, Fernanda C.
    Lynfield, Ruth
    Nadle, Joelle
    Petit, Susan
    Ray, Susan M.
    Schaffner, William
    Townes, John
    Fridkin, Scott
    [J]. JAMA INTERNAL MEDICINE, 2013, 173 (21) : 1970 - 1978
  • [9] Dubois S., 2017, STAT-US, V1050, P15
  • [10] Maximum probability rule based classification of MRSA infections in hospital environment: Using electronic nose
    Dutta, Ritabrata
    Dutta, Ritaban
    [J]. SENSORS AND ACTUATORS B-CHEMICAL, 2006, 120 (01) : 156 - 165