Identifying Ventricular Arrhythmias and Their Predictors by Applying Machine Learning Methods to Electronic Health Records in Patients With Hypertrophic Cardiomyopathy (HCM-VAr-Risk Model)

被引:44
作者
Bhattacharya, Moumita [1 ]
Lu, Dai-Yin [2 ,3 ,4 ]
Kudchadkar, Shibani M. [2 ]
Greenland, Gabriela Villarreal [2 ,5 ]
Lingamaneni, Prasanth [2 ]
Corona-Villalobos, Celia P. [2 ,6 ]
Guan, Yufan [2 ]
Marine, Joseph E. [2 ]
Olgin, Jeffrey E. [5 ]
Zimmerman, Stefan [6 ]
Abraham, Theodore P. [2 ,5 ]
Shatkay, Hagit [1 ,2 ]
Abraham, Maria Roselle [2 ,5 ]
机构
[1] Univ Delaware, Dept Comp & Informat Sci, Computat Biomed Lab, Newark, DE 19716 USA
[2] Johns Hopkins Univ, Hypertroph Cardiomyopathy Ctr Excellence, Baltimore, MD 21218 USA
[3] Taipei Vet Gen Hosp, Div Cardiol, Taipei, Taiwan
[4] Natl Yang Ming Univ, Inst Publ Hlth, Taipei, Taiwan
[5] Univ Calif San Francisco, Div Cardiol, Hypertroph Cardiomyopathy Ctr Excellence, San Francisco, CA 94143 USA
[6] Johns Hopkins Univ, Dept Radiol, Baltimore, MD USA
基金
美国国家科学基金会;
关键词
SUDDEN CARDIAC DEATH; TRANSGENIC RABBIT MODEL; MANAGEMENT; FIBROSIS;
D O I
10.1016/j.amjcard.2019.02.022
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Clinical risk stratification for sudden cardiac death (SCD) in hypertrophic cardiomyopathy (HC) employs rules derived from American College of Cardiology Foundation/American Heart Association (ACCF/AHA) guidelines or the HCM Risk-SCD model (C-index similar to 0.69), which utilize a few clinical variables. We assessed whether data-driven machine learning methods that consider a wider range of variables can effectively identify HC patients with ventricular arrhythmias (VAr) that lead to SCD. We scanned the electronic health records of 711 HC patients for sustained ventricular tachycardia or ventricular fibrillation. Patients with ventricular tachycardia or ventricular fibrillation (n = 61) were tagged as VAr cases and the remaining (n = 650) as non-VAr. The 2-sample t test and information gain criterion were used to identify the most informative clinical variables that distinguish VAr from non-VAr; patient records were reduced to include only these variables. Data imbalance stemming from low number of VAr cases was addressed by applying a combination of over- and under sampling strategies. We trained and tested multiple classifiers under this sampling approach, showing effective classification. We evaluated 93 clinical variables, of which 22 proved predictive of VAr. The ensemble of logistic regression and naive Bayes classifiers, trained based on these 22 variables and corrected for data imbalance, was most effective in separating VAr from non-VAr cases (sensitivity = 0.73, specificity = 0.76, C-index = 0.83). Our method (HCM-VAr-Risk Model) identified 12 new predictors of VAr, in addition to 10 established SCD predictors. In conclusion, this is the first application of machine learning for identifying HC patients with VAr, using clinical attributes. Our model demonstrates good performance (C-index) compared with currently employed SCD prediction algorithms, while addressing imbalance inherent in clinical data. (C) 2019 The Authors. Published by Elsevier Inc.
引用
收藏
页码:1681 / 1689
页数:9
相关论文
共 29 条
  • [1] Machine learning for neuroirnaging with scikit-learn
    Abraham, Alexandre
    Pedregosa, Fabian
    Eickenberg, Michael
    Gervais, Philippe
    Mueller, Andreas
    Kossaifi, Jean
    Gramfort, Alexandre
    Thirion, Bertrand
    Varoquaux, Gael
    [J]. FRONTIERS IN NEUROINFORMATICS, 2014, 8
  • [2] [Anonymous], 1987, INTRO BIOSTATISTICS
  • [3] [Anonymous], 2012, MACHINE LEARNING PRO
  • [4] Texture analysis and machine learning of non-contrast T1-weighted MR images in patients with hypertrophic cardiomyopathy-Preliminary results
    Baessler, Bettina
    Mannil, Manoj
    Maintz, David
    Alkadhi, Hatem
    Manka, Robert
    [J]. EUROPEAN JOURNAL OF RADIOLOGY, 2018, 102 : 61 - 67
  • [5] Bhattacharya M, 2017, IEEE INT C BIOINFORM, P663, DOI 10.1109/BIBM.2017.8217733
  • [6] Risk stratification for sudden cardiac death in hypertrophic cardiomyopathy: systematic review of clinical risk markers
    Christiaans, Imke
    van Engelen, Klaartje
    van Langen, Irene M.
    Birnie, Erwin
    Bonsel, Gouke J.
    Elliott, Perry M.
    Wilde, Arthur A. M.
    [J]. EUROPACE, 2010, 12 (03): : 313 - 321
  • [7] Drasgow F., 1988, ENCY STAT SCI, V7, P69
  • [8] Hypertrophic Cardiomyopathy Job Done or Work in Progress?
    Elliott, Perry M.
    [J]. JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2016, 67 (12) : 1410 - 1411
  • [9] 2014 ESC Guidelines on diagnosis and management of hypertrophic cardiomyopathy The Task Force for the Diagnosis and Management of Hypertrophic Cardiomyopathy of the European Society of Cardiology (ESC)
    Elliott, Perry M.
    Anastasakis, Aris
    Borger, Michael A.
    Borggrefe, Martin
    Cecchi, Franco
    Charron, Philippe
    Hagege, Albert Alain
    Lafont, Antoine
    Limongelli, Giuseppe
    Mahrholdt, Heiko
    McKenna, William J.
    Mogensen, Jens
    Nihoyannopoulos, Petros
    Nistri, Stefano
    Pieper, Petronella G.
    Pieske, Burkert
    Rapezzi, Claudio
    Rutten, Frans H.
    Tillmanns, Christoph
    Watkins, Hugh
    [J]. EUROPEAN HEART JOURNAL, 2014, 35 (39) : 2733 - +
  • [10] Gersh BJ, 2011, CIRCULATION, V124, pE783, DOI 10.1161/CIR.0b013e318223e2bd