Predicting Atrial Fibrillation and Flutter using Electronic Health Records

被引:0
作者
Karnik, Shreyas [1 ]
Tan, Sin Lam [1 ]
Berg, Bess
Glurich, Ingrid [1 ]
Zhang, Jinfeng
Vidaillet, Humberto J. [1 ]
Page, C. David
Chowdhary, Esh [1 ]
机构
[1] Marshfield Clin Res Fdn, Marshfield Clin, Biomed Informat Res Ctr, Marshfield, WI 54449 USA
来源
2012 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC) | 2012年
关键词
GENETICS; UMLS; RISK;
D O I
暂无
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Electronic Health Records (EHR) contain large amounts of useful information that could potentially be used for building models for predicting onset of diseases. In this study, we have investigated the use of free-text and coded data in Marshfield Clinic's EHR, individually and in combination for building machine learning based models to predict the first ever episode of atrial fibrillation and/or atrial flutter (AFF). We trained and evaluated our AFF models on the EHR data across different time intervals (1, 3, 5 and all years) prior to first documented onset of AFF. We applied several machine learning methods, including naive bayes, support vector machines (SVM), logistic regression and random forests for building AFF prediction models and evaluated these using 10-fold cross-validation approach. On text-based datasets, the best model achieved an F-measure of 60.1%, when applied exclusively to coded data. The combination of textual and coded data achieved comparable performance. The study results attest to the relative merit of utilizing textual data to complement the use of coded data for disease onset prediction modeling.
引用
收藏
页码:5562 / 5565
页数:4
相关论文
共 17 条
[1]  
[Anonymous], 1999, Ph.D. Thesis
[2]  
Aronson AR, 2001, J AM MED INFORM ASSN, P17
[3]   A role for CETP TaqIB polymorphism in determining susceptibility to atrial fibrillation: a nested case control study [J].
Asselbergs, Folkert W. ;
Moore, Jason H. ;
van den Berg, Maarten P. ;
Rimm, Eric B. ;
de Boer, Rudolf A. ;
Dullaart, Robin P. ;
Navis, Gerjan ;
van Gilst, Wiek H. .
BMC MEDICAL GENETICS, 2006, 7
[4]  
Berg B., 2010, NIPS WORKSH PRED MOD
[5]   The Unified Medical Language System (UMLS): integrating biomedical terminology [J].
Bodenreider, O .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D267-D270
[6]   A decision support system based on support vector machines for diagnosis of the heart valve diseases [J].
Comak, Emre ;
Arslan, Ahmet ;
Turkoglu, Ibrahim .
COMPUTERS IN BIOLOGY AND MEDICINE, 2007, 37 (01) :21-27
[7]   Genetics of atrial fibrillation [J].
Ellinor, Patrick T. ;
Yi, B. Alexander ;
MacRae, Calum A. .
MEDICAL CLINICS OF NORTH AMERICA, 2008, 92 (01) :41-+
[8]   Parental atrial fibrillation as a risk factor for atrial fibrillation in offspring [J].
Fox, CS ;
Parise, H ;
D'Agostino, RB ;
Lloyd-Jones, DM ;
Vasan, RS ;
Wang, TJ ;
Levy, D ;
Wolf, PA ;
Benjamin, EJ .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2004, 291 (23) :2851-2855
[9]  
Frank E, 2005, DATA MINING AND KNOWLEDGE DISCOVERY HANDBOOK, P1305, DOI 10.1007/0-387-25465-X_62
[10]   Incidence and predictors of atrial flutter in the general population [J].
Granada, J ;
Uribe, W ;
Chyou, PH ;
Maassen, K ;
Vierkant, R ;
Smith, PN ;
Hayes, J ;
Eaker, E ;
Vidaillet, H .
JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2000, 36 (07) :2242-2246