Development and Validation of an Algorithm to Identify Nonalcoholic Fatty Liver Disease in the Electronic Medical Record

被引:61
作者
Corey, Kathleen E. [1 ,2 ]
Kartoun, Uri [2 ,3 ]
Zheng, Hui [4 ]
Shaw, Stanley Y. [2 ,3 ]
机构
[1] Massachusetts Gen Hosp, Gastrointestinal Unit, 55 Fruit St,Blake 4, Boston, MA 02114 USA
[2] Harvard Univ, Sch Med, Boston, MA USA
[3] Massachusetts Gen Hosp, Ctr Syst Biol, Boston, MA 02114 USA
[4] Massachusetts Gen Hosp, Biostat Ctr, Boston, MA 02114 USA
关键词
Nonalcoholic fatty liver disease; Nonalcoholic steatohepatitis; Electronic medical records; Triglycerides; DISCOVERY RESEARCH; FOLLOW-UP; RISK; POPULATION; NAFLD; MORTALITY;
D O I
10.1007/s10620-015-3952-x
中图分类号
R57 [消化系及腹部疾病];
学科分类号
摘要
Nonalcoholic fatty liver disease (NAFLD) is the most common cause of chronic liver disease worldwide. Risk factors for NAFLD disease progression and liver-related outcomes remain incompletely understood due to the lack of computational identification methods. The present study sought to design a classification algorithm for NAFLD within the electronic medical record (EMR) for the development of large-scale longitudinal cohorts. We implemented feature selection using logistic regression with adaptive LASSO. A training set of 620 patients was randomly selected from the Research Patient Data Registry at Partners Healthcare. To assess a true diagnosis for NAFLD we performed chart reviews and considered either a documentation of a biopsy or a clinical diagnosis of NAFLD. We included in our model variables laboratory measurements, diagnosis codes, and concepts extracted from medical notes. Variables with P < 0.05 were included in the multivariable analysis. The NAFLD classification algorithm included number of natural language mentions of NAFLD in the EMR, lifetime number of ICD-9 codes for NAFLD, and triglyceride level. This classification algorithm was superior to an algorithm using ICD-9 data alone with AUC of 0.85 versus 0.75 (P < 0.0001) and leads to the creation of a new independent cohort of 8458 individuals with a high probability for NAFLD. The NAFLD classification algorithm is superior to ICD-9 billing data alone. This approach is simple to develop, deploy, and can be applied across different institutions to create EMR-based cohorts of individuals with NAFLD.
引用
收藏
页码:913 / 919
页数:7
相关论文
共 26 条
[11]   The validity of viral hepatitis and chronic liver disease diagnoses in Veterans Affairs administrative databases [J].
Kramer, J. R. ;
Davila, J. A. ;
Miller, E. D. ;
Richardson, P. ;
Giordano, T. P. ;
El-Serag, H. B. .
ALIMENTARY PHARMACOLOGY & THERAPEUTICS, 2008, 27 (03) :274-282
[12]   Electronic Medical Records for Discovery Research in Rheumatoid Arthritis [J].
Liao, Katherine P. ;
Cai, Tianxi ;
Gainer, Vivian ;
Goryachev, Sergey ;
Zeng-Treitler, Qing ;
Raychaudhuri, Soumya ;
Szolovits, Peter ;
Churchill, Susanne ;
Murphy, Shawn ;
Kohane, Isaac ;
Karlson, Elizabeth W. ;
Plenge, Robert M. .
ARTHRITIS CARE & RESEARCH, 2010, 62 (08) :1120-1127
[13]   Nonalcoholic fatty liver disease: A spectrum of clinical and pathological severity [J].
Matteoni, CA ;
Younossi, ZM ;
Gramlich, T ;
Boparai, N ;
Liu, YC ;
McCullough, AJ .
GASTROENTEROLOGY, 1999, 116 (06) :1413-1419
[14]   Instrumenting the health care enterprise for discovery research in the genomic era [J].
Murphy, Shawn ;
Churchill, Susanne ;
Bry, Lynn ;
Chueh, Henry ;
Weiss, Scott ;
Lazarus, Ross ;
Zeng, Qing ;
Dubey, Anil ;
Gainer, Vivian ;
Mendis, Michael ;
Glaser, John ;
Kohane, Isaac .
GENOME RESEARCH, 2009, 19 (09) :1675-1681
[15]   Meta-analysis: Natural history of non-alcoholic fatty liver disease (NAFLD) and diagnostic accuracy of non-invasive tests for liver disease severity [J].
Musso, Giovanni ;
Gambino, Roberto ;
Cassader, Maurizio ;
Pagano, Gianfranco .
ANNALS OF MEDICINE, 2011, 43 (08) :617-649
[16]   Validation of electronic medical record-based phenotyping algorithms: results and lessons learned from the eMERGE network [J].
Newton, Katherine M. ;
Peissig, Peggy L. ;
Kho, Abel Ngo ;
Bielinski, Suzette J. ;
Berg, Richard L. ;
Choudhary, Vidhu ;
Basford, Melissa ;
Chute, Christopher G. ;
Kullo, Iftikhar J. ;
Li, Rongling ;
Pacheco, Jennifer A. ;
Rasmussen, Luke V. ;
Spangler, Leslie ;
Denny, Joshua C. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2013, 20 (E1) :E147-E154
[17]   Increased overall mortality and liver-related mortality in non-alcoholic fatty liver disease [J].
Ong, Janus P. ;
Pitts, Angela ;
Younossi, Zobair M. .
JOURNAL OF HEPATOLOGY, 2008, 49 (04) :608-612
[18]   Assessing the accuracy of administrative data in health information systems [J].
Peabody, JW ;
Luck, J ;
Jain, S ;
Bertenthal, D ;
Glassman, P .
MEDICAL CARE, 2004, 42 (11) :1066-1072
[19]   Decreased Survival of Subjects with Elevated Liver Function Tests During a 28-Year Follow-Up [J].
Soderberg, Cecilia ;
Stal, Per ;
Askling, Johan ;
Glaumann, Hans ;
Lindberg, Greger ;
Marmur, Joel ;
Hultcrantz, Rolf .
HEPATOLOGY, 2010, 51 (02) :595-602
[20]   Predicting incident fatty liver using simple cardio-metabolic risk factors at baseline [J].
Sung, Ki-Chul ;
Kim, Bum-Soo ;
Cho, Yong-Kyun ;
Park, Dong-il ;
Woo, Sookyoung ;
Kim, Seonwoo ;
Wild, Sarah H. ;
Byrne, Christopher D. .
BMC GASTROENTEROLOGY, 2012, 12