Automated extraction of sudden cardiac death risk factors in hypertrophic cardiomyopathy patients by natural language processing

Cited by: 15
Authors
Moon, Sungrim [1 ]
Liu, Sijia [1 ]
Scott, Christopher G. [2 ]
Samudrala, Sujith [3 ]
Abidian, Mohamed M. [3 ]
Geske, Jeffrey B. [3 ]
Noseworthy, Peter A. [3 ]
Shellum, Jane L. [4 ]
Chaudhry, Rajeev [4 ,5 ]
Ommen, Steve R. [3 ]
Nishimura, Rick A. [3 ]
Liu, Hongfang [1 ]
Arruda-Olson, Adelaide M. [1 ,3 ]
Affiliations
[1] Mayo Clin, Dept Hlth Sci Res, Div Digital Hlth Sci, Rochester, MN USA
[2] Mayo Clin, Dept Hlth Sci Res, Div Biomed Stat & Informat, Rochester, MN USA
[3] Mayo Clin, Dept Cardiovasc Med, Rochester, MN USA
[4] Mayo Clin, Robert & Patricia Kern Ctr Sci Hlth Care Delivery, Rochester, MN USA
[5] Mayo Clin, Div Community Internal Med, Rochester, MN USA
Funding
National Institutes of Health (USA);
Keywords
Hypertrophic cardiomyopathy; Sudden cardiac death; Natural language processing; Electronic health records; HEALTH-CARE-SYSTEM; CLINICAL NOTES; BIG DATA; INFORMATICS; VALIDATION; STATEMENT; DISEASE; RECORDS;
DOI
10.1016/j.ijmedinf.2019.05.008
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Background: The management of hypertrophic cardiomyopathy (HCM) patients requires knowledge of the risk factors associated with sudden cardiac death (SCD). SCD risk factors such as syncope and family history of SCD (FH-SCD), as well as family history of HCM (FH-HCM), are documented in electronic health records (EHRs) as clinical narratives. Automated extraction of risk factors from clinical narratives by natural language processing (NLP) may expedite the management workflow for HCM patients. The aim of this study was to develop and deploy NLP algorithms for automated extraction of syncope, FH-SCD, and FH-HCM from clinical narratives.
Methods and Results: We randomly selected 200 patients from the Mayo HCM registry for development (n = 100) and testing (n = 100) of NLP algorithms for extraction of syncope, FH-SCD, and FH-HCM from clinical narratives in EHRs. The clinical reference standard was manually abstracted by 2 independent annotators. Performance of the NLP algorithms was compared to aggregation and summarization of data entries in the HCM registry for syncope, FH-SCD, and FH-HCM. We also compared the NLP algorithms with billing codes for syncope and with responses to patient survey questions for FH-SCD and FH-HCM. For syncope, NLP had superior sensitivity (0.96 vs 0.39, p < 0.001) and comparable specificity (0.90 vs 0.92, p = 0.74) and positive predictive value (PPV) (0.90 vs 0.83, p = 0.37) compared to billing codes. For FH-SCD, NLP outperformed survey responses on all parameters (sensitivity: 0.91 vs 0.59, p = 0.002; specificity: 0.98 vs 0.50, p < 0.001; PPV: 0.97 vs 0.38, p < 0.001). For FH-HCM, NLP achieved superior sensitivity (0.95 vs 0.24, p < 0.001) with comparable specificity (0.95 vs 1.0, p-value not calculable) and PPV (0.92 vs 1.0, p = 0.09) compared to survey responses.
Conclusions: Automated extraction of syncope, FH-SCD and FH-HCM using NLP is feasible and has promise to increase efficiency of workflow for providers managing HCM patients.
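Note: The sensitivity, specificity, and PPV reported in the abstract are standard confusion-matrix ratios computed against a reference standard. The following is a minimal Python sketch of that calculation, not the authors' code. The function name `diagnostic_metrics` and the toy labels are hypothetical and are not taken from the study data.

```python
from typing import Dict, Sequence


def diagnostic_metrics(reference: Sequence[bool],
                       predicted: Sequence[bool]) -> Dict[str, float]:
    """Compare predicted labels against a reference standard.

    Hypothetical helper for illustration only; one boolean per patient,
    True meaning the risk factor (e.g. syncope) is present.
    """
    if len(reference) != len(predicted):
        raise ValueError("reference and predicted must have the same length")

    pairs = list(zip(reference, predicted))
    tp = sum(1 for r, p in pairs if r and p)          # true positives
    fn = sum(1 for r, p in pairs if r and not p)      # false negatives
    fp = sum(1 for r, p in pairs if not r and p)      # false positives
    tn = sum(1 for r, p in pairs if not r and not p)  # true negatives

    def ratio(numerator: int, denominator: int) -> float:
        # Return NaN instead of raising when a ratio is undefined.
        return numerator / denominator if denominator else float("nan")

    return {
        "sensitivity": ratio(tp, tp + fn),  # TP / (TP + FN)
        "specificity": ratio(tn, tn + fp),  # TN / (TN + FP)
        "ppv": ratio(tp, tp + fp),          # TP / (TP + FP)
    }


# Toy labels (made up): 8 patients, 4 with the risk factor per the reference.
reference = [True, True, True, False, False, False, False, True]
predicted = [True, True, False, False, False, True, False, True]
metrics = diagnostic_metrics(reference, predicted)
print(metrics)
```

In this sketch an undefined ratio (for example, PPV when nothing is predicted positive) is returned as NaN rather than raising an error.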
Pages: 32-38
Page count: 7