A case study in applying artificial intelligence-based named entity recognition to develop an automated ophthalmic disease registry

被引:2
作者
Macri, Carmelo Z. [1 ,2 ]
Teoh, Sheng Chieh [2 ]
Bacchi, Stephen [1 ,2 ]
Tan, Ian [2 ]
Casson, Robert [1 ,2 ]
Sun, Michelle T. [1 ,2 ]
Selva, Dinesh [1 ,2 ]
Chan, WengOnn [1 ,2 ]
机构
[1] Univ Adelaide, Discipline Ophthalmol & Visual Sci, Adelaide, SA, Australia
[2] Royal Adelaide Hosp, Dept Ophthalmol, Adelaide, SA, Australia
关键词
Named entity recognition; Electronic health records; Artificial intelligence; Registry; Case study; Application; Tool; DISAMBIGUATION; IDENTIFICATION; COMORBIDITY; ANNOTATION; ACCURACY; IDENTIFY; FAILURE; IMPACT;
D O I
10.1007/s00417-023-06190-2
中图分类号
R77 [眼科学];
学科分类号
100212 ;
摘要
PurposeAdvances in artificial intelligence (AI)-based named entity extraction (NER) have improved the ability to extract diagnostic entities from unstructured, narrative, free-text data in electronic health records. However, there is a lack of ready-to-use tools and workflows to encourage the use among clinicians who often lack experience and training in AI. We sought to demonstrate a case study for developing an automated registry of ophthalmic diseases accompanied by a ready-to-use low-code tool for clinicians.MethodsWe extracted deidentified electronic clinical records from a single centre's adult outpatient ophthalmology clinic from November 2019 to May 2022. We used a low-code annotation software tool (Prodigy) to annotate diagnoses and train a bespoke spaCy NER model to extract diagnoses and create an ophthalmic disease registry.ResultsA total of 123,194 diagnostic entities were extracted from 33,455 clinical records. After decapitalisation and removal of non-alphanumeric characters, there were 5070 distinct extracted diagnostic entities. The NER model achieved a precision of 0.8157, recall of 0.8099, and F score of 0.8128.ConclusionWe presented a case study using low-code artificial intelligence-based NLP tools to produce an automated ophthalmic disease registry. The workflow created a NER model with a moderate overall ability to extract diagnoses from free-text electronic clinical records. We have produced a ready-to-use tool for clinicians to implement this low-code workflow in their institutions and encourage the uptake of artificial intelligence methods for case finding in electronic health records.
引用
收藏
页码:3335 / 3344
页数:10
相关论文
共 57 条
[1]   Combining structured and unstructured data to identify a cohort of ICU patients who received dialysis [J].
Abhyankar, Swapna ;
Demner-Fushman, Dina ;
Callaghan, Fiona M. ;
McDonald, Clement J. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2014, 21 (05) :801-807
[2]   Natural language processing for the development of a clinical registry: a validation study in intraductal papillary mucinous neoplasms [J].
Al-Haddad, Mohammad A. ;
Friedlin, Jeff ;
Kesterson, Joe ;
Waters, Joshua A. ;
Aguilar-Saavedra, Juan R. ;
Schmidt, C. Max .
HPB, 2010, 12 (10) :688-695
[3]   Rare diseases in ICD11: making rare diseases visible in health information systems through appropriate coding [J].
Ayme, Segolene ;
Bellet, Bertrand ;
Rath, Ana .
ORPHANET JOURNAL OF RARE DISEASES, 2015, 10
[4]   Improving early diagnosis of rare diseases using Natural Language Processing in unstructured medical records: an illustration from Dravet syndrome [J].
Barco, Tommaso Lo ;
Kuchenbuch, Mathieu ;
Garcelon, Nicolas ;
Neuraz, Antoine ;
Nabbout, Rima .
ORPHANET JOURNAL OF RARE DISEASES, 2021, 16 (01)
[5]   Study of lipoprotein(a) and its impact on atherosclerotic cardiovascular disease: Design and rationale of the Mass General Brigham Lp(a) Registry [J].
Berman, Adam N. ;
Biery, David W. ;
Ginder, Curtis ;
Hulme, Olivia L. ;
Marcusa, Daniel ;
Leiva, Orly ;
Wu, Wanda Y. ;
Singh, Avinainder ;
Divakaran, Sanjay ;
Hainer, Jon ;
Turchin, Alexander ;
Januzzi, James L. ;
Natarajan, Pradeep ;
Cannon, Christopher P. ;
Di Carli, Marcelo F. ;
Bhatt, Deepak L. ;
Blankstein, Ron .
CLINICAL CARDIOLOGY, 2020, 43 (11) :1209-1215
[6]   Comparison of Approaches for Heart Failure Case Identification From Electronic Health Record Data [J].
Blecker, Saul ;
Katz, Stuart D. ;
Horwitz, Leora I. ;
Kuperman, Gilad ;
Park, Hannah ;
Gold, Alex ;
Sontag, David .
JAMA CARDIOLOGY, 2016, 1 (09) :1014-1020
[7]   Readiness to Embrace Artificial Intelligence Among Medical Doctors and Students: Questionnaire-Based Study [J].
Boillat, Thomas ;
Nawaz, Faisal A. ;
Rivas, Homero .
JMIR MEDICAL EDUCATION, 2022, 8 (02)
[8]   Is Administratively Coded Comorbidity and Complication Data in Total Joint Arthroplasty Valid? [J].
Bozic, Kevin J. ;
Bashyal, Ravi K. ;
Anthony, Shawn G. ;
Chiu, Vanessa ;
Shulman, Brandon ;
Rubash, Harry E. .
CLINICAL ORTHOPAEDICS AND RELATED RESEARCH, 2013, 471 (01) :201-205
[9]   Systematic review of discharge coding accuracy [J].
Burns, E. M. ;
Rigby, E. ;
Mamidanna, R. ;
Bottle, A. ;
Aylin, P. ;
Ziprin, P. ;
Faiz, O. D. .
JOURNAL OF PUBLIC HEALTH, 2012, 34 (01) :138-148
[10]   Evaluation of training with an annotation schema for manual annotation of clinical conditions from emergency department reports [J].
Chapman, Wendy W. ;
Dowling, John N. ;
Hripcsak, George .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2008, 77 (02) :107-113