A case study in applying artificial intelligence-based named entity recognition to develop an automated ophthalmic disease registry

被引：2

作者：

Macri, Carmelo Z. ^{[1
,2
]}

Teoh, Sheng Chieh ^{[2
]}

Bacchi, Stephen ^{[1
,2
]}

Tan, Ian ^{[2
]}

Casson, Robert ^{[1
,2
]}

Sun, Michelle T. ^{[1
,2
]}

Selva, Dinesh ^{[1
,2
]}

Chan, WengOnn ^{[1
,2
]}

机构：

[1] Univ Adelaide, Discipline Ophthalmol & Visual Sci, Adelaide, SA, Australia

[2] Royal Adelaide Hosp, Dept Ophthalmol, Adelaide, SA, Australia

来源：

GRAEFES ARCHIVE FOR CLINICAL AND EXPERIMENTAL OPHTHALMOLOGY | 2023年 / 261卷 / 11期

关键词：

Named entity recognition; Electronic health records; Artificial intelligence; Registry; Case study; Application; Tool; DISAMBIGUATION; IDENTIFICATION; COMORBIDITY; ANNOTATION; ACCURACY; IDENTIFY; FAILURE; IMPACT;

D O I：

10.1007/s00417-023-06190-2

中图分类号：

R77 [眼科学];

学科分类号：

100212 ;

摘要：

PurposeAdvances in artificial intelligence (AI)-based named entity extraction (NER) have improved the ability to extract diagnostic entities from unstructured, narrative, free-text data in electronic health records. However, there is a lack of ready-to-use tools and workflows to encourage the use among clinicians who often lack experience and training in AI. We sought to demonstrate a case study for developing an automated registry of ophthalmic diseases accompanied by a ready-to-use low-code tool for clinicians.MethodsWe extracted deidentified electronic clinical records from a single centre's adult outpatient ophthalmology clinic from November 2019 to May 2022. We used a low-code annotation software tool (Prodigy) to annotate diagnoses and train a bespoke spaCy NER model to extract diagnoses and create an ophthalmic disease registry.ResultsA total of 123,194 diagnostic entities were extracted from 33,455 clinical records. After decapitalisation and removal of non-alphanumeric characters, there were 5070 distinct extracted diagnostic entities. The NER model achieved a precision of 0.8157, recall of 0.8099, and F score of 0.8128.ConclusionWe presented a case study using low-code artificial intelligence-based NLP tools to produce an automated ophthalmic disease registry. The workflow created a NER model with a moderate overall ability to extract diagnoses from free-text electronic clinical records. We have produced a ready-to-use tool for clinicians to implement this low-code workflow in their institutions and encourage the uptake of artificial intelligence methods for case finding in electronic health records.

引用

页码：3335 / 3344

页数：10

共 57 条

[1] Combining structured and unstructured data to identify a cohort of ICU patients who received dialysis [J].

Abhyankar, Swapna ;

Demner-Fushman, Dina ;

Callaghan, Fiona M. ;

McDonald, Clement J. .

JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2014, 21 (05) :801-807

[2] Natural language processing for the development of a clinical registry: a validation study in intraductal papillary mucinous neoplasms [J].

Al-Haddad, Mohammad A. ;

Friedlin, Jeff ;

Kesterson, Joe ;

Waters, Joshua A. ;

Aguilar-Saavedra, Juan R. ;

Schmidt, C. Max .

HPB, 2010, 12 (10) :688-695

[3] Rare diseases in ICD11: making rare diseases visible in health information systems through appropriate coding [J].

Ayme, Segolene ;

Bellet, Bertrand ;

Rath, Ana .

ORPHANET JOURNAL OF RARE DISEASES, 2015, 10

[4] Improving early diagnosis of rare diseases using Natural Language Processing in unstructured medical records: an illustration from Dravet syndrome [J].

Barco, Tommaso Lo ;

Kuchenbuch, Mathieu ;

Garcelon, Nicolas ;

Neuraz, Antoine ;

Nabbout, Rima .

ORPHANET JOURNAL OF RARE DISEASES, 2021, 16 (01)

[5] Study of lipoprotein(a) and its impact on atherosclerotic cardiovascular disease: Design and rationale of the Mass General Brigham Lp(a) Registry [J].

Berman, Adam N. ;

Biery, David W. ;

Ginder, Curtis ;

Hulme, Olivia L. ;

Marcusa, Daniel ;

Leiva, Orly ;

Wu, Wanda Y. ;

Singh, Avinainder ;

Divakaran, Sanjay ;

Hainer, Jon ;

Turchin, Alexander ;

Januzzi, James L. ;

Natarajan, Pradeep ;

Cannon, Christopher P. ;

Di Carli, Marcelo F. ;

Bhatt, Deepak L. ;

Blankstein, Ron .

CLINICAL CARDIOLOGY, 2020, 43 (11) :1209-1215

[6] Comparison of Approaches for Heart Failure Case Identification From Electronic Health Record Data [J].

Blecker, Saul ;

Katz, Stuart D. ;

Horwitz, Leora I. ;

Kuperman, Gilad ;

Park, Hannah ;

Gold, Alex ;

Sontag, David .

JAMA CARDIOLOGY, 2016, 1 (09) :1014-1020

[7] Readiness to Embrace Artificial Intelligence Among Medical Doctors and Students: Questionnaire-Based Study [J].

Boillat, Thomas ;

Nawaz, Faisal A. ;

Rivas, Homero .

JMIR MEDICAL EDUCATION, 2022, 8 (02)

[8] Is Administratively Coded Comorbidity and Complication Data in Total Joint Arthroplasty Valid? [J].

Bozic, Kevin J. ;

Bashyal, Ravi K. ;

Anthony, Shawn G. ;

Chiu, Vanessa ;

Shulman, Brandon ;

Rubash, Harry E. .

CLINICAL ORTHOPAEDICS AND RELATED RESEARCH, 2013, 471 (01) :201-205

[9] Systematic review of discharge coding accuracy [J].

Burns, E. M. ;

Rigby, E. ;

Mamidanna, R. ;

Bottle, A. ;

Aylin, P. ;

Ziprin, P. ;

Faiz, O. D. .

JOURNAL OF PUBLIC HEALTH, 2012, 34 (01) :138-148

[10] Evaluation of training with an annotation schema for manual annotation of clinical conditions from emergency department reports [J].

Chapman, Wendy W. ;

Dowling, John N. ;

Hripcsak, George .

INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2008, 77 (02) :107-113

← 1 2 3 4 5 6 →