Clinical concept extraction: A methodology review

被引:92
作者
Fu, Sunyang [1 ,3 ]
Chen, David [1 ]
He, Huan [1 ]
Liu, Sijia [1 ]
Moon, Sungrim [1 ]
Peterson, Kevin J. [2 ,3 ]
Shen, Feichen [1 ]
Wang, Liwei [1 ]
Wang, Yanshan [1 ]
Wen, Andrew [1 ]
Zhao, Yiqing [1 ]
Sohn, Sunghwan [1 ]
Liu, Hongfang [1 ,3 ]
机构
[1] Mayo Clin, Dept Hlth Sci Res, 200 First St SW, Rochester, MN 55905 USA
[2] Mayo Clin, Dept Informat Technol, 200 First St SW, Rochester, MN 55905 USA
[3] Univ Minnesota Twin Cities, Minneapolis, MN 55455 USA
关键词
Concept extraction; Natural language processing; Information extraction; Electronic health records; Machine learning; Deep learning; NAMED ENTITY RECOGNITION; OF-THE-ART; PROTECTED HEALTH INFORMATION; CONVOLUTIONAL NEURAL-NETWORK; MEDICATION INFORMATION; DE-IDENTIFICATION; RISK-FACTORS; MULTIPLE-SCLEROSIS; RADIOLOGY REPORTS; ADVERSE EVENTS;
D O I
10.1016/j.jbi.2020.103526
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Background: Concept extraction, a subdomain of natural language processing (NLP) with a focus on extracting concepts of interest, has been adopted to computationally extract clinical information from text for a wide range of applications ranging from clinical decision support to care quality improvement. Objectives: In this literature review, we provide a methodology review of clinical concept extraction, aiming to catalog development processes, available methods and tools, and specific considerations when developing clinical concept extraction applications. Methods: Based on the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines, a literature search was conducted for retrieving EHR-based information extraction articles written in English and published from January 2009 through June 2019 from Ovid MEDLINE In-Process & Other Non-Indexed Citations, Ovid MEDLINE, Ovid EMBASE, Scopus, Web of Science, and the ACM Digital Library. Results: A total of 6,686 publications were retrieved. After title and abstract screening, 228 publications were selected. The methods used for developing clinical concept extraction applications were discussed in this review.
引用
收藏
页数:16
相关论文
共 216 条
[31]   THE EPISTEMOLOGY OF A RULE-BASED EXPERT SYSTEM - A FRAMEWORK FOR EXPLANATION [J].
CLANCEY, WJ .
ARTIFICIAL INTELLIGENCE, 1983, 20 (03) :215-251
[32]   Identifying smokers with a medical extraction system [J].
Clark, Cheryl ;
Good, Kathleen ;
Jezierny, Lesley ;
Macpherson, Melissa ;
Wilson, Brian ;
Chajewska, Urszula .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2008, 15 (01) :36-39
[33]   Deep learning for pharmacovigilance: recurrent neural network architectures for labeling adverse drug reactions in Twitter posts [J].
Cocos, Anne ;
Fiks, Alexander G. ;
Masino, Aaron J. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2017, 24 (04) :813-821
[34]   Agile text mining for the 2014 i2b2/UTHealth Cardiac risk factors challenge [J].
Cormack, James ;
Nath, Chinmoy ;
Milward, David ;
Raja, Kalpana ;
Jonnalagadda, Siddhartha R. .
JOURNAL OF BIOMEDICAL INFORMATICS, 2015, 58 :S120-S127
[35]   SUPPORT-VECTOR NETWORKS [J].
CORTES, C ;
VAPNIK, V .
MACHINE LEARNING, 1995, 20 (03) :273-297
[36]  
Cowie J., 2000, Handbook of Natural Language Processing, V56, P57
[37]   A neural network multi-task learning approach to biomedical named entity recognition [J].
Crichton, Gamal ;
Pyysalo, Sampo ;
Chiu, Billy ;
Korhonen, Anna .
BMC BIOINFORMATICS, 2017, 18
[38]   Complex epilepsy phenotype extraction from narrative clinical discharge summaries [J].
Cui, Licong ;
Sahoo, Satya S. ;
Lhatoo, Samden D. ;
Garg, Gaurav ;
Rai, Prashant ;
Bozorgi, Alireza ;
Zhang, Guo-Qiang .
JOURNAL OF BIOMEDICAL INFORMATICS, 2014, 51 :272-279
[39]   Automated extraction of clinical traits of multiple sclerosis in electronic medical records [J].
Davis, Mary F. ;
Sriram, Subramaniam ;
Bush, William S. ;
Denny, Joshua C. ;
Haines, Jonathan L. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2013, 20 (E2) :E334-E340
[40]  
Dehghan A, 2017, LUNG CANCER, V103, pS57