Use of the Systematized Nomenclature of Medicine Clinical Terms (SNOMED CT) for Processing Free Text in Health Care: Systematic Scoping Review

被引:45
作者
Gaudet-Blavignac, Christophe [1 ,2 ]
Foufi, Vasiliki [1 ,2 ]
Bjelogrlic, Mina [1 ,2 ]
Lovis, Christian [1 ,2 ]
机构
[1] Geneva Univ Hosp, Div Med Informat Sci, Rue Gabrielle Perret Gentil 4, CH-1205 Geneva, Switzerland
[2] Univ Geneva, Dept Radiol & Med Informat, Geneva, Switzerland
关键词
SNOMED CT; natural language processing; scoping review; terminology; INFORMATION EXTRACTION; LEARNING-SYSTEM; CLASSIFICATION; RADIOLOGY; ANNOTATION; DOCUMENTS; RECORDS; REUSE; NLP;
D O I
10.2196/24594
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Interoperability and secondary use of data is a challenge in health care. Specifically, the reuse of clinical free text remains an unresolved problem. The Systematized Nomenclature of Medicine Clinical Terms (SNOMED CT) has become the universal language of health care and presents characteristics of a natural language. Its use to represent clinical free text could constitute a solution to improve interoperability. Objective: Although the use of SNOMED and SNOMED CT has already been reviewed, its specific use in processing and representing unstructured data such as clinical free text has not. This review aims to better understand SNOMED CT's use for representing free text in medicine. Methods: A scoping review was performed on the topic by searching MEDLINE, Embase, and Web of Science for publications featuring free-text processing and SNOMED CT. A recursive reference review was conducted to broaden the scope of research. The review covered the type of processed data, the targeted language, the goal of the terminology binding, the method used and, when appropriate, the specific software used. Results: In total, 76 publications were selected for an extensive study. The language targeted by publications was 91% (n=69) English. The most frequent types of documents for which the terminology was used are complementary exam reports (n=18, 24%) and narrative notes (n=16, 21%). Mapping to SNOMED CT was the final goal of the research in 21% (n=16) of publications and a part of the final goal in 33% (n=25). The main objectives of mapping are information extraction (n=44, 39%), feature in a classification task (n=26, 23%), and data normalization (n=23, 20%). The method used was rule-based in 70% (n=53) of publications, hybrid in 11% (n=8), and machine learning in 5% (n=4). In total, 12 different software packages were used to map text to SNOMED CT concepts, the most frequent being Medtex, Mayo Clinic Vocabulary Server, and Medical Text Extraction Reasoning and Mapping System. Full terminology was used in 64% (n=49) of publications, whereas only a subset was used in 30% (n=23) of publications. Postcoordination was proposed in 17% (n=13) of publications, and only 5% (n=4) of publications specifically mentioned the use of the compositional grammar. Conclusions: SNOMED CT has been largely used to represent free-text data, most frequently with rule-based approaches, in English. However, currently, there is no easy solution for mapping free text to this terminology and to perform automatic postcoordination. Most solutions conceive SNOMED CT as a simple terminology rather than as a compositional bag of ontologies. Since 2012, the number of publications on this subject per year has decreased. However, the need for formal semantic representation of free text in health care is high, and automatic encoding into a compositional ontology could be a solution.
引用
收藏
页数:18
相关论文
共 109 条
[1]   The readiness of SNOMED problem list concepts for meaningful use of electronic health records [J].
Agrawal, Ankur ;
He, Zhe ;
Perl, Yehoshua ;
Wei, Duo ;
Halper, Michael ;
Elhanan, Gai ;
Chen, Yan .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2013, 58 (02) :73-80
[2]  
[Anonymous], 2012, 4 SWED LANG TECHN C
[3]   Automatic Extraction of Cancer Characteristics from Free-Text Pathology Reports for Cancer Notifications [J].
Anthony Nguyen ;
Moore, Julie ;
Lawley, Michael ;
Hansen, David ;
Colquist, Shoni .
HEALTH INFORMATICS: THE TRANSFORMATIVE POWER OF INNOVATION, 2011, 168 :117-124
[4]   Patient safety incidents involving neuromuscular blockade: analysis of the UK National Reporting and Learning System data from 2006 to 2008 [J].
Arnot-Smith, J. ;
Smith, A. F. .
ANAESTHESIA, 2010, 65 (11) :1106-1113
[5]   An overview of MetaMap: historical perspective and recent advances [J].
Aronson, Alan R. ;
Lang, Francois-Michel .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2010, 17 (03) :229-236
[6]  
Aronson AR, 2001, J AM MED INFORM ASSN, P17
[7]   Semi-structured document categorization with a semantic kernel [J].
Aseervatham, Sujeevan ;
Bennani, Younes .
PATTERN RECOGNITION, 2009, 42 (09) :2067-2076
[8]   A usability evaluation of a SNOMED CT based compositional interface terminology for intensive care [J].
Bakhshi-Raiez, F. ;
de Keizer, N. F. ;
Cornet, R. ;
Dorrepaal, M. ;
Dongelmans, D. ;
Jaspers, M. W. M. .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2012, 81 (05) :351-362
[9]   eQuality:: Electronic quality assessment from narrative clinical reports [J].
Brown, Steven H. ;
Speroff, Theodore ;
Fielstein, Elliot M. ;
Bauer, Brent A. ;
Wahner-Roedler, Dietlind L. ;
Greevy, Robert ;
Elkin, Peter L. .
MAYO CLINIC PROCEEDINGS, 2006, 81 (11) :1472-1481
[10]  
Brown Steven H, 2008, AMIA Annu Symp Proc, P71