Semantic similarity-based alignment between clinical archetypes and SNOMED CT: An application to observations

Cited by: 21
Authors
Meizoso Garcia, Maria [1]
Iglesias Allones, Jose Luis [1]
Martinez Hernandez, Diego [2]
Taboada Iglesias, Maria Jesus [1]
Affiliations
[1] Univ Santiago de Compostela, Dept Elect & Comp Sci, Santiago De Compostela, Spain
[2] Univ Santiago de Compostela, Dept Appl Phys, Santiago De Compostela, Spain
Keywords
Terminology mapping; Electronic Health Records; Clinical archetypes; SNOMED CT; Semantic interoperability; Knowledge representation
DOI
10.1016/j.ijmedinf.2012.02.007
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Purpose: One of the main challenges of eHealth is the semantic interoperability of health systems. This is possible only if the capture, representation and access of patient data are standardized. Clinical data models, such as OpenEHR archetypes, define data structures that are agreed upon by experts to ensure the accuracy of health information. They also provide a way to normalize clinical data by binding the terms used in the model definition to standard medical vocabularies. Nevertheless, establishing the association between archetype terms and standard terminology concepts requires considerable effort. The purpose of this study is therefore to provide an automated approach for binding OpenEHR archetype terms to the external terminology SNOMED CT, with the capability to do so at a semantic level.
Methods: This research combines lexical techniques and external terminological tools with context-based techniques, which use information about structural and semantic proximity to identify similarities between terms and thus to find alignments between them. The proposed approach exploits both the structural context of archetypes and the terminology context, in which concepts are logically defined through their relationships (hierarchical and definitional) to other concepts.
Results: A set of 25 OBSERVATION archetypes with 477 bound terms was used to test the method. Of these, 342 terms (74.6%) were linked, with 96.1% precision, 71.7% recall and an average of 1.23 SNOMED CT concepts per mapping. About one third of the clinical information in the archetypes was found to be grouped logically. Context-based techniques take advantage of this grouping to increase recall and to validate 30.4% of the bindings produced by lexical techniques.
Conclusions: This research shows that archetype terms can be mapped automatically to a standard terminology with high precision and recall, given appropriate contextual and semantic information from both models. Moreover, the semantic-based methods provide a means of validating and disambiguating the resulting bindings. This work is therefore a step towards reducing human participation in the mapping process. (C) 2012 Elsevier Ireland Ltd. All rights reserved.
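The combination of lexical and context-based matching described in the Methods can be illustrated with a minimal sketch. It is not the authors' implementation: the token-overlap (Jaccard) measure, the `context_weight` parameter, the function names and the two candidate concepts with placeholder identifiers `c1` and `c2` are all assumptions made for illustration. The sketch only shows how the structural context of an archetype term (its parent node) might disambiguate a binding that lexical similarity alone gets wrong.

```python
def tokens(text):
    """Lower-case a label and split it into a set of word tokens."""
    return set(text.lower().split())


def jaccard(a, b):
    """Token-overlap similarity between two labels, in [0, 1]."""
    ta, tb = tokens(a), tokens(b)
    union = ta | tb
    return len(ta & tb) / len(union) if union else 0.0


# Hypothetical candidates: placeholder id -> (concept label, parent concept label).
# These are NOT real SNOMED CT identifiers.
CANDIDATES = {
    "c1": ("Systolic blood pressure", "Blood pressure"),
    "c2": ("Systolic murmur", "Heart murmur"),
}


def best_match(term, parent, candidates, context_weight=0.5):
    """Return the id of the candidate with the highest combined score.

    The score is the lexical similarity between the archetype term and the
    concept label, plus a weighted similarity between the archetype term's
    parent node and the concept's parent (the context part). Passing
    parent=None disables the context part.
    """
    def score(item):
        label, concept_parent = item[1]
        lexical = jaccard(term, label)
        context = jaccard(parent, concept_parent) if parent else 0.0
        return lexical + context_weight * context

    return max(candidates.items(), key=score)[0]


# Lexical similarity alone prefers the shorter, unrelated label.
print(best_match("Systolic", None, CANDIDATES))              # c2
# Adding the parent node of the archetype term flips the choice.
print(best_match("Systolic", "Blood pressure", CANDIDATES))  # c1
```

In this toy case the archetype term "Systolic" shares one of two tokens with "Systolic murmur" but only one of three with "Systolic blood pressure", so a purely lexical score picks the wrong concept; the parent-node comparison supplies the evidence that corrects it.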
Pages: 566-578
Number of pages: 13