Ontology-Based Data Integration between Clinical and Research Systems

被引:57
作者
Mate, Sebastian [1 ]
Koepcke, Felix [2 ]
Toddenroth, Dennis [1 ]
Martin, Marcus [3 ]
Prokosch, Hans-Ulrich [1 ,2 ]
Buerkle, Thomas [4 ]
Ganslandt, Thomas [2 ]
机构
[1] Univ Erlangen Nurnberg, Inst Med Informat, Erlangen, Germany
[2] Erlangen Univ Hosp, Ctr Med Informat & Commun, Erlangen, Germany
[3] Erlangen Univ Hosp, Tumor Ctr, Erlangen, Germany
[4] Bern Univ Appl Sci, Inst Med Informat, Bern, Switzerland
来源
PLOS ONE | 2015年 / 10卷 / 01期
关键词
ELECTRONIC HEALTH RECORDS; MEDICAL INFORMATICS; NCI THESAURUS; DATA ELEMENTS; INTEROPERABILITY; CANCER; FRAMEWORK; IDENTIFICATION; ARCHITECTURE; EXTRACTION;
D O I
10.1371/journal.pone.0116656
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Data from the electronic medical record comprise numerous structured but uncoded elements, which are not linked to standard terminologies. Reuse of such data for secondary research purposes has gained in importance recently. However, the identification of relevant data elements and the creation of database jobs for extraction, transformation and loading (ETL) are challenging: With current methods such as data warehousing, it is not feasible to efficiently maintain and reuse semantically complex data extraction and trans-formation routines. We present an ontology-supported approach to overcome this challenge by making use of abstraction: Instead of defining ETL procedures at the database level, we use ontologies to organize and describe the medical concepts of both the source system and the target system. Instead of using unique, specifically developed SQL statements or ETL jobs, we define declarative transformation rules within ontologies and illustrate how these constructs can then be used to automatically generate SQL code to perform the desired ETL procedures. This demonstrates how a suitable level of abstraction may not only aid the interpretation of clinical data, but can also foster the reutilization of methods for un-locking it.
引用
收藏
页数:20
相关论文
共 83 条
  • [1] Anitha J., 2014, INT J COMPUTER SCI M, V3, P610
  • [2] [Anonymous], 2004, RDF VOCABULARY DESCR
  • [3] [Anonymous], J AM MED INFORM ASS
  • [4] [Anonymous], 2011, Ontol. Matching
  • [5] Data Definition Ontology for clinical data integration and querying
    Assele Kama, Ariane
    Primadhanty, Audi
    Choquet, Remy
    Teodoro, Douglas
    Enders, Frank
    Duclos, Catherine
    Jaulent, Marie-Christine
    [J]. QUALITY OF LIFE THROUGH QUALITY OF INFORMATION, 2012, 180 : 38 - 42
  • [6] An adaptable architecture for patient cohort identification from diverse data sources
    Bache, Richard
    Miles, Simon
    Taweel, Adel
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2013, 20 (E2) : E327 - E333
  • [7] The Semantic Web - A new form of Web content that is meaningful to computers will unleash a revolution of new possibilities
    Berners-Lee, T
    Hendler, J
    Lassila, O
    [J]. SCIENTIFIC AMERICAN, 2001, 284 (05) : 34 - +
  • [8] BIZER C, 2004, P 3 INT SEM WEB C IC
  • [9] BLEICH HL, 1992, M D COMPUT, V9, P293
  • [10] HIS-based Kaplan-Meier plots - a single source approach for documenting and reusing routine survival information
    Breil, Bernhard
    Semjonow, Axel
    Mueller-Tidow, Carsten
    Fritz, Fleur
    Dugas, Martin
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2011, 11