Incremental data integration based on hierarchical metadata registry with data visibility

被引:6
|
作者
Jeong, D [1 ]
Baik, DK [1 ]
机构
[1] Korea Univ, Dept Comp Sci & Engn, Software Syst Lab, Seoul 136701, South Korea
关键词
metadata registry; MDR; data visibility; incremental data integration; hierarchical MDR;
D O I
10.1016/j.ins.2003.09.008
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A considerable number of researches have been studied on data integration based on metadata. However, existing approaches require too much cost to build an initial guideline. Most important reason is that the previous researches have not seriously considered the corresponding domain properties such as the data level and the user level. First, it is difficult in practice to create a standardized guideline on the entire data set, if there is a restricted cost given. Thus, a set of data to be integrated should be selected first. However, most databases have no statistical information that may be used to select such a set of data according to its usability. In this paper, we propose LOG (localization-based global metadata registry) methodology to build a guideline and integrate databases progressively considering the domain properties. The key idea is that the priorities of databases to be integrated are determined by the relationship to the domain properties. We also show the implementation by applying it to actual databases in Korea Institute of Science and Technology Information, which builds and manages a considerable number of databases on the science and technology in Korea. The LOG provides an incremental build method of metadata registry, and also supports progressive data integration mechanism on the existing distributed databases. It especially gives successful and efficient output on the creation of a standard guideline in the situation where the given cost is restricted. (C) 2003 Elsevier Inc. All rights reserved.
引用
收藏
页码:147 / 181
页数:35
相关论文
共 50 条
  • [21] Syntactic and semantic metadata integration for science data use
    Movva, S
    Ramachandran, R
    Li, X
    Khaire, S
    Keiser, K
    Conover, H
    Graves, S
    COMPUTERS & GEOSCIENCES, 2005, 31 (09) : 1126 - 1134
  • [22] Data integration by fuzzy similarity-based hierarchical clustering
    Ciaramella, Angelo
    Nardone, Davide
    Staiano, Antonino
    BMC BIOINFORMATICS, 2020, 21 (Suppl 10)
  • [23] PYRAMID: A HETEROGENEOUS DATA INTEGRATION ALGORITHM BASED ON HIERARCHICAL GRAPH
    Jiang, Sining
    Lan, Yujun
    Wang, Weigang
    Guo, Zhongwen
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 6220 - 6224
  • [24] Data integration by fuzzy similarity-based hierarchical clustering
    Angelo Ciaramella
    Davide Nardone
    Antonino Staiano
    BMC Bioinformatics, 21
  • [25] An Efficient Method of Data Quality Evaluation using Metadata Registry
    Choi, O-Hoon
    Lim, Jung-Eim
    Na, Hong-Seok
    Seong, Kwan-Jae
    Baik, Doo-Kwon
    PROCEEDINGS OF THE 2008 ADVANCED SOFTWARE ENGINEERING & ITS APPLICATIONS, 2008, : 9 - +
  • [26] Development of Quality Control Method for Visibility Data Based on the Characteristics of Visibility Data
    Oh, Yu-Joo
    Suh, Myoung-Seok
    KOREAN JOURNAL OF REMOTE SENSING, 2020, 36 (05) : 707 - 723
  • [27] Incremental Signaling Pathway Modeling by Data Integration
    Koh, Geoffrey
    Hsu, David
    Thiagarajan, P. S.
    RESEARCH IN COMPUTATIONAL MOLECULAR BIOLOGY, PROCEEDINGS, 2010, 6044 : 281 - +
  • [28] A Method for Complex Hierarchical Data Integration
    Maleszka, Marcin
    Ngoc Thanh Nguyen
    CYBERNETICS AND SYSTEMS, 2011, 42 (05) : 358 - 378
  • [29] Semantic data integration in hierarchical domains
    Cruz, IF
    Rajendran, A
    IEEE INTELLIGENT SYSTEMS, 2003, 18 (02): : 66 - 73
  • [30] Hierarchical metadata driven view dependency in spatial data warehouses
    Yu, SM
    SIXTEENTH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2005, : 448 - 452