Matching Attributes Across Overlapping Heterogeneous Data Sources Using Mutual Information

被引:3
|
作者
Zhao, Huimin [1 ]
机构
[1] Univ Wisconsin Milwaukee, Sheldon B Lubar Sch Business, Milwaukee, WI 53201 USA
关键词
Attribute Correspondence; Attribute Matching; Heterogeneous Databases; Information Theory; Mutual Information; SEMANTIC-INTEGRATION; SCHEMA; CORRESPONDENCES; RETRIEVAL; DATABASES;
D O I
10.4018/jdm.2010100105
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Identifying matching attributes across heterogeneous data sources is a critical and time-consuming step in integrating the data sources. In this paper, the author proposes a method for matching the most frequently encountered types of attributes across overlapping heterogeneous data sources. The author uses mutual information as a unified measure of dependence on various types of attributes. An example is used to demonstrate the utility of the proposed method, which is useful in developing practical attribute matching tools.
引用
收藏
页码:91 / 110
页数:20
相关论文
共 48 条
  • [1] Exploring attribute correspondences across heterogeneous databases by mutual information
    Zhao, HM
    Soofi, ES
    JOURNAL OF MANAGEMENT INFORMATION SYSTEMS, 2006, 22 (04) : 305 - 336
  • [2] Entity matching across heterogeneous data sources: An approach based on constrained cascade generalization
    Zhao, Huimin
    Ram, Sudha
    DATA & KNOWLEDGE ENGINEERING, 2008, 66 (03) : 368 - 381
  • [3] Mutual Information Based Matching for Causal Inference with Observational Data
    Sun, Lei
    Nikolaev, Alexander G.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2016, 17
  • [4] The architecture for semantic data access to heterogeneous information sources
    Rishe, N
    Vaschillo, A
    Vasilevsky, D
    Shaposhnikov, A
    Chen, SC
    COMPUTERS AND THEIR APPLICATIONS, 2000, : 134 - 139
  • [5] Research on Semantic Integration across Heterogeneous Data Sources in Grid
    Liu, Guofeng
    Huang, Shaobin
    Cheng, Yuan
    FRONTIERS IN COMPUTER EDUCATION, 2012, 133 : 397 - 404
  • [6] Robust Multisensor Image Matching Using Bayesian Estimated Mutual Information
    Yan, Yuzhuang
    Shen, Lurong
    Zheng, Yongbin
    Xu, Wanying
    Huang, Xinsheng
    MECHATRONICS AND INDUSTRIAL INFORMATICS, PTS 1-4, 2013, 321-324 : 541 - 548
  • [7] Combining schema and instance information for integrating heterogeneous data sources
    Zhao, Huimin
    Ram, Sudha
    DATA & KNOWLEDGE ENGINEERING, 2007, 61 (02) : 281 - 303
  • [8] Integrating domain heterogeneous data sources using decomposition aggregation queries
    Xu, Jian
    Pottinger, Rachel
    INFORMATION SYSTEMS, 2014, 39 : 80 - 107
  • [9] Analysis and Visualization of Seismic Data Using Mutual Information
    Tenreiro Machado, Jose A.
    Lopes, Antonio M.
    ENTROPY, 2013, 15 (09) : 3892 - 3909
  • [10] Unified Access to Heterogeneous Data Sources Using an Ontology
    Mercier, Daniel
    Cheong, Hyunmin
    Tapaswi, Chaitanya
    SEMANTIC TECHNOLOGY (JIST 2018), 2018, 11341 : 104 - 118