Matching Schemas of Heterogeneous Relational Databases

被引:3
作者
Karasneh, Yaser [1 ]
Ibrahim, Hamidah [1 ]
Othman, Mohamed [1 ]
Yaakob, Razali [1 ]
机构
[1] Univ Putra Malaysia, Fac Comp Sci & Informat Technol, Dept Comp Sci, Serdang 43400, Selangar De, Malaysia
来源
2009 SECOND INTERNATIONAL CONFERENCE ON THE APPLICATIONS OF DIGITAL INFORMATION AND WEB TECHNOLOGIES (ICADIWT 2009) | 2009年
关键词
D O I
10.1109/ICADIWT.2009.5273926
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Schema matching is a basic problem in many database application domains, such as data integration. The problem of schema matching can be formulated as follows, "given two schemas, Si and Si, find the most plausible correspondences between the elements of Si and Si, exploiting all available information, such as the schemas, instance data, and auxiliary sources" [24]. Given the rapidly increasing number of data sources to integrate and due to database heterogeneities, manually identifying schema matches is a tedious, time consuming, error-prone, and therefore expensive process. As systems become able to handle more complex databases and applications, their schemas become large, further increasing the number of matches to be performed. Thus, automating this process, which attempts to achieve faster and less labor-intensive, has been one of the main tasks in data integration. However, it is not possible to determine fully automatically the different correspondences between schemas, primarily because of the differing and often not explicated or documented semantics of the schemas. Several solutions in solving the issues of schema matching have been proposed. Nevertheless, these solutions are still limited, as they do not explore most of the available information related to schemas and thus affect the result of integration. This paper presents an approach for matching schemas of heterogeneous relational databases that utilizes most of the information related to schemas, which indirectly explores the implicit semantics of the schemas, that further improves the results of the integration.
引用
收藏
页码:1 / 7
页数:7
相关论文
共 24 条
[1]  
Bernstein PA, 2004, SIGMOD REC, V33, P38, DOI 10.1145/1041410.1041417
[2]  
Bilke A, 2005, PROC INT CONF DATA, P69
[3]   MDSM: Microarray database schema matching using the Hungarian method [J].
Chen, Yi-Ping Phoebe ;
Promparmote, Supawan ;
Maire, Frederic .
INFORMATION SCIENCES, 2006, 176 (19) :2771-2790
[4]  
DHAMANKAR ROBIN., 2004, SIGMOD Conf, P383
[5]   Learning to match the schemas of data sources: A multistrategy approach [J].
Doan, A ;
Domingos, P ;
Halevy, A .
MACHINE LEARNING, 2003, 50 (03) :279-301
[6]  
Doan A, 2005, AI MAG, V26, P83
[7]   Poster session:: An indexing structure for automatic schema matching [J].
Duchateau, Fabien ;
Bellahsene, Zohra ;
Roantree, Mark ;
Roche, Mathieu .
2007 IEEE 23RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOP, VOLS 1-2, 2007, :485-+
[8]   SEMINT: A tool for identifying attribute correspondences in heterogeneous databases using neural networks [J].
Li, WS ;
Clifton, C .
DATA & KNOWLEDGE ENGINEERING, 2000, 33 (01) :49-84
[9]  
Lu JG, 2005, LECT NOTES COMPUT SC, V3579, P273
[10]  
Madhavan J., 2001, Proceedings of the 27th International Conference on Very Large Data Bases, P49