A Schema Matching Method Based on Partial Functional Dependencies

被引:1
作者
Li Guo-Hui [1 ]
Du Xiao-Kun [1 ]
Hu Fang-Xiao [1 ]
Du Jian-Qiang [2 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan 430074, Hubei, Peoples R China
[2] Jiangxi Univ Tradit Chinese Med, Sch Comp Sci, Nanchang 330006, Jiangxi, Peoples R China
来源
FCST: 2008 JAPAN-CHINA JOINT WORKSHOP ON FRONTIER OF COMPUTER SCIENCE AND TECHNOLOGY, PROCEEDINGS | 2008年
关键词
schema matching; partial functional dependency; structure matching;
D O I
10.1109/FCST.2008.30
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Schema matching is a difficulty in many database application domains, e.g., data integration, E-business, data warehousing and semantic query processing. We can get correct schema mapping by mining the semantics of elements from the elements' own information (e.g., elements' names and elements' data types and domains), data instances and structure information. But in fact, most existing related works only consider elements' own information, and data instances and structure information are seldom used for schema matching. At present, there is a trend to combine the element's own information with elements' data instance information and structure information for schema matching in order to improve the matching accuracy. The new method proposed in this paper uses elements' data instances and structure information to support matching. For a pair of element x and y, if a very small number of tuples are deleted from the table, x fully functionally determines y. Such kind of functional dependencies are called partial functional dependencies. A set of strategies are introduced in this paper which utilize these partial functional dependencies to improve schema matching efficiency and accuracy. Extensive simulation experiments are conducted and the results show that this method is better than other related methods in various performance metrics such as precision, recall and overall.
引用
收藏
页码:131 / +
页数:2
相关论文
共 14 条
[1]  
AUMUELLER D, 2005, P SIGMOD 2005
[2]   Relational decomposition through partial functional dependencies [J].
Berzal, F ;
Cubero, JC ;
Cuenca, F ;
Medina, JM .
DATA & KNOWLEDGE ENGINEERING, 2002, 43 (02) :207-234
[3]  
BILKE A, 2005, P 21 INT C DAT ENG I
[4]  
BOHANNON P, P VLDB 2006
[5]  
DO HH, 2002, P VLDB
[6]  
Gusfield D., 1989, STABLE MARRIAGE PROB
[7]   SEMINT: A tool for identifying attribute correspondences in heterogeneous databases using neural networks [J].
Li, WS ;
Clifton, C .
DATA & KNOWLEDGE ENGINEERING, 2000, 33 (01) :49-84
[8]  
LU ZN, 2006, ZHANG HUAISHENG FDN
[9]  
MADHAVAN J, P VLDB 2001
[10]  
MADHAVAN J, 2005, P 21 INT C DAT ENG I