Schema Mapping Discovery from Data Instances

被引:37
作者
Gottlob, Georg [1 ]
Senellart, Pierre [2 ,3 ,4 ]
机构
[1] Univ Oxford, Comp Lab, Oxford OX1 3QD, England
[2] Telecom ParisTech, Dept Informat & Reseaux, F-75634 Paris 13, France
[3] CNRS, LTCI, Paris, France
[4] Inst Telecom, Paris, France
基金
英国工程与自然科学研究理事会; 欧洲研究理事会;
关键词
Languages; Theory; Schema mapping; instance; complexity; match; data exchange;
D O I
10.1145/1667053.1667055
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We introduce a theoretical framework for discovering relationships between two database instances over distinct and unknown schemata. This framework is grounded in the context of data exchange. We formalize the problem of understanding the relationship between two instances as that of obtaining a schema mapping so that a minimum repair of this mapping provides a perfect description of the target instance given the source instance. We show that this definition yields "intuitive" results when applied on database instances derived from each other by basic operations. We study the complexity of decision problems related to this optimality notion in the context of different logical languages and show that, even in very restricted cases, the problem is of high complexity.
引用
收藏
页数:37
相关论文
共 31 条
[1]  
[Anonymous], 1979, Computers and Intractablity: A Guide to the Theory of NP-Completeness
[2]  
[Anonymous], 1936, Theorie der endlichen und unendlichen Graphen
[3]  
ARENAS M, 1999, P ACM SIGACT SIGMOD
[4]  
BEERI C, 1981, P ANN ACM S THEOR CO
[5]  
BERNSTEIN P, 2003, P C INN DAT SYST
[6]   USING SEMI-JOINS TO SOLVE RELATIONAL QUERIES [J].
BERNSTEIN, PA ;
CHIU, DMW .
JOURNAL OF THE ACM, 1981, 28 (01) :25-40
[7]  
Chandra Ashok K., 1977, P ANN ACM S THEOR CO
[8]  
CRESCENZI V, 2001, P S VER LARG DAT VLD
[9]  
Diestel R., 2005, GRAPH THEORY, VThird
[10]   RECURSIVE UNSOLVABILITY OF DECISION PROBLEM FOR CLASS OF DEFINITE FORMULAS [J].
DIPAOLA, RA .
JOURNAL OF THE ACM, 1969, 16 (02) :324-&