Efficiency frontiers of XML cardinality constraints

被引:13
作者
Ferrarotti, Flavio [1 ]
Hartmann, Sven [2 ]
Link, Sebastian [3 ]
机构
[1] Victoria Univ Wellington, Sch Informat Management, Wellington, New Zealand
[2] Tech Univ Clausthal, Dept Informat, Clausthal Zellerfeld, Germany
[3] Univ Auckland, Dept Comp Sci, Auckland 1, New Zealand
关键词
Semi-structured data and XML; Database semantics; Database constraints; Cardinality constraints; Cardinality estimation; ENTITY-RELATIONSHIP; SELECTIVITY ESTIMATION; NORMAL FORMS; DEPENDENCIES; KEYS;
D O I
10.1016/j.datak.2012.09.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
XML has gained widespread acceptance as a premier format for publishing, sharing and manipulating data through the web. While the semi-structured nature of XML provides a high degree of syntactic flexibility there are significant shortcomings when it comes to specifying the semantics of XML data. For the advancement of XML applications it is therefore a major challenge to discover natural classes of constraints that can be utilized effectively by XML data engineers. This endeavor is ambitious given the multitude of intractability results that have been established. We investigate a class of XML cardinality constraints that is precious in the sense that it keeps the right balance between expressiveness and efficiency of maintenance. In particular, we characterize the associated implication problem axiomatically and develop a low-degree polynomial time algorithm that can be readily applied for deciding implication. Our class of constraints is chosen near-optimal as already minor extensions of its expressiveness cause potential intractability. Finally, we transfer our findings to establish a precious class of soft cardinality constraints on XML data. Soft cardinality constraints need to be satisfied on average only, and thus permit violations in a controlled manner. Soft constraints are therefore able to tolerate exceptions that frequently occur in practice, yet can be reasoned about efficiently. (C) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:297 / 319
页数:23
相关论文
共 55 条
[1]  
Abiteboul S., 1995, Foundations of databases, V8
[2]  
[Anonymous], 1999, Graphs, Networks and Algorithm
[3]  
[Anonymous], P IJCAI 03 WORKSH IN
[4]  
Apparao V., 1998, DOCUMENT OBJECT MODE
[5]   An information-theoretic approach to normal forms for relational and XML data [J].
Arenas, M ;
Libkin, L .
JOURNAL OF THE ACM, 2005, 52 (02) :246-283
[6]  
Arenas M., 2002, Database and Expert Systems Applications. 13th International Conference, DEXA 2002. Proceedings (Lecture Notes in Computer Science Vol.2453), P269
[7]   On the complexity of verifying consistency of XML specifications [J].
Arenas, Marcelo ;
Fan, Wenfei ;
Libkin, Leonid .
SIAM JOURNAL ON COMPUTING, 2008, 38 (03) :841-880
[8]  
Artale A, 2007, LECT NOTES COMPUT SC, V4801, P277
[9]  
Bray T., 2004, Extensible Markup Language (XML) 1.0, VThird
[10]   Keys for XML [J].
Buneman, P ;
Davidson, S ;
Fan, WF ;
Hara, C ;
Tan, WC .
COMPUTER NETWORKS-THE INTERNATIONAL JOURNAL OF COMPUTER AND TELECOMMUNICATIONS NETWORKING, 2002, 39 (05) :473-487