Customized Policies for Handling Partial Information in Relational Databases

被引:1
|
作者
Martinez, Maria Vanina [1 ]
Molinaro, Cristian [2 ]
Grant, John [3 ]
Subrahmanian, V. S. [4 ]
机构
[1] Univ Oxford, Dept Comp Sci, Oxford OX1 3QD, England
[2] Univ Calabria, Dipartimento Elettron Informat & Sistemist, I-87036 Arcavacata Di Rende, CS, Italy
[3] Towson Univ, Dept Math, Towson, MD 21252 USA
[4] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA
关键词
Knowledge personalization and customization; database semantics; NULL VALUES; FUNCTIONAL-DEPENDENCIES;
D O I
10.1109/TKDE.2012.91
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most real-world databases have at least some missing data. Today, users of such databases are "on their own" in terms of how they manage this incompleteness. In this paper, we propose the general concept of partial information policy (PIP) operator to handle incompleteness in relational databases. PIP operators build upon preference frameworks for incomplete information, but accommodate different types of incomplete data (e.g., a value exists but is not known; a value does not exist; a value may or may not exist). Different users in the real world have different ways in which they want to handle incompleteness-PIP operators allow them to specify a policy that matches their attitude to risk and their knowledge of the application and how the data was collected. We propose index structures for efficiently evaluating PIP operators and experimentally assess their effectiveness on a real-world airline data set. We also study how relational algebra operators and PIP operators interact with one another.
引用
收藏
页码:1254 / 1271
页数:18
相关论文
共 50 条