Generalized union and project operations for pooling uncertain and imprecise information

被引:35
作者
Bell, DA [1 ]
Guan, JW [1 ]
Lee, SK [1 ]
机构
[1] UNIV IOWA,DEPT COMP SCI,IOWA CITY,IA 52242
关键词
database systems; uncertainty; Dempster-Shafer theory; artificial intelligence; relational algebra; uncertain and imprecise information arising from different sources;
D O I
10.1016/0169-023X(95)00029-R
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We have previously proposed an extended relational data model with the objective of supporting uncertain information in a consistent and coherent manner. The model, which can represent both uncertainty and imprecision in data, is based on the Dempster-Shafer (D-S) theory of evidence, and it uses bel and pls functions of the theory, with their definitions extended somewhat for this purpose. Relational operations such as Select, Cartesian Product, Join, Project, Intersect, and Union have previously been defined [21]. In this paper we consider two data combination problems associated with the data model. These problems are believed to be inherent in most database models which handle uncertain information. The problems are: the potential existence in the database of identical tuples which have different respective degrees of belief (the redundancy problem), and the potential existence of different tuples with the same key values (the inconsistency problem). The redundancy problem was treated to some extent in an earlier paper, but the inconsistency problem has not been considered at all yet. Now the well-known orthogonal sum operation in the D-S theory, which performs the pooling of data for the purpose of making choices between hypotheses, may be viewed as a means of reducing inconsistency in data arising from different sources. This capability has not yet been exploited in our data model. So the idea here is to define a new combine operation as a primitive for handling inconsistency in relations. When data from a number of sources is being pooled - often in order to support decision making - the Union operation, and the Project operation, are very important. We are particularly interested in the case where tuples in operand relations match attribute-wise, but have different uncertainty and imprecision characteristics. The execution of both the Union and Project operations, which the new combine operation can help solve, is a means of dealing with the problem of information aggregation. We use the orthogonal sum, which generalizes results from traditional probability theory in a natural and correct manner, for pooling evidence during the combine computation. The paper also addresses the execution efficiency of our suggested approach. The orthogonal sum operation is exponentially complex if implemented naively. A linear time algorithm can readily be made available for Union and Project for the simple case where the attribute values to be combined are singletons (i.e., atomic values - as in the conventional relational model). However, many potential applications of the approach can exploit the new data model's facility of supporting set-valued attributes. In the method presented here we can combine data supporting non-singleton subsets in linear-time.
引用
收藏
页码:89 / 117
页数:29
相关论文
共 19 条
[1]  
ABITEBOUL S, 1984, P ACM PODS C WAT
[2]  
ANAND SS, 1995, TRANSPUTER OCCAM DEV
[3]  
[Anonymous], EVIDENCE THEORY ITS
[4]  
[Anonymous], EVIDENCE THEORY ITS
[5]   THE MANAGEMENT OF PROBABILISTIC DATA [J].
BARBARA, D ;
GARCIAMOLINA, H ;
PORTER, D .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1992, 4 (05) :487-502
[6]  
Barnett J.A., 1981, P 7 INT JOINT C ART, P868
[7]  
BELL D, 1992, MIE C
[8]   From data properties to evidence [J].
Bell, D.A. .
IEEE Transactions on Knowledge and Data Engineering, 1993, 5 (06) :965-969
[9]   A FUZZY REPRESENTATION OF DATA FOR RELATIONAL DATABASES [J].
BUCKLES, BP ;
PETRY, FE .
FUZZY SETS AND SYSTEMS, 1982, 7 (03) :213-226
[10]   A METHOD FOR MANAGING EVIDENTIAL REASONING IN A HIERARCHICAL HYPOTHESIS SPACE [J].
GORDON, J ;
SHORTLIFFE, EH .
ARTIFICIAL INTELLIGENCE, 1985, 26 (03) :323-357