Uncertain Data Queries Processing in a Probabilistic Framework

被引:0
作者
He, Ming [1 ]
Du, Yong-ping [1 ]
机构
[1] Beijing Univ Technol, Coll Comp Sci, Beijing, Peoples R China
基金
美国国家科学基金会;
关键词
probabilistic databases; uncertain data; information extraction; conditional random fields;
D O I
10.4304/jcp.5.11.1663-1669
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Many applications today need to manage data that is uncertain, such as information extraction (IE), data integration, sensor RFID networks, and scientific experiments. Top-k queries are often natural and useful in analyzing uncertain data in those applications. In this paper, we study the problem of answering top-k queries in a probabilistic framework from a state-of-the-art statistical IE model-semi-Conditional Random Fields (CRFs)-in the setting of Probabilistic Databases that treat statistical models as first-class data objects. We investigate the problem of ranking the answers to Probabilistic Databases query. We present efficient algorithm for finding the best approximating parameters in such a framework to efficiently retrieve the top-k ranked results. An empirical study using real data sets demonstrates the effectiveness of probabilistic top-k queries and the efficiency of our method.
引用
收藏
页码:1663 / 1669
页数:7
相关论文
共 14 条
[1]  
Abiteboul S., 1987, SIGMOD Record, V16, P34, DOI 10.1145/38714.38724
[2]   A Survey of Uncertain Data Algorithms and Applications [J].
Aggarwal, Charu C. ;
Yu, Philip S. .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (05) :609-623
[3]  
Agrawal Jagrati, 2008, P 2008 ACM SIGMOD IN
[4]  
BURDICK D, 2006, P 32 INT C VER LARG, P391
[5]  
Califf ME, 1999, SIXTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-99)/ELEVENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE (IAAI-99), P328
[6]  
Deshpande A., 2004, P 30 INT C VERY LARG, V30, P588
[7]   A probabilistic relational algebra for the integration of information retrieval and database systems [J].
Fuhr, N ;
Rolleke, T .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 1997, 15 (01) :32-66
[8]  
Green T., 2006, DATA ENG B, V29
[9]  
GUPTA R, 2006, VLDB, P965
[10]  
Lafferty John, 2001, P 18 INT C MACH LEAR, V1, P282