An efficient algorithm for top-k queries on uncertain data streams

被引:0
作者
Dai, Caiyan [1 ]
Chen, Ling [2 ]
Chen, Yixin [3 ]
Tang, Keming [4 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Jiangsu, Peoples R China
[2] Nanjing Univ, Dept Comp Sci, State Key Lab Novel Software Technol, Nanjing, Jiangsu, Peoples R China
[3] Washington Univ, Dept Comp Sci, St Louis, MO 63130 USA
[4] Yancheng Teachers Univ, Coll Informat Sci & Technol, Yancheng, Peoples R China
来源
2012 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2012), VOL 1 | 2012年
关键词
uncertain data streams; top-k queries; sliding-window;
D O I
10.1109/ICMLA.2012.57
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We tackle the problem of answering maximum probabilistic top-k tuple set queries. We use a sliding-window model on uncertain data streams and present an efficient algorithm for processing sliding-window queries on uncertain streams. In each sliding window, the algorithm selects the k tuples with the highest probabilities from sets of different numbers of the tuples with the highest scores. Then, the algorithm computes existential probability of the top-k tuples, and chooses the set with the highest probability as the top-k query result. We theoretically prove the correctness of the algorithm. Our experimental results show that our algorithm requires lower time and space complexity than other existing algorithms.
引用
收藏
页码:294 / 299
页数:6
相关论文
共 19 条
[1]  
[Anonymous], 2007, PROC IEEE 23 INT C D
[2]  
Babcock B., 2002, PODS, P1, DOI [DOI 10.1145/543613.543615, 10.1145/543613.543615]
[3]   A dynamic data structure for top-k queries on uncertain data [J].
Chen, Jiang ;
Yi, Ke .
THEORETICAL COMPUTER SCIENCE, 2008, 407 (1-3) :310-317
[4]  
CHUI CK, 2007, PAKDD, V4426, P47, DOI DOI 10.1007/978-3-540-71701-0_8
[5]   Semantics of Ranking Queries for Probabilistic Data and Expected Ranks [J].
Cormode, Graham ;
Li, Feifei ;
Yi, Ke .
ICDE: 2009 IEEE 25TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2009, :305-316
[6]  
Ge TJ, 2009, ACM SIGMOD/PODS 2009 CONFERENCE, P375
[7]  
Golab L., 2006, THESIS
[8]  
Haghani Parisa., 2009, CIKM, P877
[9]   Efficiently answering probabilistic threshold top-k queries on uncertain data [J].
Hua, Ming ;
Pei, Jian ;
Zhang, Wenjie ;
Lin, Xuemin .
2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, :1403-+
[10]   Sliding-window top-k queries on uncertain streams [J].
Jin, Cheqing ;
Yi, Ke ;
Chen, Lei ;
Yu, Jeffrey Xu ;
Lin, Xuemin .
VLDB JOURNAL, 2010, 19 (03) :411-435