Top-k Keyword Search over Probabilistic XML Data

被引:0
作者
Li, Jianxin [1 ]
Liu, Chengfei [1 ]
Zhou, Rui [1 ]
Wang, Wei [2 ]
机构
[1] Swinburne Univ Technol, Hawthorn, Vic 3122, Australia
[2] Univ New South Wales, Sydney, NSW 2052, Australia
来源
IEEE 27TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2011) | 2011年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite the proliferation of work on XML keyword query, it remains open to support keyword query over probabilistic XML data. Compared with traditional keyword search, it is far more expensive to answer a keyword query over probabilistic XML data due to the consideration of possible world semantics. In this paper, we firstly define the new problem of studying top-k keyword search over probabilistic XML data, which is to retrieve k SLCA results with the k highest probabilities of existence. And then we propose two efficient algorithms. The first algorithm PrStack can find k SLCA results with the k highest probabilities by scanning the relevant keyword nodes only once. To further improve the efficiency, we propose a second algorithm EagerTopK based on a set of pruning properties which can quickly prune unsatisfied SLCA candidates. Finally, we implement the two algorithms and compare their performance with analysis of extensive experimental results.
引用
收藏
页码:673 / 684
页数:12
相关论文
共 25 条
[1]  
Abiteboul S, 2006, LECT NOTES COMPUT SC, V3896, P1059
[2]   On the expressiveness of probabilistic XML models [J].
Abiteboul, Serge ;
Kimelfeld, Benny ;
Sagiv, Yehoshua ;
Senellart, Pierre .
VLDB JOURNAL, 2009, 18 (05) :1041-1064
[3]  
[Anonymous], 2003, Proceedings of the 2003 ACM SIGMOD international conference on Management of data
[4]  
[Anonymous], P ACM SIGMOD INT C M
[5]  
[Anonymous], 2004, Proceedings of the Thirtieth international conference on Very Large Databases-Volume
[6]  
Bao ZF, 2009, PROC INT CONF DATA, P517, DOI 10.1109/ICDE.2009.16
[7]  
Chang L., 2009, EDBT, P156
[8]   Incorporating Constraints in Probabilistic XML [J].
Cohen, Sara ;
Kimelfeld, Benny ;
Sagiv, Yehoshua .
ACM TRANSACTIONS ON DATABASE SYSTEMS, 2009, 34 (03)
[9]   Integrating keyword search into XML query processing [J].
Florescu, D ;
Kossmann, D ;
Manolescu, I .
COMPUTER NETWORKS, 2000, 33 (1-6) :119-135
[10]   PXML: A probabilistic semistructured data model and algebra [J].
Hung, E ;
Getoor, L ;
Subrahmanian, VS .
19TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2003, :467-478