Continuous Top-k Monitoring on Document Streams

被引:9
|
作者
Hou, Leong U. [1 ]
Zhang, Junjie [1 ]
Mouratidis, Kyriakos [2 ]
Li, Ye [1 ]
机构
[1] Univ Macau, Dept Comp & Informat Sci, Macau, Peoples R China
[2] Singapore Management Univ, Sch Informat Syst, Singapore 188065, Singapore
关键词
Top-k query; continuous query; document stream; QUERIES;
D O I
10.1109/TKDE.2017.2657622
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The efficient processing of document streams plays an important role in many information filtering systems. Emerging applications, such as news update filtering and social network notifications, demand presenting end-users with the most relevant content to their preferences. In this work, user preferences are indicated by a set of keywords. A central server monitors the document stream and continuously reports to each user the top-k documents that are most relevant to her keywords. Our objective is to support large numbers of users and high stream rates, while refreshing the top-k results almost instantaneously. Our solution abandons the traditional frequency-ordered indexing approach. Instead, it follows an identifier-ordering paradigm that suits better the nature of the problem. When complemented with a novel, locally adaptive technique, our method offers (i) proven optimality w.r.t. the number of considered queries per stream event, and (ii) an order of magnitude shorter response time (i.e., time to refresh the query results) than the current state-of-the-art.
引用
收藏
页码:991 / 1003
页数:13
相关论文
共 50 条
  • [1] Evaluating continuous top-k queries over document streams
    Weixiong Rao
    Lei Chen
    Shudong Chen
    Sasu Tarkoma
    World Wide Web, 2014, 17 : 59 - 83
  • [2] Evaluating continuous top-k queries over document streams
    Rao, Weixiong
    Chen, Lei
    Chen, Shudong
    Tarkoma, Sasu
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2014, 17 (01): : 59 - 83
  • [3] Continuous monitoring of moving skyline and top-k queries
    Arif Hidayat
    Muhammad Aamir Cheema
    Xuemin Lin
    Wenjie Zhang
    Ying Zhang
    The VLDB Journal, 2022, 31 : 459 - 482
  • [4] Sliding Window Top-K Monitoring over Distributed Data Streams
    Lv, Zhijin
    Chen, Ben
    Yu, Xiaohui
    WEB AND BIG DATA, APWEB-WAIM 2017, PT I, 2017, 10366 : 527 - 540
  • [5] Continuously monitoring top-k uncertain data streams: a probabilistic threshold method
    Hua, Ming
    Pei, Jian
    DISTRIBUTED AND PARALLEL DATABASES, 2009, 26 (01) : 29 - 65
  • [6] Continuously monitoring top-k uncertain data streams: a probabilistic threshold method
    Ming Hua
    Jian Pei
    Distributed and Parallel Databases, 2009, 26 : 29 - 65
  • [7] Faster Compact Top-k Document Retrieval
    Konow, Roberto
    Navarro, Gonzalo
    2013 DATA COMPRESSION CONFERENCE (DCC), 2013, : 351 - 360
  • [8] Efficient In-Memory Top-k Document Retrieval
    Culpepper, J. Shane
    Petri, Matthias
    Scholer, Falk
    SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 225 - 234
  • [9] TIME-OPTIMAL TOP-k DOCUMENT RETRIEVAL
    Navarro, Gonzalo
    Nekrich, Yakov
    SIAM JOURNAL ON COMPUTING, 2017, 46 (01) : 80 - 113
  • [10] Top-k monitoring in wireless sensor networks
    Wu, Minji
    Xu, Jianliang
    Tang, Xueyan
    Lee, Wang-Chien
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2007, 19 (07) : 962 - 976