Continuous Top-k Monitoring on Document Streams

被引:9
|
作者
Hou, Leong U. [1 ]
Zhang, Junjie [1 ]
Mouratidis, Kyriakos [2 ]
Li, Ye [1 ]
机构
[1] Univ Macau, Dept Comp & Informat Sci, Macau, Peoples R China
[2] Singapore Management Univ, Sch Informat Syst, Singapore 188065, Singapore
关键词
Top-k query; continuous query; document stream; QUERIES;
D O I
10.1109/TKDE.2017.2657622
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The efficient processing of document streams plays an important role in many information filtering systems. Emerging applications, such as news update filtering and social network notifications, demand presenting end-users with the most relevant content to their preferences. In this work, user preferences are indicated by a set of keywords. A central server monitors the document stream and continuously reports to each user the top-k documents that are most relevant to her keywords. Our objective is to support large numbers of users and high stream rates, while refreshing the top-k results almost instantaneously. Our solution abandons the traditional frequency-ordered indexing approach. Instead, it follows an identifier-ordering paradigm that suits better the nature of the problem. When complemented with a novel, locally adaptive technique, our method offers (i) proven optimality w.r.t. the number of considered queries per stream event, and (ii) an order of magnitude shorter response time (i.e., time to refresh the query results) than the current state-of-the-art.
引用
收藏
页码:991 / 1003
页数:13
相关论文
共 50 条
  • [41] Uncertain top-k query processing in distributed environments
    Wang, Xite
    Shen, Derong
    Yu, Ge
    DISTRIBUTED AND PARALLEL DATABASES, 2016, 34 (04) : 567 - 589
  • [42] Top-k Query for Weighted Interactive Product Configuration
    Chen, Baijun
    Feng, Tao
    2018 17TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS FOR BUSINESS ENGINEERING AND SCIENCE (DCABES), 2018, : 326 - 331
  • [43] An Approximate Top-k Query Algorithm in Distributed Network
    Li, Wenhua
    Yu, Wenting
    Xiao, Feng
    PROCEEDINGS OF THE 2009 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND NATURAL COMPUTING, VOL II, 2009, : 417 - 420
  • [44] A Toolkit for Managing Multiple Crowdsourced Top-K Queries
    Shan, Caihua
    Hou, Leong U.
    Mamoulis, Nikos
    Cheng, Reynold
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 3453 - 3456
  • [45] Top-K Deep Video Analytics: A Probabilistic Approach
    Lai, Ziliang
    Han, Chenxia
    Liu, Chris
    Zhang, Pengfei
    Lo, Eric
    Kao, Ben
    SIGMOD '21: PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2021, : 1037 - 1050
  • [46] Optimizing Distributed Top-k Queries on Uncertain Data
    Zhao Zhibin
    Yu Yang
    Bao Yubin
    Yu Ge
    2013 25TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2013, : 3209 - 3214
  • [47] Scalable top-k keyword search in relational databases
    Xu, Yanwei
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 1): : 731 - 747
  • [48] Efficient and Secure Top-k Queries With Top Order-Preserving Encryption
    Quan, Hanyu
    Wang, Boyang
    Zhang, Yuqing
    Wu, Gaofei
    IEEE ACCESS, 2018, 6 : 31525 - 31540
  • [49] Reverse k nearest neighbors queries and spatial reverse top-k queries
    Shiyu Yang
    Muhammad Aamir Cheema
    Xuemin Lin
    Ying Zhang
    Wenjie Zhang
    The VLDB Journal, 2017, 26 : 151 - 176
  • [50] Why Not Yet: Fixing a Top-k Ranking that Is Not Fair to Individuals
    Chen, Zixuan
    Manolios, Panagiotis
    Riedewald, Mirek
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2023, 16 (09): : 2377 - 2390