Towards Generic and Efficient Distributed Top-k Monitoring

被引:0
作者
Shi Biao [1 ]
Deng Bo [2 ]
Liu Li-mei [1 ]
Zhou Xian-cheng [1 ]
机构
[1] Hunan Univ Commerce, Changsha, Hunan, Peoples R China
[2] Nanjing Telecommun Technol Inst, Nanjing, Peoples R China
来源
2009 INTERNATIONAL CONFERENCE ON SCALABLE COMPUTING AND COMMUNICATIONS & EIGHTH INTERNATIONAL CONFERENCE ON EMBEDDED COMPUTING | 2009年
关键词
top-k; data streams; monitoring; distributed system;
D O I
10.1109/EmbeddedCom-ScalCom.2009.53
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Monitoring data streams in a distributed system is the focus of much research in recent years. This paper addresses the generic and efficient processing of distributed top-k monitoring, which is continuously reporting the k largest values according to a user-specified aggregation function over distributed data streams. In practice, the user-specified aggregation function would be arbitrary function. Unfortunately, state-of-art distributed top-k monitoring approaches only support the sum function as the aggregation function. In this paper, we present a generic algorithm for distributed top-k monitoring, which supports not only the sum function but also min, max, count, average, and their compounds. These functions are the most general aggregation functions. In our algorithm, two kinds of arithmetic constraints are maintained at remote stream sources to ensure that the most recently provided top-k answer remains valid to within a user-specified error tolerance. Theoretical analyses results show that, in our algorithm, distributed communication is only necessary on occasion and few objects are necessary transmitted, when constraints are violated, and the communication cost is independent of k.
引用
收藏
页码:257 / +
页数:3
相关论文
共 16 条
  • [1] [Anonymous], P ACM SIGMOD INT C M
  • [2] BABCOCK B, 2003, P 2003 ACM SIGMOD IN, P28, DOI DOI 10.1145/872757.872764
  • [3] Balke WT, 2005, PROC INT CONF DATA, P174
  • [4] Evaluating Top-k queries over web-accessible Databases
    Bruno, N
    Gravano, L
    Marian, A
    [J]. 18TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2002, : 369 - +
  • [5] CAREY MJ, 1997, P ACM SIGMOD INT C M, V26, P383, DOI DOI 10.1145/253260.253302
  • [6] Carney D., 2002, Proceedings of the Twenty-eighth International Conference on Very Large Data Bases, P215
  • [7] Chen J., 2000, SIGMOD 00, P379, DOI DOI 10.1145/342009.335432
  • [8] Deng B, 2006, LECT NOTES COMPUT SC, V4016, P496, DOI 10.1007/11775300_42
  • [9] Optimal aggregation algorithms for middleware
    Fagin, R
    Lotem, A
    Naor, M
    [J]. JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2003, 66 (04) : 614 - 656
  • [10] Guntzer Ulrich., 2000, VLDB J, P419