A Survey of Distributed Data Aggregation Algorithms

被引:89
作者
Jesus, Paulo [1 ]
Baquero, Carlos [1 ]
Almeida, Paulo Sergio [1 ]
机构
[1] Univ Minho, HASLab INESC TEC, P-4710057 Braga, Portugal
关键词
Distributed algorithms; data aggregation; performance trade-offs; fault-tolerance; WIRELESS SENSOR NETWORKS; FAULT-TOLERANT AGGREGATION; STABILIZING GROUP COMMUNICATION; MULTICAST TREES; SIZE-ESTIMATION; RANDOM-WALK; EFFICIENT; GOSSIP; COMPUTATION; ROBUST;
D O I
10.1109/COMST.2014.2354398
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Distributed data aggregation is an important task, allowing the decentralized determination of meaningful global properties, which can then be used to direct the execution of other applications. The resulting values are derived by the distributed computation of functions like COUNT, SUM, and AVERAGE. Some application examples deal with the determination of the network size, total storage capacity, average load, majorities and many others. In the last decade, many different approaches have been proposed, with different trade-offs in terms of accuracy, reliability, message and time complexity. Due to the considerable amount and variety of aggregation algorithms, it can be difficult and time consuming to determine which techniques will be more appropriate to use in specific settings, justifying the existence of a survey to aid in this task. This work reviews the state of the art on distributed data aggregation algorithms, providing three main contributions. First, it formally defines the concept of aggregation, characterizing the different types of aggregation functions. Second, it succinctly describes the main aggregation techniques, organizing them in a taxonomy. Finally, it provides some guidelines toward the selection and use of the most relevant techniques, summarizing their principal characteristics.
引用
收藏
页码:381 / 404
页数:24
相关论文
共 108 条
[1]   Probabilistic quorums for dynamic systems [J].
Abraham, I ;
Malkhi, D .
DISTRIBUTED COMPUTING, 2005, 18 (02) :113-124
[2]  
Almeida PS, 2011, LECT NOTES COMPUT SC, V7109, P513, DOI 10.1007/978-3-642-25873-2_35
[3]  
Alouf S, 2002, IEEE INFOCOM SER, P1109, DOI 10.1109/INFCOM.2002.1019359
[4]  
Alzaid Hani., 2008, 6th Australasian conference on Information security (AISC), P93
[5]  
[Anonymous], 2006, Proc. of ACM Symposium on Principles of Distributed Computing
[6]  
[Anonymous], 2004, Proceedings of the 2nd International Conference on Embedded Networked Sensor Systems (SenSys), DOI DOI 10.1145/1031495.1031524
[7]  
[Anonymous], 2006, 2006 IFIP INT C WIRE, DOI DOI 10.1109/WOCN.2006.1666600
[8]  
[Anonymous], 2004, P ACM SIGMOD SIGACT
[9]   Extrema Propagation: Fast Distributed Estimation of Sums and Network Sizes [J].
Baquero, Carlos ;
Almeida, Paulo Sergio ;
Menezes, Raquel ;
Jesus, Paulo .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2012, 23 (04) :668-675
[10]   Fast Estimation of Aggregates in Unstructured Networks [J].
Baquero, Carlos ;
Almeida, Paulo Sergio ;
Menezes, Raquel .
ICAS: 2009 FIFTH INTERNATIONAL CONFERENCE ON AUTONOMIC AND AUTONOMOUS SYSTEMS, 2009, :88-+