Optimization of Load Adaptive Distributed Stream Processing Services

被引:3
作者
Wu, Xing [1 ]
Liu, Yan [2 ]
机构
[1] Concordia Univ, Dept Elect & Comp Engn, Montreal, PQ H3G 1M8, Canada
[2] Concordia Univ, Fac Engn & Comp Sci, Montreal, PQ, Canada
来源
2014 IEEE INTERNATIONAL CONFERENCE ON SERVICES COMPUTING (SCC 2014) | 2014年
关键词
Software as a Service; stream processing; load optimization;
D O I
10.1109/SCC.2014.73
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Stream processing software frameworks enable real-time processing of continuous unbounded streams of data at a high speed. Leveraging the elasticity of cloud computing infrastructure, stream processing frameworks can become Software as a Service for many domain applications that provide simplified development and run-time management. An issue of making such a SaaS scalable is to allocate data processing operators on nodes of clusters and balance the workload dynamically. Since the data volume and rate can be unpredictable, static mapping between operators and cluster resources often results in unbalanced operator load distribution. This paper proposes an optimization method that combines correlation of resource utilization of nodes and capacity of clusters. The associated software components form a layer between a stream processing software framework and cloud clusters and nodes. This software layer allows dynamic transferring of an operator to different cluster nodes at runtime and keeps transparent to developers. We present a prototype evaluation on Yahoo's S4 and clusters on Emulab.org. Our implementation is evaluated by a top-N topic list application on Twitter streams. The results demonstrate improved stream processing throughputs and cluster resource utilization.
引用
收藏
页码:504 / 511
页数:8
相关论文
共 21 条
[1]  
Abadi DJ., 2005, CIDR, V5, P277
[2]  
[Anonymous], 2013, P ICWSM
[3]  
[Anonymous], P NAT WORKSH DISTR C
[4]  
[Anonymous], P 10 INT C AUT COMP
[5]  
BALKESEN C., 2013, 7 ACM INT C DISTRIBU, P15
[6]  
Gedik B., 2008, P ACM SIGMOD INT C M, P1123
[7]  
Gorlach K., 2012, 2012 IEEE International Conference on Services Computing (SCC), P555, DOI 10.1109/SCC.2012.30
[8]   StreamCloud: An Elastic and Scalable Data Streaming System [J].
Gulisano, Vincenzo ;
Jimenez-Peris, Ricardo ;
Patino-Martinez, Marta ;
Soriente, Claudio ;
Valduriez, Patrick .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2012, 23 (12) :2351-2365
[9]  
Hoeksema J., 2011, Ordring Workshop at ISWC
[10]  
Hunt P., 2010, P USENIX ANN TECH C, P11, DOI DOI 10.5555/1855840.1855851