Re-Stream: Real-time and energy-efficient resource scheduling in big data stream computing environments

被引:71
作者
Sun, Dawei [1 ]
Zhang, Guangyan [1 ]
Yang, Songlin [1 ]
Meng, Weimin [1 ]
Khan, Samee U. [2 ]
Li, Keqin [3 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[2] N Dakota State Univ, Dept Elect & Comp Engn, Fargo, ND 58108 USA
[3] SUNY Coll New Paltz, Dept Comp Sci, New Paltz, NY 12561 USA
基金
美国国家科学基金会; 中国国家自然科学基金; 中国博士后科学基金;
关键词
Big data; Critical path; Directed acyclic graph; Energy efficiency; Real time; Stream computing; SYSTEMS; COST; CONSUMPTION;
D O I
10.1016/j.ins.2015.03.027
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To achieve high energy efficiency and low response time in big data stream computing environments, it is required to model an energy-efficient resource scheduling and optimization framework. In this paper, we propose a real-time and energy-efficient resource scheduling and optimization framework, termed the Re-Stream. Firstly, the Re-Stream profiles a mathematical relationship among energy consumption, response time, and resource utilization, and obtains the conditions to meet high energy efficiency and low response time. Secondly, a data stream graph is modeled by using the distributed stream computing theories, which identifies the critical path within the data stream graph. Such a methodology aids in calculating the energy consumption of a resource allocation scheme for a data stream graph at a given data stream speed. Thirdly, the Re-Stream allocates tasks by utilizing an energy-efficient heuristic and a critical path scheduling mechanism subject to the architectural requirements. This is done to optimize the scheduling mechanism online by reallocating the critical vertices on the critical path of a data stream graph to minimize the response time and system fluctuations. Moreover, the Re-Stream consolidates the non-critical vertices on the non-critical path so as to improve energy efficiency. We evaluate the Re-Stream to measure energy efficiency and response time for big data stream computing environments. The experimental results demonstrate that the Re-Stream has the ability to improve energy efficiency of a big data stream computing system, and to reduce average response time. The Re-Stream provides an elegant trade-off between increased energy efficiency and decreased response time effectively within big data stream computing environments. (C) 2015 Elsevier Inc. All rights reserved.
引用
收藏
页码:92 / 112
页数:21
相关论文
共 42 条
[1]   On Density-Based Data Streams Clustering Algorithms: A Survey [J].
Amini, Amineh ;
Teh, Ying Wah ;
Saboohi, Hadi .
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2014, 29 (01) :116-141
[2]  
Aniello L., 2013, 7 ACM INT C DISTR EV, P207
[3]   Green Cloud Computing: Balancing Energy in Processing, Storage, and Transport [J].
Baliga, Jayant ;
Ayre, Robert W. A. ;
Hinton, Kerry ;
Tucker, Rodney S. .
PROCEEDINGS OF THE IEEE, 2011, 99 (01) :149-167
[4]   Energy aware DAG scheduling on heterogeneous systems [J].
Baskiyar, Sanjeev ;
Abdel-Kader, Rabab .
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2010, 13 (04) :373-383
[5]   Energy-aware resource allocation heuristics for efficient management of data centers for Cloud computing [J].
Beloglazov, Anton ;
Abawajy, Jemal ;
Buyya, Rajkumar .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2012, 28 (05) :755-768
[6]  
Chauhan J., 2012, 2012 Seventh International Conference on P2P, Parallel, Grid, Cloud and Internet Computing (3PGCIC 2012), P58, DOI 10.1109/3PGCIC.2012.55
[7]   A hybrid heuristic-genetic algorithm for task scheduling in heterogeneous processor networks [J].
Daoud, Mohammad I. ;
Kharma, Nawwaf .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2011, 71 (11) :1518-1531
[8]   Automatic optimization of stream programs via source program operator graph transformations [J].
Dayarathna, Miyuru ;
Suzumura, Toyotaro .
DISTRIBUTED AND PARALLEL DATABASES, 2013, 31 (04) :543-599
[9]   Leveraging the capabilities of service-oriented decision support systems: Putting analytics and big data in cloud [J].
Demirkan, Haluk ;
Delen, Dursun .
DECISION SUPPORT SYSTEMS, 2013, 55 (01) :412-421
[10]   Cooperation of multiple mobile sensors with minimum energy cost for mobility and communication [J].
Guo, Ge ;
Zhao, Yuan ;
Yang, Guoqing .
INFORMATION SCIENCES, 2014, 254 :69-82