EAFR: An Energy-Efficient Adaptive File Replication System in Data-Intensive Clusters

被引:14
|
作者
Lin, Yuhua [1 ]
Shen, Haiying [2 ]
机构
[1] Clemson Univ, Dept Elect & Comp Engn, Clemson, SC 29634 USA
[2] Univ Virginia, Dept Comp Sci, Charlottesville, VA 22904 USA
基金
美国国家科学基金会;
关键词
Data-intensive clusters; file replication; replica placement; energy-efficient; DATA CENTERS; MANAGEMENT; REDUCTION;
D O I
10.1109/TPDS.2016.2613989
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In data intensive clusters, a large amount of files are stored, processed and transferred simultaneously. To increase the data availability, some file systems create and store three replicas for each file in randomly selected servers across different racks. However, they neglect the file heterogeneity and server heterogeneity, which can be leveraged to further enhance data availability and file system efficiency. As files have heterogeneous popularities, a rigid number of three replicas may not provide immediate response to an excessive number of read requests to hot files, and waste resources (including energy) for replicas of cold files that have few read requests. Also, servers are heterogeneous in network bandwidth, hardware configuration and capacity (i. e., the maximal number of service requests that can be supported simultaneously), it is crucial to select replica servers to ensure low replication delay and request response delay. In this paper, we propose an Energy-Efficient Adaptive File Replication System (EAFR), which incorporates three components. It is adaptive to time-varying file popularities to achieve a good tradeoff between data availability and efficiency. Higher popularity of a file leads to more replicas and vice versa. Also, to achieve energy efficiency, servers are classified into hot servers and cold servers with different energy consumption, and cold files are stored in cold servers. EAFR then selects a server with sufficient capacity (including network bandwidth and capacity) to hold a replica. To further improve the performance of EAFR, we propose a dynamic transmission rate adjustment strategy to prevent potential incast congestion when replicating a file to a server, a networkaware data node selection strategy to reduce file read latency, and a load-aware replica maintenance strategy to quickly create file replicas under replica node failures. Experimental results on a real-world cluster show the effectiveness of EAFR and proposed strategies in reducing file read latency, replication time, and power consumption in large clusters.
引用
收藏
页码:1017 / 1030
页数:14
相关论文
共 50 条
  • [31] Stochastic Optimal Control of HVAC System for Energy-Efficient Buildings
    Yang, Yu
    Hu, Guoqiang
    Spanos, Costas J.
    IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2022, 30 (01) : 376 - 383
  • [32] Adaptive DRL-Based Task Scheduling for Energy-Efficient Cloud Computing
    Kang, Kaixuan
    Ding, Ding
    Xie, Huamao
    Yin, Qian
    Zeng, Jing
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2022, 19 (04): : 4948 - 4961
  • [33] A smart coordinated temperature feedback controller for energy-efficient data centers
    Zhao, Xiaogang
    Xiong, Zenggang
    Ding, Ling
    Zhang, Xuemin
    Xu, Fang
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 93 : 506 - 514
  • [34] RESS: A Reliable Energy-Efficient Storage System
    Yin, Shu
    Xiao, Zhaoyu
    Li, Kenli
    Huang, Jianzhong
    Ruan, Xiaojun
    Zhu, Xiaomin
    Qin, Xiao
    2016 IEEE 22ND INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2016, : 1193 - 1198
  • [35] ECOS: An Energy-Efficient Cluster Storage System
    Ruan, Xiaojun
    Yin, Shu
    Manzanares, Adam
    Xie, Jiong
    Ding, Zhiyang
    Majors, James
    Qin, Xiao
    2009 IEEE 28TH INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCC 2009), 2009, : 79 - 86
  • [36] EASYR: Energy-Efficient Adaptive System Reconfiguration for Dynamic Deadlines in Autonomous Driving on Multicore Processors
    Yi, Saehanseul
    Kim, Tae-Wook
    Kim, Jong-Chan
    Dutt, Nikil
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2023, 22 (03)
  • [37] An Energy-Efficient Load Balancing Scheme in Heterogeneous Clusters by Linear Programming
    Zhang, Yujian
    Li, Mingde
    Tong, Fei
    2022 31ST INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS (ICCCN 2022), 2022,
  • [38] AdaMD: Adaptive Mapping and DVFS for Energy-Efficient Heterogeneous Multicores
    Basireddy, Karunakar R.
    Singh, Amit Kumar
    Al-Hashimi, Bashir M.
    Merrett, Geoff V.
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2020, 39 (10) : 2206 - 2217
  • [39] Electronics and waste material recycler energy-efficient electric furnace system
    Shufian, Abu
    Mahin, Md. Saidur Rashid
    Islam, Riadul
    CLEANER ENGINEERING AND TECHNOLOGY, 2022, 6
  • [40] A Green energy-efficient scheduler for cloud data centers
    Mohammed Amoon
    Tarek E. El. Tobely
    Cluster Computing, 2019, 22 : 3247 - 3259