EAFR: An Energy-Efficient Adaptive File Replication System in Data-Intensive Clusters

被引:14
|
作者
Lin, Yuhua [1 ]
Shen, Haiying [2 ]
机构
[1] Clemson Univ, Dept Elect & Comp Engn, Clemson, SC 29634 USA
[2] Univ Virginia, Dept Comp Sci, Charlottesville, VA 22904 USA
基金
美国国家科学基金会;
关键词
Data-intensive clusters; file replication; replica placement; energy-efficient; DATA CENTERS; MANAGEMENT; REDUCTION;
D O I
10.1109/TPDS.2016.2613989
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In data intensive clusters, a large amount of files are stored, processed and transferred simultaneously. To increase the data availability, some file systems create and store three replicas for each file in randomly selected servers across different racks. However, they neglect the file heterogeneity and server heterogeneity, which can be leveraged to further enhance data availability and file system efficiency. As files have heterogeneous popularities, a rigid number of three replicas may not provide immediate response to an excessive number of read requests to hot files, and waste resources (including energy) for replicas of cold files that have few read requests. Also, servers are heterogeneous in network bandwidth, hardware configuration and capacity (i. e., the maximal number of service requests that can be supported simultaneously), it is crucial to select replica servers to ensure low replication delay and request response delay. In this paper, we propose an Energy-Efficient Adaptive File Replication System (EAFR), which incorporates three components. It is adaptive to time-varying file popularities to achieve a good tradeoff between data availability and efficiency. Higher popularity of a file leads to more replicas and vice versa. Also, to achieve energy efficiency, servers are classified into hot servers and cold servers with different energy consumption, and cold files are stored in cold servers. EAFR then selects a server with sufficient capacity (including network bandwidth and capacity) to hold a replica. To further improve the performance of EAFR, we propose a dynamic transmission rate adjustment strategy to prevent potential incast congestion when replicating a file to a server, a networkaware data node selection strategy to reduce file read latency, and a load-aware replica maintenance strategy to quickly create file replicas under replica node failures. Experimental results on a real-world cluster show the effectiveness of EAFR and proposed strategies in reducing file read latency, replication time, and power consumption in large clusters.
引用
收藏
页码:1017 / 1030
页数:14
相关论文
共 50 条
  • [41] Energy-Efficient Analytics for Geographically Distributed Big Data
    Zhao, Peng
    Yang, Xinyu
    Lin, Jie
    Yang, Shusen
    Yu, Wei
    IEEE INTERNET COMPUTING, 2019, 23 (03) : 18 - 29
  • [42] Modeling and Simulation of Energy-Efficient Cloud Data Centers
    Moustafa, Nada
    Mashaly, Maggie
    Ashour, Mohamed
    2014 INTERNATIONAL CONFERENCE ON ENGINEERING AND TECHNOLOGY (ICET), 2014,
  • [43] Minimum Dependencies Energy-Efficient Scheduling in Data Centers
    Zotkiewicz, Mateusz
    Guzek, Mateusz
    Kliazovich, Dzmitry
    Bouvry, Pascal
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2016, 27 (12) : 3561 - 3574
  • [44] Machine Learning-based Energy-efficient Workload Management for Data Centers
    Smith, Matthew
    Zhao, Luke
    Cordova, Jonathan
    Jiang, Xunfei
    Ebrahimi, Mahdi
    2024 IEEE 21ST CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE, CCNC, 2024, : 799 - 806
  • [45] A Green energy-efficient scheduler for cloud data centers
    Amoon, Mohammed
    El Tobely, Tarek E.
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (02): : S3247 - S3259
  • [46] Feedback Control Scheduling in Energy-Efficient and Thermal-Aware Data Centers
    Zhao, Xiaogang
    Peng, Tao
    Qin, Xiao
    Hu, Qiping
    Ding, Ling
    Fang, Zhijun
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2016, 46 (01): : 48 - 60
  • [47] Ensuring renewable energy utilization with quality of service guarantee for energy-efficient data center operations
    Kwon, Soongeol
    APPLIED ENERGY, 2020, 276 (276)
  • [48] An Energy-Efficient Miniaturized Intracranial Pressure Monitoring System
    Ghanbari, Mohammad Meraj
    Tsai, Julius M.
    Nirmalathas, Ampalavanapillai
    Muller, Rikky
    Gambini, Simone
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2017, 52 (03) : 720 - 734
  • [49] Energy-Efficient Resource Provisioning Using Adaptive Harmony Search Algorithm for Compute-Intensive Workloads with Load Balancing in Datacenters
    Renugadevi, T.
    Geetha, K.
    Muthukumar, K.
    Geem, Zong Woo
    APPLIED SCIENCES-BASEL, 2020, 10 (07):
  • [50] Energy-Efficient Cooperative Adaptive Cruise Control for Electric Vehicle Platooning
    Li, Jiahang
    Chen, Cailian
    Yang, Bo
    He, Jianping
    Guan, Xinping
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (06) : 4862 - 4875