Replica-aware data recovery performance improvement for Hadoop system with NVM

Authors
Li, Xin [1]
Li, Huijie [1]
Lu, Youyou [2]
Zhao, Yanchao [1]
Qin, Xiaolin [1]
Affiliations
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Peoples R China
[2] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
Funding
National Natural Science Foundation of China; National Key Research and Development Program of China
Keywords
Data recovery; HDFS; MapReduce; Non-volatile memory; Performance tuning; CLUSTER; MEMORY
DOI
10.1007/s42514-021-00066-9
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Non-volatile memory (NVM) is a promising device for storing data and accelerating big data analysis owing to its excellent I/O performance. However, we find that simply replacing hard disk drives (HDDs) with NVM does not bring the expected performance improvement. In this paper, we take data recovery in the Hadoop Distributed File System (HDFS) as a case study to investigate how to take advantage of NVM's performance. We analyze the data recovery mechanism in HDFS and find that the configuration of replication tasks in the DataNode significantly affects data recovery. We conduct extensive analysis and experiments on tuning this configuration and report several interesting findings. With the new configuration, we improve data recovery performance by 17% to 71%. The optimized configuration also improves the execution performance of MapReduce jobs by 28% to 59%. We further find that a sudden data recovery causes disordered competition for network resources, which reduces the performance of MapReduce jobs. Hence, we present a priority-aware multi-stage data recovery method, which improves the performance of MapReduce jobs by a further 32.5%.
Pages: 144-156
Page count: 13