A Holistic Heterogeneity-Aware Data Placement Scheme for Hybrid Parallel I/O Systems

被引:8
|
作者
He, Shuibing [1 ]
Li, Zheng [2 ]
Zhou, Jiang [3 ]
Yin, Yanlong [4 ]
Xu, Xiaohua [5 ]
Chen, Yong [6 ]
Sun, Xian-He [7 ]
机构
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Peoples R China
[2] Stockton Univ, Sch Business, Comp Sci Program, Galloway, NJ 08205 USA
[3] Chinese Acad Sci, Inst Informat Engn, Beijing 100864, Peoples R China
[4] Inst Artificial Intelligence, Intelligent Comp Syst Res Ctr, Zhejiang Lab, Hangzhou 311100, Peoples R China
[5] Kennesaw State Univ, Dept Comp Sci, Kennesaw, GA 30144 USA
[6] Texas Tech Univ, Dept Comp Sci, Lubbock, TX 79409 USA
[7] Illinois Inst Technol, Dept Comp Sci, Chicago, IL 60616 USA
基金
美国国家科学基金会;
关键词
Servers; System performance; Bandwidth; Computer science; Distributed databases; Sun; File systems; Parallel I; O system; parallel file system; hybrid parallel file system; data placement; solid state drive; SSD;
D O I
10.1109/TPDS.2019.2948901
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We present H2DP, a holistic heterogeneity-aware data placement scheme for hybrid parallel I/O systems, which consist of HDD servers and SSD servers. Most of the existing approaches focus on server performance or application I/O pattern heterogeneity in data placement. H2DP considers three axes of heterogeneity: server performance, server space, and application I/O pattern. More specifically, H2DP determines the optimized stripe sizes on servers based on server performance, keeps only critical data on all hybrid servers and the rest data on HDD servers, and dynamically migrates data among different types of servers at run-time. This holistic heterogeneity-awareness enables H2DP to achieve high performance by alleviating server load imbalance, efficiently utilizing SSD space, and accommodating application pattern variation. We have implemented a prototype of H2DP under MPICH2 atop OrangeFS. Extensive experimental results demonstrate that H2DP significantly improve I/O system performance compared to existing data placement schemes.
引用
收藏
页码:830 / 842
页数:13
相关论文
共 50 条
  • [21] Cost-Aware Region-Level Data Placement in Multi-Tiered Parallel I/O Systems
    He, Shuibing
    Wang, Yang
    Li, Zheng
    Sun, Xian-He
    Xu, Chenzhong
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2017, 28 (07) : 1853 - 1865
  • [22] Petrel: Heterogeneity-Aware Distributed Deep Learning Via Hybrid Synchronization
    Zhou, Qihua
    Guo, Song
    Qu, Zhihao
    Li, Peng
    Li, Li
    Guo, Minyi
    Wang, Kun
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 32 (05) : 1030 - 1043
  • [23] Random Mobility and Heterogeneity-Aware Hybrid Synchronization for Wireless Sensor Network
    Mantri, Dnyaneshwar S.
    Prasad, Neeli Rashmi
    Prasad, Ramjee
    WIRELESS PERSONAL COMMUNICATIONS, 2018, 100 (02) : 321 - 336
  • [24] Heterogeneity-Aware Codes With Uncoded Repair for Distributed Storage Systems
    Zhu, Bing
    Shum, Kenneth W.
    Li, Hui
    IEEE COMMUNICATIONS LETTERS, 2015, 19 (06) : 901 - 904
  • [25] HAShCache: Heterogeneity-Aware Shared DRAMCache for Integrated Heterogeneous Systems
    Patil, Adarsh
    Govindarajan, Ramaswamy
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2017, 14 (04)
  • [26] Heterogeneity-Aware Graph Partitioning for Distributed Deployment of Multiagent Systems
    Davoodi, Mohammadreza
    Velni, Javad Mohammadpour
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (04) : 2578 - 2588
  • [27] Random Mobility and Heterogeneity-Aware Hybrid Synchronization for Wireless Sensor Network
    Dnyaneshwar S. Mantri
    Neeli Rashmi Prasad
    Ramjee Prasad
    Wireless Personal Communications, 2018, 100 : 321 - 336
  • [28] PSA: A Performance and Space-Aware Data Layout Scheme for Hybrid Parallel File Systems
    He, Shuibing
    Liu, Yan
    Sun, Xian-He
    2014 INTERNATIONAL WORKSHOP ON DATA-INTENSIVE SCALABLE COMPUTING SYSTEMS (DISCS), 2014, : 41 - 48
  • [29] Pattern-Direct and Layout-Aware Replication Scheme for Parallel I/O Systems
    Yin, Yanlong
    Li, Jibing
    He, Jun
    Sun, Xian-He
    Thakur, Rajeev
    IEEE 27TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2013), 2013, : 345 - 356
  • [30] Heterogeneity-aware Peak Power Management for Accelerator-based Systems
    Wang, Guibin
    Lin, Yisong
    2011 IEEE 17TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2011, : 396 - 403