A Holistic Heterogeneity-Aware Data Placement Scheme for Hybrid Parallel I/O Systems

被引:8
|
作者
He, Shuibing [1 ]
Li, Zheng [2 ]
Zhou, Jiang [3 ]
Yin, Yanlong [4 ]
Xu, Xiaohua [5 ]
Chen, Yong [6 ]
Sun, Xian-He [7 ]
机构
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Peoples R China
[2] Stockton Univ, Sch Business, Comp Sci Program, Galloway, NJ 08205 USA
[3] Chinese Acad Sci, Inst Informat Engn, Beijing 100864, Peoples R China
[4] Inst Artificial Intelligence, Intelligent Comp Syst Res Ctr, Zhejiang Lab, Hangzhou 311100, Peoples R China
[5] Kennesaw State Univ, Dept Comp Sci, Kennesaw, GA 30144 USA
[6] Texas Tech Univ, Dept Comp Sci, Lubbock, TX 79409 USA
[7] Illinois Inst Technol, Dept Comp Sci, Chicago, IL 60616 USA
基金
美国国家科学基金会;
关键词
Servers; System performance; Bandwidth; Computer science; Distributed databases; Sun; File systems; Parallel I; O system; parallel file system; hybrid parallel file system; data placement; solid state drive; SSD;
D O I
10.1109/TPDS.2019.2948901
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We present H2DP, a holistic heterogeneity-aware data placement scheme for hybrid parallel I/O systems, which consist of HDD servers and SSD servers. Most of the existing approaches focus on server performance or application I/O pattern heterogeneity in data placement. H2DP considers three axes of heterogeneity: server performance, server space, and application I/O pattern. More specifically, H2DP determines the optimized stripe sizes on servers based on server performance, keeps only critical data on all hybrid servers and the rest data on HDD servers, and dynamically migrates data among different types of servers at run-time. This holistic heterogeneity-awareness enables H2DP to achieve high performance by alleviating server load imbalance, efficiently utilizing SSD space, and accommodating application pattern variation. We have implemented a prototype of H2DP under MPICH2 atop OrangeFS. Extensive experimental results demonstrate that H2DP significantly improve I/O system performance compared to existing data placement schemes.
引用
收藏
页码:830 / 842
页数:13
相关论文
共 50 条
  • [1] Heterogeneity-Aware Data Placement in Hybrid Clouds
    Marquez, Jack D.
    Gonzalez, Juan D.
    Mondragon, Oscar H.
    CLOUD COMPUTING - CLOUD 2019, 2019, 11513 : 177 - 191
  • [2] Heterogeneity-Aware Collective I/O for Parallel I/O Systems with Hybrid HDD/SSD Servers
    He, Shuibing
    Wang, Yang
    Sun, Xian-He
    Huang, Chuanhe
    Xu, Chenzhong
    IEEE TRANSACTIONS ON COMPUTERS, 2017, 66 (06) : 1091 - 1098
  • [3] A Migratory Heterogeneity-Aware Data Layout Scheme for Parallel File Systems
    He, Shuibing
    Sun, Xian-He
    Wang, Yang
    Xu, Chengzhong
    2018 32ND IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2018, : 1133 - 1142
  • [4] HAS: Heterogeneity-Aware Selective Layout Scheme for Parallel File Systems on Hybrid Servers
    He, Shuibing
    Sun, Xian-He
    Haider, Adnan
    2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2015, : 613 - 622
  • [5] A Cost-Aware Region-Level Data Placement Scheme for Hybrid Parallel I/O Systems
    He, Shuibing
    Sun, Xian-He
    Feng, Bo
    Huang, Xin
    Feng, Kun
    2013 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2013,
  • [6] A Heterogeneity-Aware Region-Level Data Layout for Hybrid Parallel File Systems
    He, Shuibing
    Sun, Xian-He
    Wang, Yang
    Kougkas, Antonis
    Haider, Adnan
    2015 44TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2015, : 340 - 349
  • [7] Heterogeneity-Aware Data Regeneration in Distributed Storage Systems
    Wang, Yan
    Wei, Dongsheng
    Yin, Xunrui
    Wang, Xin
    2014 PROCEEDINGS IEEE INFOCOM, 2014, : 1878 - 1886
  • [8] HARL: Optimizing Parallel File Systems with Heterogeneity-Aware Region-Level Data Layout
    He, Shuibing
    Wang, Yang
    Sun, Xian-He
    Xu, Chengzhong
    IEEE TRANSACTIONS ON COMPUTERS, 2017, 66 (06) : 1048 - 1060
  • [9] Orchestration Extensions for Interference- and Heterogeneity-Aware Placement for Data-Analytics
    Tzenetopoulos, Achilleas
    Masouros, Dimosthenis
    Xydis, Sotirios
    Soudris, Dimitrios
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2024, 52 (04) : 298 - 323
  • [10] Performance-Aware Data Placement in Hybrid Parallel File Systems
    He, Shuibing
    Sun, Xian-He
    Feng, Bo
    Feng, Kun
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2014, PT I, 2014, 8630 : 563 - 576