Using Pattern-Models to Guide SSD Deployment for Big Data Applications in HPC Systems

被引:0
|
作者
Chen, Junjie [1 ]
Roth, Philip C. [2 ]
Chen, Yong [1 ,3 ]
机构
[1] Texas Tech Univ, Dept Comp Sci, Lubbock, TX 79409 USA
[2] Oak Ridge Natl Lab, Comp Sci & Math Div, Oak Ridge, TN 37830 USA
[3] Texas Tech Univ, Dept Comp Sci, Lubbock, TX 79409 USA
来源
2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA | 2013年
基金
美国国家科学基金会;
关键词
Big Data; Solid State Drives; Hybrid Storage Systems; High Performance Computing; Exascale Systems;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Flash-memory based Solid State Drives (SSDs) embrace higher performance and lower power consumption compared to traditional storage devices (HDDs). These benefits are needed in HPC systems, especially with the growing demand of supporting Big Data applications. In this paper, we study placement and deployment strategies of SSDs in HPC systems to maximize the performance improvement, given a practical fixed hardware budget constraint. We propose a pattern-model approach to guide SSD deployment for HPC systems through two steps; characterizing workload and mapping deployment strategy. The first step is responsible for characterizing the access patterns of the workload and the second step contributes the actual deployment recommendation for Parallel File System (PFS) configuration combining with an analytical model. We have carried out initial experimental tests and the results confirmed that the proposed approach can guide placement of SSDs in HPC systems for accelerating data accesses. Our research will be helpful in guiding designs and developments for Big Data applications in current and projected HPC systems including exascale systems.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Word pattern prediction using Big Data frameworks
    Szabari, Bence
    Kiss, Attila
    ACTA UNIVERSITATIS SAPIENTIAE INFORMATICA, 2020, 12 (01) : 51 - 69
  • [22] Contemporary Recommendation Systems on Big Data and Their Applications: A Survey
    Xia, Ziyuan
    Sun, Anchen
    Xu, Jingyi
    Peng, Yuanzhe
    Ma, Rui
    Cheng, Minghui
    IEEE ACCESS, 2024, 12 : 196914 - 196928
  • [23] Pattern Mining from Big IoT Data with Fog Computing: Models, Issues, and Research Perspectives
    Braun, Peter
    Cuzzocrea, Alfredo
    Leung, Carson K.
    Pazdor, Adam G. M.
    Souza, Joglas
    Tanbeer, Syed K.
    2019 19TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2019, : 584 - 591
  • [24] Integrated Information Supporting Systems in Big Data Applications
    Sun, Zheng-hao
    Wang, Hong-sheng
    Zhu, Changming
    Wang, Qian
    Yan, Yan
    2015 EIGHTH INTERNATIONAL CONFERENCE ON INTERNET COMPUTING FOR SCIENCE AND ENGINEERING (ICICSE), 2015, : 15 - 18
  • [25] Aten: A Dispatcher for Big Data Applications in Heterogeneous Systems
    de Souza Junior, Paulo R. R.
    Matteussi, Kassiano J.
    dos Anjos, Julio C. S.
    dos Santos, Jobe D. D.
    Resin Geyer, Claudio Fernando
    Veith, Alexandre da Silva
    PROCEEDINGS 2018 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS), 2018, : 585 - 592
  • [26] Performance evaluation of NoSQL big-data applications using multi-formalism models
    Barbierato, Enrico
    Gribaudo, Marco
    Iacono, Mauro
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2014, 37 : 345 - 353
  • [27] A Big Data Analytics Framework for HPC Log Data: Three Case Studies Using the Titan Supercomputer Log
    Park, Byung H.
    Hui, Yawei
    Boehm, Swen
    Ashraf, Rizwan A.
    Layton, Christopher
    Engelmann, Christian
    2018 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2018, : 571 - 579
  • [28] Bandwidth Modeling in Large Distributed Systems for Big Data Applications
    Javadi, Bahman
    Zhang, Boyu
    Taufer, Michela
    2014 15TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES (PDCAT 2014), 2014, : 21 - 27
  • [29] The Implications of Diverse Applications and Scalable Data Sets in Benchmarking Big Data Systems
    Jia, Zhen
    Zhou, Runlin
    Zhu, Chunge
    Wang, Lei
    Gao, Wanling
    Shi, Yingjie
    Zhan, Jianfeng
    Zhang, Lixin
    SPECIFYING BIG DATA BENCHMARKS, 2014, 8163 : 44 - 59
  • [30] A Task Scheduling Algorithm for HPC Applications using Colored Stochastic Petri Net Models
    Mironescu, Ion Dan
    Vintan, Lucian
    2017 13TH IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTER COMMUNICATION AND PROCESSING (ICCP), 2017, : 479 - 486