Using Pattern-Models to Guide SSD Deployment for Big Data Applications in HPC Systems

被引:0
|
作者
Chen, Junjie [1 ]
Roth, Philip C. [2 ]
Chen, Yong [1 ,3 ]
机构
[1] Texas Tech Univ, Dept Comp Sci, Lubbock, TX 79409 USA
[2] Oak Ridge Natl Lab, Comp Sci & Math Div, Oak Ridge, TN 37830 USA
[3] Texas Tech Univ, Dept Comp Sci, Lubbock, TX 79409 USA
来源
2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA | 2013年
基金
美国国家科学基金会;
关键词
Big Data; Solid State Drives; Hybrid Storage Systems; High Performance Computing; Exascale Systems;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Flash-memory based Solid State Drives (SSDs) embrace higher performance and lower power consumption compared to traditional storage devices (HDDs). These benefits are needed in HPC systems, especially with the growing demand of supporting Big Data applications. In this paper, we study placement and deployment strategies of SSDs in HPC systems to maximize the performance improvement, given a practical fixed hardware budget constraint. We propose a pattern-model approach to guide SSD deployment for HPC systems through two steps; characterizing workload and mapping deployment strategy. The first step is responsible for characterizing the access patterns of the workload and the second step contributes the actual deployment recommendation for Parallel File System (PFS) configuration combining with an analytical model. We have carried out initial experimental tests and the results confirmed that the proposed approach can guide placement of SSDs in HPC systems for accelerating data accesses. Our research will be helpful in guiding designs and developments for Big Data applications in current and projected HPC systems including exascale systems.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] Exposing data locality in HPC-based systems by using the HDFS backend
    Rivadeneira, Jose
    Garcia-Carballeira, Felix
    Carretero, Jesus
    Garcia-Blas, Javier
    2020 IEEE 27TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, AND ANALYTICS (HIPC 2020), 2020, : 243 - 250
  • [32] Using FPGAs to Accelerate HPC and Data Analytics on Intel-Based Systems
    Steinke, Thomas
    Suarez, Estela
    Boku, Taisuke
    Kumar, Nalini
    Martin, David E.
    HIGH PERFORMANCE COMPUTING: ISC HIGH PERFORMANCE 2019 INTERNATIONAL WORKSHOPS, 2020, 11887 : 561 - 566
  • [33] SOLAR RADIATION PREDICTION IN PV POWER SYSTEMS: A COMPARISON OF DEEP LEARNING MODELS USING BIG DATA
    Alay, Fatma Didem
    Ilhan, Nagehan
    Gulluoglu, Mehmet Tahir
    COMPTES RENDUS DE L ACADEMIE BULGARE DES SCIENCES, 2024, 77 (09): : 1347 - 1354
  • [34] Guidance Models for Designing Big Data Cyber Security Analytics Systems
    Ullah, Faheem
    Babar, Muhammad Ali
    SOFTWARE ARCHITECTURE, ECSA 2023, 2023, 14212 : 70 - 80
  • [35] Recent applications of big data analytics in railway transportation systems: A survey
    Ghofrani, Faeze
    He, Qing
    Goverde, Rob M. P.
    Liu, Xiang
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2018, 90 : 226 - 246
  • [36] Membrane Models in Big Data Process on GPU-accelerating Systems
    Zhang, Yuanhan
    Ji, Zhenzhou
    PROCEEDINGS OF 2016 SIXTH INTERNATIONAL CONFERENCE ON INSTRUMENTATION & MEASUREMENT, COMPUTER, COMMUNICATION AND CONTROL (IMCCC 2016), 2016, : 422 - 425
  • [37] Using machine learning to optimize parallelism in big data applications
    Brandon Hernandez, Alvaro
    Perez, Maria S.
    Gupta, Smrati
    Muntes-Mulero, Victor
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 86 : 1076 - 1092
  • [38] Tension in big data using machine learning: Analysis and applications
    Wang, Huamao
    Yao, Yumei
    Salhi, Said
    TECHNOLOGICAL FORECASTING AND SOCIAL CHANGE, 2020, 158
  • [39] Using Viable Systems Model and Big Data for Community Energy Systems
    Joshi, Kevin
    Ramamritham, Krithi
    2019 2ND INTERNATIONAL CONFERENCE ON SMART ENERGY SYSTEMS AND TECHNOLOGIES (SEST 2019), 2019,
  • [40] Heart Failure Prediction Models using Big Data Techniques
    Rammal, Heba F.
    Emam, Ahmed Z.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (05) : 363 - 371