Trends in Data Locality Abstractions for HPC Systems

被引:60
|
作者
Unat, Didem [1 ]
Dubey, Anshu [2 ]
Hoefler, Torsten [3 ]
Shalf, John [4 ]
Abraham, Mark [5 ]
Bianco, Mauro [6 ]
Chamberlain, Bradford L. [7 ]
Cledat, Romain [8 ]
Edwards, H. Carter [9 ]
Finkel, Hal [10 ]
Fuerlinger, Karl [11 ]
Hannig, Frank [12 ]
Jeannot, Emmanuel [13 ]
Kamil, Amir [14 ,15 ]
Keasler, Jeff [16 ]
Kelly, Paul H. J. [17 ]
Leung, Vitus [9 ]
Ltaief, Hatem [18 ]
Maruyama, Naoya [19 ]
Newburn, Chris J. [20 ]
Pericas, Miquel [21 ]
机构
[1] Koc Univ, Dept Comp Engn, TR-34450 Istanbul, Turkey
[2] Argonne Natl Lab, 9700 S Cass Ave, Argonne, IL 60439 USA
[3] ETH, CH-8092 Zurich, Switzerland
[4] Lawrence Berkeley Natl Lab, Berkeley, CA 94720 USA
[5] KTH Royal Inst Technol, S-17121 Solna, Sweden
[6] Swiss Natl Supercomp Ctr, CH-6900 Lugano, Switzerland
[7] Cray Inc, Seattle, WA 98164 USA
[8] Intel Cooperat, Santa Clara, CA 95050 USA
[9] Sandia Natl Labs, POB 5800, Albuquerque, NM 87185 USA
[10] Argonne Natl Lab, 9700 S Cass Ave, Argonne, IL 60439 USA
[11] Ludwig Maximilians Univ Munchen, D-80538 Munich, Germany
[12] Univ Erlangen Nurnberg, D-91058 Erlangen, Germany
[13] INRIA Bordeaux Sud Ouest, F-33405 Talence, France
[14] Univ Michigan, Ann Arbor, MI 48109 USA
[15] Lawrence Berkeley Natl Lab, Berkeley, CA 94720 USA
[16] Lawrence Livermore Natl Lab, Livermore, CA 94550 USA
[17] Imperial Coll London, Software Technol, London, England
[18] King Abdullah Univ Sci & Technol, Thuwal 23955, Saudi Arabia
[19] RIKEN, Kobe, Hyogo 6500047, Japan
[20] Nvidia Corp, Santa Clara, CA 95050 USA
[21] Chalmers Univ Technol, S-41296 Gothenburg, Sweden
基金
英国工程与自然科学研究理事会;
关键词
Data locality; programming abstractions; high-performance computing; data layout; locality-aware runtimes;
D O I
10.1109/TPDS.2017.2703149
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The cost of data movement has always been an important concern in high performance computing (HPC) systems. It has now become the dominant factor in terms of both energy consumption and performance. Support for expression of data locality has been explored in the past, but those efforts have had only modest success in being adopted in HPC applications for various reasons. them However, with the increasing complexity of the memory hierarchy and higher parallelism in emerging HPC systems, locality management has acquired a new urgency. Developers can no longer limit themselves to low-level solutions and ignore the potential for productivity and performance portability obtained by using locality abstractions. Fortunately, the trend emerging in recent literature on the topic alleviates many of the concerns that got in the way of their adoption by application developers. Data locality abstractions are available in the forms of libraries, data structures, languages and runtime systems; a common theme is increasing productivity without sacrificing performance. This paper examines these trends and identifies commonalities that can combine various locality concepts to develop a comprehensive approach to expressing and managing data locality on future large-scale high-performance computing systems.
引用
收藏
页码:3007 / 3020
页数:14
相关论文
共 50 条
  • [31] Improving data locality with loop transformations
    McKinley, KS
    Carr, S
    Tseng, CW
    ACM TRANSACTIONS ON PROGRAMMING LANGUAGES AND SYSTEMS, 1996, 18 (04): : 424 - 453
  • [32] A Survey of System Scheduling for HPC and Big Data
    Wang, Bo
    Chen, Zhiguang
    Xiao, Nong
    HP3C 2020: PROCEEDINGS OF THE 2020 4TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPILATION, COMPUTING AND COMMUNICATIONS, 2020, : 178 - 183
  • [33] Improving data locality by array contraction
    Song, YH
    Xu, R
    Wang, C
    Li, ZY
    IEEE TRANSACTIONS ON COMPUTERS, 2004, 53 (09) : 1073 - 1084
  • [34] An improved task scheduling algorithm based on cache locality and data locality in Hadoop
    Zhang, Peng
    Li, Chunlin
    Zhao, Yahui
    2016 17TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES (PDCAT), 2016, : 244 - 249
  • [35] A Review on Data locality in Hadoop MapReduce
    Sharma, Anil
    Singh, Gurwinder
    2018 FIFTH INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (IEEE PDGC), 2018, : 723 - 728
  • [36] Fine-Granular Computation and Data Layout Reorganization for Improving Locality
    Kandemir, Mahmut
    Tang, Xulong
    Kotra, Jagadish
    Karakoy, Mustafa
    2022 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN, ICCAD, 2022,
  • [37] Scalable system scheduling for HPC and big data
    Reuther, Albert
    Byun, Chansup
    Arcand, William
    Bestor, David
    Bergeron, Bill
    Hubbell, Matthew
    Jones, Michael
    Michaleas, Peter
    Prout, Andrew
    Rosa, Antonio
    Kepner, Jeremy
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2018, 111 : 76 - 92
  • [38] Efficient Metadata Indexing for HPC Storage Systems
    Paul, Arnab K.
    Wang, Brian
    Rutman, Nathan
    Spitz, Cory
    Butt, Ali R.
    2020 20TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2020), 2020, : 162 - 171
  • [39] Data Locality in High Performance Computing, Big Data, and Converged Systems: An Analysis of the Cutting Edge and a Future System Architecture
    Usman, Sardar
    Mehmood, Rashid
    Katib, Iyad
    Albeshri, Aiiad
    ELECTRONICS, 2023, 12 (01)
  • [40] Modeling and Simulation of Extreme-Scale Fat-Tree Networks for HPC Systems and Data Centers
    Liu, Ning
    Haider, Adnan
    Jin, Dong
    Sun, Xian-He
    ACM TRANSACTIONS ON MODELING AND COMPUTER SIMULATION, 2017, 27 (02):