On-Chip Cache Hierarchy-Aware Tile Scheduling for Multicore Machines

被引:0
|
作者
Liu, Jun [1 ]
Zhang, Yuanrui [1 ]
Ding, Wei [1 ]
Kandemir, Mahmut [1 ]
机构
[1] Penn State Univ, Dept Comp Sci & Engn, University Pk, PA 16802 USA
来源
2011 9TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION (CGO) | 2011年
关键词
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Iteration space tiling and scheduling is an important technique for optimizing loops that constitute a large fraction of execution times in computation kernels of both scientific codes and embedded applications. While tiling has been studied extensively in the context of both uniprocessor and multiprocessor platforms, prior research has paid less attention to tile scheduling, especially when targeting multicore machines with deep on-chip cache hierarchies. In this paper, we propose a cache hierarchy-aware tile scheduling algorithm for multicore machines, with the purpose of maximizing both horizontal and vertical data reuses in on-chip caches, and balancing the workloads across different cores. This scheduling algorithm is one of the key components in a source-to-source translation tool that we developed for automatic loop parallelization and multithreaded code generation from sequential codes. To the best of our knowledge, this is the first effort that develops a fully-automated tile scheduling strategy customized for on-chip cache topologies of multicore machines. The experimental results collected by executing twelve application programs on three commercial Intel machines (Nehalem, Dunnington, and Harpertown) reveal that our cache-aware tile scheduling brings about 27.9% reduction in cache misses, and on average, 13.5% improvement in execution times over an alternate method tested.
引用
收藏
页码:161 / 170
页数:10
相关论文
共 23 条
  • [1] Cache Hierarchy-Aware Query Mapping on Emerging Multicore Architectures
    Ozturk, Ozcan
    Orhan, Umut
    Ding, Wei
    Yedlapalli, Praveen
    Kandemir, Mahmut Taylan
    IEEE TRANSACTIONS ON COMPUTERS, 2017, 66 (03) : 403 - 415
  • [2] Cache Utilization-Aware Scheduling for Multicore Processors
    Chu, Edward T. -H.
    Lu, Wen-wei
    2012 IEEE ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS (APCCAS), 2012, : 368 - 371
  • [3] Optimal on-chip cache hierarchy synthesis with scaling of technology
    Fu, ST
    Flynn, MJ
    CONFERENCE PROCEEDINGS OF THE 1996 IEEE FIFTEENTH ANNUAL INTERNATIONAL PHOENIX CONFERENCE ON COMPUTERS AND COMMUNICATIONS, 1996, : 129 - 135
  • [4] Secure Hierarchy-Aware Cache Replacement Policy (SHARP): Defending Against Cache-Based Side Channel Attacks
    Yan, Mengjia
    Gopireddy, Bhargava
    Shull, Thomas
    Torrellas, Josep
    44TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA 2017), 2017, : 347 - 360
  • [5] Cache-Hierarchy Contention-Aware Scheduling in CMPs
    Feliu, Josue
    Petit, Salvador
    Sahuquillo, Julio
    Duato, Jose
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2014, 25 (03) : 581 - 590
  • [6] Energy-Efficient Cache-Aware Scheduling on Heterogeneous Multicore Systems
    Sheikh, Saad Zia
    Pasha, Muhammad Adeel
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (01) : 206 - 217
  • [7] Integrating Cache-Related Preemption Delay into GEDF Analysis for Multiprocessor Scheduling with On-Chip Cache
    Zhang, Ying
    Guo, Zhishan
    Wang, Lingxiang
    Xiong, Haoyi
    Zhang, Zhenkai
    2017 16TH IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS / 11TH IEEE INTERNATIONAL CONFERENCE ON BIG DATA SCIENCE AND ENGINEERING / 14TH IEEE INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS, 2017, : 815 - 822
  • [8] DELTA: Distributed Locality-Aware Cache Partitioning for Tile-based Chip Multiprocessors
    Holtryd, Nadja
    Manivannan, Madhavan
    Stenstrom, Per
    Pericas, Miquel
    2020 IEEE 34TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM IPDPS 2020, 2020, : 578 - 589
  • [9] Evaluation of Hybrid Memory Technologies Using SOT-MRAM for On-Chip Cache Hierarchy
    Oboril, Fabian
    Bishnoi, Rajendra
    Ebrahimi, Mojtaba
    Tahoori, Mehdi B.
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2015, 34 (03) : 367 - 380
  • [10] ROHOM: Requirement-Aware Online Hybrid On-Chip Memory Management for Multicore Systems
    Chang, Da-Wei
    Lin, Ing-Chao
    Yong, Lin-Chun
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2017, 36 (03) : 357 - 369