Low-Overhead Reuse Distance Profiling Tool for Multicore

Cited by: 0
Authors
Sasongko, Muhammad Aditya [1]
Chabbi, Milind [2]
Unat, Didem [1]
Affiliations
[1] Koc Univ, Istanbul, Turkey
[2] Scalable Machines Res, San Jose, CA USA
Source
EURO-PAR 2021: PARALLEL PROCESSING WORKSHOPS | 2022 / Volume 13098
Keywords
Reuse distance; Hardware performance counters; Debug registers; Address sampling
DOI
10.1007/978-3-031-06156-1_49
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
As core counts in multicore systems increase, data movement has become one of the main sources of performance slowdown in parallel applications, and data locality has become a critical factor in application optimization. One important locality metric is reuse distance, which indicates how likely a memory access is to be a cache hit. In this work, we propose a low-overhead reuse distance profiling tool for multi-threaded applications. Our method relies on hardware features available in commodity CPUs, namely Performance Monitoring Units (PMUs) and debug registers, to detect data reuse in private and shared caches while accounting for inter-thread cache line invalidations. Unlike prior approaches, our tool is fast and accurate, does not change program behavior, and can also handle shared cache accesses. It achieves 92% accuracy with low runtime (2.9x) and memory (2.8x) overheads.
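As an illustration of the metric named in the abstract, the following minimal sketch computes exact reuse distances from a full address trace. It is not the paper's method, which samples accesses with PMUs and debug registers. It assumes the common definition of reuse distance as the number of distinct cache lines touched between two consecutive accesses to the same line. The function names `reuse_distances` and `hit_ratio`, the 64-byte line size, and the fully associative LRU cache model are all assumptions made for this example.

```python
from collections import OrderedDict


def reuse_distances(trace, line_size=64):
    """Return the reuse distance of every access in `trace` (byte addresses).

    The distance is the number of distinct cache lines touched since the
    previous access to the same line, or None for a first-time access.
    """
    stack = OrderedDict()  # cache lines, least recently used first
    distances = []
    for addr in trace:
        line = addr // line_size
        if line in stack:
            # Count the distinct lines used more recently than `line`.
            keys = list(stack)
            distances.append(len(keys) - 1 - keys.index(line))
            stack.move_to_end(line)
        else:
            distances.append(None)
            stack[line] = True
    return distances


def hit_ratio(distances, cache_lines):
    """Fraction of accesses that hit in a fully associative LRU cache
    holding `cache_lines` lines (assumed model, for illustration only)."""
    hits = sum(1 for d in distances if d is not None and d < cache_lines)
    return hits / len(distances)


if __name__ == "__main__":
    trace = [0, 64, 128, 0, 64, 64]  # hypothetical byte addresses
    d = reuse_distances(trace)
    print(d)
    print(hit_ratio(d, cache_lines=3))
```

Under this model an access hits exactly when its reuse distance is smaller than the number of lines the cache holds, which is why a histogram of reuse distances predicts hit rates for any cache size. The linear scan makes this sketch quadratic in the worst case; it is meant to show the definition, not to be a low-overhead profiler.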
Pages: 555-559
Page count: 5
Related papers
50 items in total
  • [31] Low-Overhead Defect Tolerance in Crossbar Nanoarchitectures
    Tahoori, Mehdi B.
    ACM JOURNAL ON EMERGING TECHNOLOGIES IN COMPUTING SYSTEMS, 2009, 5 (02)
  • [32] sRDMA: A General and Low-Overhead Scheduler for RDMA
    Wang, Xizheng
    Wang, Shuai
    Li, Dan
    PROCEEDINGS OF THE 7TH ASIA-PACIFIC WORKSHOP ON NETWORKING, APNET 2023, 2023, : 21 - 27
  • [33] Low-overhead inline deduplication for persistent memory
    Chen, Wande
    Chen, Zhenke
    Li, Dingding
    Liu, Hai
    Tang, Yong
    TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2021, 32 (08)
  • [34] Evaluation of a Low-Overhead Forwarding Algorithm for Platooning
    Larsson, Marcus
    Warg, Fredrik
    Karlsson, Kristian
    Jonsson, Magnus
    2015 IEEE INTERNATIONAL CONFERENCE ON VEHICULAR ELECTRONICS AND SAFETY (ICVES), 2015, : 48 - 55
  • [35] Distop: A low-overhead cluster monitoring system
    Andresen, D
    Schopf, N
    Bowker, E
    Bower, T
    PDPTA'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS 1-4, 2003, : 1832 - 1836
  • [36] Low-Overhead Control Channels in Wireless Networks
    Chai, Eugene
    Shin, Kang G.
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2015, 14 (11) : 2302 - 2315
  • [37] Low-overhead quantum computing with the color code
    Thomsen, Felix
    Kesselring, Markus S.
    Bartlett, Stephen D.
    Brown, Benjamin J.
    PHYSICAL REVIEW RESEARCH, 2024, 6 (04)
  • [38] Low-overhead message tracking for distributed messaging
    Jun, Seung
    Astley, Mark
    MIDDLEWARE 2006, PROCEEDINGS, 2006, 4290 : 363 - 381
  • [39] A Low-Overhead Integrated Aging and SEU Sensor
    Rohbani, Nezam
    Miremadi, Seyed-Ghassem
    IEEE TRANSACTIONS ON DEVICE AND MATERIALS RELIABILITY, 2018, 18 (02) : 205 - 213
  • [40] A Low-Overhead Dynamic Optimization Framework for Multicores
    Fletcher, Christopher W.
    Harding, Rachael
    Khan, Omer
    Devadas, Srinivas
    PROCEEDINGS OF THE 21ST INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT'12), 2012, : 467 - 468