Integrating Memory Perspective into the BSC Performance Tools

被引:2
作者
Servat, Harald [1 ]
Labarta, Jesus [2 ,3 ]
Hoppe, Hans-Christian [1 ]
Gimenez, Judit [2 ,3 ]
Pena, Antonio J. [2 ]
机构
[1] Intel Corp, Santa Clara, CA 95051 USA
[2] BSC, Barcelona, Spain
[3] UPC, Barcelona, Spain
来源
2017 46TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOPS (ICPPW) | 2017年
关键词
performance analysis; memory references; sampling; instrumentation;
D O I
10.1109/ICPPW.2017.42
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The growing gap between processor and memory speeds results in complex memory hierarchies as processors evolve to mitigate such differences by taking advantage of locality of reference. In this direction, the BSC performance analysis tools have been recently extended to provide insight relative the application memory accesses depicting their temporal and spatial characteristics, correlating with the source-code and the achieved performance simultaneously. These extensions rely on the Precise Event-Based Sampling (PEBS) mechanism available in recent Intel processors to capture information relative to the application memory accesses. The sampled information is processed with the Folding mechanism to provide a detailed temporal evolution of the memory accesses and in conjunction with the achieved performance and the source-code counterpart. The results obtained from the combination of these tools help application developers to understand better how the application behaves and how the system performs. We demonstrate the value of the complete work-flow by exploring an already optimized state-of-the-art benchmark, providing detailed insight of their memory access behavior.
引用
收藏
页码:231 / 232
页数:2
相关论文
共 7 条
  • [1] Refactoring for Data Locality
    Beyls, Kristof
    D'Hollander, Erik H.
    [J]. COMPUTER, 2009, 42 (02) : 62 - 71
  • [2] High-performance conjugate-gradient benchmark: A new metric for ranking high-performance computing systems
    Dongarra, Jack
    Heroux, Michael A.
    Luszczek, Piotr
    [J]. INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2016, 30 (01) : 3 - 10
  • [3] Liu X., INT C HIGH PERF COMP, P28
  • [4] Pena Antonio J., 2014, 2014 43rd International Conference on Parallel Processing Workshops (ICCPW). Proceedings, P235, DOI 10.1109/ICPPW.2014.40
  • [5] Peña AJ, 2014, IEEE INT C CL COMP, P123, DOI 10.1109/CLUSTER.2014.6968756
  • [6] Servat H., 2011, INT C PAR PROC ICPP
  • [7] Subotic V, 2011, LECT NOTES COMPUT SC, V6852, P39, DOI 10.1007/978-3-642-23400-2_5