Predicting inter-thread cache contention on a chip multi-processor architecture

Cited by: 168
Authors
Chandra, D [1]
Guo, F [1]
Kim, S [1]
Solihin, Y [1]
Affiliations
[1] N Carolina State Univ, Dept Elect & Comp Engn, Raleigh, NC 27695 USA
DOI
10.1109/HPCA.2005.27
Chinese Library Classification
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
This paper studies the impact of L2 cache sharing on threads that simultaneously share the cache, on a Chip Multi-Processor (CMP) architecture. Cache sharing impacts threads non-uniformly, where some threads may be slowed down significantly, while others are not. This may cause severe performance problems such as sub-optimal throughput, cache thrashing, and thread starvation for threads that fail to occupy sufficient cache space to make good progress. Unfortunately, there is no existing model that allows extensive investigation of the impact of cache sharing. To allow such a study, we propose three performance models that predict the impact of cache sharing on co-scheduled threads. The input to our models is the isolated L2 cache stack distance or circular sequence profile of each thread, which can be easily obtained on-line or off-line. The output of the models is the number of extra L2 cache misses for each thread due to cache sharing. The models differ by their complexity and prediction accuracy. We validate the models against a cycle-accurate simulation that implements a dual-core CMP architecture, on fourteen pairs of mostly SPEC benchmarks. The most accurate model, the Inductive Probability model, achieves an average error of only 3.9%. Finally, to demonstrate the usefulness and practicality of the model, a case study that details the relationship between an application's temporal reuse behavior and its cache sharing impact is presented.
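To make the model input concrete, the following is a minimal sketch of how an isolated stack distance profile could be collected for a single LRU cache set. It is not one of the paper's three models. The function names `stack_distance_profile` and `misses_with_ways` and the toy trace are assumptions introduced here for illustration. The sketch relies only on the standard LRU stack property: an access that hits at stack position i still hits when the thread holds at least i ways, so the counters also give the miss count for any smaller share of the set.

```python
def stack_distance_profile(trace, assoc):
    """Return counters [C_1, ..., C_assoc, C_miss] for one LRU-managed set.

    C_i counts accesses that hit at LRU stack position i (1 = most recent).
    C_miss counts cold misses and accesses whose stack distance exceeds assoc.
    """
    stack = []                       # index 0 is the most recently used line
    counters = [0] * (assoc + 1)
    for addr in trace:
        if addr in stack:
            pos = stack.index(addr)  # 0-based stack position before the access
            counters[min(pos, assoc)] += 1
            stack.pop(pos)
        else:
            counters[assoc] += 1     # first touch: cold miss
        stack.insert(0, addr)        # the accessed line becomes most recent
    return counters


def misses_with_ways(counters, ways):
    """Misses the thread would see if it effectively held only `ways` ways."""
    return sum(counters[ways:])


# Hypothetical toy trace of line addresses that all map to one 2-way set.
profile = stack_distance_profile(["a", "b", "a", "c", "b", "a"], assoc=2)
print(profile)                       # [0, 1, 5]
print(misses_with_ways(profile, 2))  # 5 misses with both ways available
print(misses_with_ways(profile, 1))  # 6 misses if sharing leaves only one way
```

The difference between the last two values (one extra miss) is the kind of quantity the abstract calls extra L2 cache misses due to sharing. How many ways each thread effectively retains when co-scheduled is what the paper's models estimate, and that step is not reproduced here.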
Pages: 340-351
Number of pages: 12