KPart: A Hybrid Cache Partitioning-Sharing Technique for Commodity Multicores

被引:96
作者
El-Sayed, Nosayba [1 ]
Mukkara, Anurag [1 ]
Tsai, Po-An [1 ]
Kasture, Harshad [1 ,3 ]
Ma, Xiaosong [2 ]
Sanchez, Daniel [1 ]
机构
[1] MIT, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA
[2] HBKU, Qatar Comp Res Inst, Doha, Qatar
[3] Oracle Labs, Redwood Shores, CA USA
来源
2018 24TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA) | 2018年
基金
美国国家科学基金会;
关键词
cache partitioning; multicore architectures; performance isolation; application clustering;
D O I
10.1109/HPCA.2018.00019
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Cache partitioning is now available in commercial hardware. In theory, software can leverage cache partitioning to use the last-level cache better and improve performance. In practice, however, current systems implement way-partitioning, which offers a limited number of partitions and often hurts performance. These limitations squander the performance potential of smart cache management. We present KPart, a hybrid cache partitioning-sharing technique that sidesteps the limitations of way-partitioning and unlocks significant performance on current systems. KPart first groups applications into clusters, then partitions the cache among these clusters. To build clusters, KPart relies on a novel technique to estimate the performance loss an application suffers when sharing a partition. KPart automatically chooses the number of clusters, balancing the isolation benefits of way-partitioning with its potential performance impact. KPart uses detailed profiling information to make these decisions. This information can be gathered either offline, or online at low overhead using a novel profiling mechanism. We evaluate KPart in a real system and in simulation. KPart improves throughput by 24% on average (up to 79%) on an Intel Broadwell-D system, whereas prior per-application partitioning policies improve throughput by just 1.7% on average and hurt 30% of workloads. Simulation results show that KPart achieves most of the performance of more advanced partitioning techniques that are not yet available in hardware.
引用
收藏
页码:104 / 117
页数:14
相关论文
共 66 条
[1]  
ALBONESI D, 1999, MICRO 32
[2]  
[Anonymous], MICRO 39
[3]  
[Anonymous], WDDD 7
[4]  
[Anonymous], WIOSCA
[5]  
[Anonymous], 2009, ACM SIGOPS OPERATING
[6]  
[Anonymous], 2016, IISWC
[7]  
Balasubramonian R., 2000, MICRO 33
[8]   The case for energy-proportional computing [J].
Barroso, Luiz Andre ;
Hoelzle, Urs .
COMPUTER, 2007, 40 (12) :33-+
[9]  
Beckmann N., 2015, HPCA 21
[10]  
Beckmann N., 2013, PACT 22