ShiftsReduce: Minimizing Shifts in Racetrack Memory 4.0

Cited by: 26

Authors:

Khan, Asif Ali ^{[1]}

Hameed, Fazal ^{[1,2]}

Blaesing, Robin ^{[3]}

Parkin, Stuart S. P. ^{[3]}

Castrillon, Jeronimo ^{[1]}

Affiliations:

[1] Tech Univ Dresden, Chair Compiler Construct, Dresden, Germany

[2] Inst Space Technol, Islamabad, Pakistan

[3] Max Planck Inst Microstruct Phys, D-06120 Halle, Saale, Germany

Source:

ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION | 2019 / Vol. 16 / No. 4

Keywords:

Compiler optimization; data placement; racetrack memory; domain wall memory; shifts minimization; integer linear programming; heuristics; PHASE-CHANGE MEMORY; PERFORMANCE; OPTIMIZATION; ASSIGNMENT; TORQUE; CACHE;

DOI:

10.1145/3372489

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812;

Abstract:

Racetrack memories (RMs) have significantly evolved since their conception in 2008, making them a serious contender in the field of emerging memory technologies. Despite key technological advancements, the access latency and energy consumption of an RM-based system are still highly influenced by the number of shift operations. These operations are required to move bits to the right positions in the racetracks. This article presents data-placement techniques for RMs that maximize the likelihood that consecutive references access nearby memory locations at runtime, thereby minimizing the number of shifts. We present an integer linear programming (ILP) formulation for optimal data placement in RMs, and we revisit existing offset assignment heuristics, originally proposed for random-access memories. We introduce a novel heuristic tailored to a realistic RM and combine it with a genetic search to further improve the solution. We show a reduction in the number of shifts of up to 52.5%, outperforming the state of the art by up to 16.1%.
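To make the link between data placement and shift count concrete, here is a minimal Python sketch. It is not the ShiftsReduce heuristic, the ILP formulation, or the genetic search from the article. It assumes a deliberately simplified model: a single track with a single access port, one variable per position, the port initially aligned with the first accessed variable, and a cost of |offset difference| shifts between consecutive accesses. The names `shift_cost` and `best_placement` and the sample trace are hypothetical and chosen only for illustration.

```python
from itertools import permutations


def shift_cost(trace, placement):
    """Total shifts for an access trace under a placement (variable -> offset).

    Assumption: moving the port from offset a to offset b costs |b - a| shifts,
    and the first access is free because the port starts there.
    """
    offsets = [placement[v] for v in trace]
    return sum(abs(b - a) for a, b in zip(offsets, offsets[1:]))


def best_placement(trace):
    """Exhaustively try every placement; only feasible for a handful of variables."""
    variables = sorted(set(trace))
    best = None
    for perm in permutations(range(len(variables))):
        placement = dict(zip(variables, perm))
        cost = shift_cost(trace, placement)
        if best is None or cost < best[0]:
            best = (cost, placement)
    return best


# Hypothetical access trace over four variables.
trace = ["a", "b", "a", "c", "a", "b", "d"]

# Baseline: place variables in alphabetical order (a=0, b=1, c=2, d=3).
naive = {v: i for i, v in enumerate(sorted(set(trace)))}
naive_cost = shift_cost(trace, naive)

best_cost, best_map = best_placement(trace)
print(naive_cost, best_cost, best_map)
```

For this trace, the alphabetical placement costs 9 shifts, while the exhaustive search finds a placement costing 6 by putting the frequently alternating variables next to each other. Exhaustive search grows factorially with the number of variables, which is why formulations such as ILP and heuristics are of interest for realistic inputs.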

References

Pages: 23

64 entries in total

