ShiftsReduce: Minimizing Shifts in Racetrack Memory 4.0

被引:26
作者
Khan, Asif Ali [1 ]
Hameed, Fazal [1 ,2 ]
Blaesing, Robin [3 ]
Parkin, Stuart S. P. [3 ]
Castrillon, Jeronimo [1 ]
机构
[1] Tech Univ Dresden, Chair Compiler Construct, Dresden, Germany
[2] Inst Space Technol, Islamabad, Pakistan
[3] Max Planck Inst Microstruct Phys, D-06120 Halle, Saale, Germany
关键词
Compiler optimization; data placement; racetrack memory; domain wall memory; shifts minimization; integer linear programming; heuristics; PHASE-CHANGE MEMORY; PERFORMANCE; OPTIMIZATION; ASSIGNMENT; TORQUE; CACHE;
D O I
10.1145/3372489
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Racetrack memories (RMs) have significantly evolved since their conception in 2008, making them a serious contender in the field of emerging memory technologies. Despite key technological advancements, the access latency and energy consumption of an RM-based system are still highly influenced by the number of shift operations. These operations are required to move bits to the right positions in the racetracks. This article presents data-placement techniques for RMs that maximize the likelihood that consecutive references access nearby memory locations at runtime, thereby minimizing the number of shifts. We present an integer linear programming (ILP) formulation for optimal data placement in RMs, and we revisit existing offset assignment heuristics, originally proposed for random-access memories. We introduce a novel heuristic tailored to a realistic RM and combine it with a genetic search to further improve the solution. We show a reduction in the number of shifts of up to 52.5%, outperforming the state of the art by up to 16.1%.
引用
收藏
页数:23
相关论文
共 64 条
  • [1] [Anonymous], 2018, GUROBI OPTIMIZER REF
  • [2] Atoofian E, 2015, INT CONF COMPIL ARCH, P177, DOI 10.1109/CASES.2015.7324558
  • [3] Atri S., 2000, PROC INT WORKSHOP LA, P158
  • [4] BARTLEY DH, 1992, SOFTWARE PRACT EXPER, V22, P101, DOI 10.1002/spe.4380220202
  • [5] Cache-Conscious Data Placement
    Calder, B
    Krintz, C
    John, S
    Austin, T
    [J]. ACM SIGPLAN NOTICES, 1998, 33 (11) : 139 - 149
  • [6] Efficient Data Placement for Improving Data Access Performance on Domain-Wall Memory
    Chen, Xianzhang
    Sha, Edwin Hsing-Mean
    Zhuge, Qingfeng
    Xue, Chun Jason
    Jiang, Weiwen
    Wang, Yuangang
    [J]. IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2016, 24 (10) : 3094 - 3104
  • [7] Desplas N, 2014, INVESTIG TUR, P1
  • [8] Performance and Energy-Efficient Design of STT-RAM Last-Level Cache
    Hameed, Fazal
    Khan, Asif Ali
    Castrillon, Jeronimo
    [J]. IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2018, 26 (06) : 1059 - 1072
  • [9] Current driven domain wall velocities exceeding the spin angular momentum transfer rate in permalloy nanowires
    Hayashi, Masamitsu
    Thomas, Luc
    Rettner, Charles
    Moriya, Rai
    Bazaliy, Yaroslaw B.
    Parkin, Stuart S. P.
    [J]. PHYSICAL REVIEW LETTERS, 2007, 98 (03)
  • [10] Jinger M., 2013, PROC 16 INT WORKSHOP, P31