Data Placement and Duplication for Embedded Multicore Systems With Scratch Pad Memory

被引:60
作者
Guo, Yibo [1 ]
Zhuge, Qingfeng [3 ]
Hu, Jingtong [1 ]
Yi, Juan [3 ]
Qiu, Meikang [2 ]
Sha, Edwin H. -M. [1 ,3 ]
机构
[1] Univ Texas Dallas, Dept Comp Sci, Richardson, TX 75080 USA
[2] Univ Kentucky, Dept Elect & Comp Engn, Lexington, KY 40506 USA
[3] Chongqing Univ, Coll Comp Sci, Chongqing 400044, Peoples R China
基金
美国国家科学基金会;
关键词
Data duplication; data placement; embedded systems; multicore; scratch pad memory; MULTIPROCESSOR;
D O I
10.1109/TCAD.2013.2238990
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Scratch pad memories (SPM) are attractive alternatives for caches on multicore systems since caches are relatively expensive in terms of area and energy consumption. The key to effectively utilizing SPMs on multicore systems is the data placement algorithm. In this paper, two polynomial time algorithms, regional data placement for multicore (RDPM) and regional data placement for multicore with duplication (RDPM-DUP), have been proposed to generate near-optimal data placement with minimum total cost. There is only one copy for each data in RDPM, while RDPM-DUP allows data duplication. Experimental results show that the proposed RDPM algorithm alone can reduce the time cost of memory accesses by 32.68% on average compared with existing algorithms. With data duplication, the RDPM-DUP algorithm further reduces the time cost by 40.87%. In terms of energy consumption, the proposed RDPM algorithm with exclusive copy can reduce the total cost by 33.47% on average. When RDPM-DUP is applied, the improvement increases up to 38.15% on average.
引用
收藏
页码:809 / 817
页数:9
相关论文
共 37 条
  • [1] Avissar Oren, 2002, ACM Trans. on Embedded Computing Systems (TECS), V1, P6, DOI [10.1145/581888.581891, DOI 10.1145/581888.581891]
  • [2] Banakar R, 2002, CODES 2002: PROCEEDINGS OF THE TENTH INTERNATIONAL SYMPOSIUM ON HARDWARE/SOFTWARE CODESIGN, P73, DOI 10.1109/CODES.2002.1003604
  • [3] Thousand core chips-a technology perspective
    Borkar, Shekhar
    [J]. 2007 44TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, VOLS 1 AND 2, 2007, : 746 - 749
  • [4] A survey on hardware-aware and heterogeneous computing on multicore processors and accelerators
    Buchty, Rainer
    Heuveline, Vincent
    Karl, Wolfgang
    Weiss, Jan-Philipp
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2012, 24 (07) : 663 - 675
  • [5] Che WJ, 2010, DES AUT TEST EUROPE, P1118
  • [6] Chen G, 2006, DES AUT TEST EUROPE, P929
  • [7] Dominguez Angel., 2005, J EMBEDDED COMPUTING, V1, P521
  • [8] Gilani S., 2011, DATE'11, P1
  • [9] Optimal Data Placement for Memory Architectures with Scratch-Pad Memories
    Guo, Yibo
    Zhuge, Qingfeng
    Hu, Jingtong
    Sha, Edwin H. -M.
    [J]. TRUSTCOM 2011: 2011 INTERNATIONAL JOINT CONFERENCE OF IEEE TRUSTCOM-11/IEEE ICESS-11/FCST-11, 2011, : 1045 - 1050
  • [10] MiBench: A free, commercially representative embedded benchmark suite
    Guthaus, MR
    Ringenberg, JS
    Ernst, D
    Austin, TM
    Mudge, T
    Brown, RB
    [J]. WWC-4: IEEE INTERNATIONAL WORKSHOP ON WORKLOAD CHARACTERIZATION, 2001, : 3 - 14