Dynamic scratch-pad memory management with data pipelining for embedded systems

被引:5
|
作者
Yang, Yanqin [2 ,3 ]
Wang, Meng [1 ]
Yan, Haijin [4 ]
Shao, Zili [1 ]
Guo, Minyi [2 ]
机构
[1] Hong Kong Polytech Univ, Dept Comp, Kowloon, Hong Kong, Peoples R China
[2] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200030, Peoples R China
[3] E China Normal Univ, Dept Comp Sci & Technol, Shanghai 200062, Peoples R China
[4] Motorola Inc, Chicago, IL USA
关键词
scratch-pad memory management; data pipelining; embedded systems;
D O I
10.1002/cpe.1602
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this paper, we propose an effective data pipelining technique, SPDP (Scratch-Pad Data Pipelining), for dynamic scratch-pad memory (SPM) management with DMA (Direct Memory Access). Our basic idea is to overlap the execution of CPU instructions and DMA operations. In SPDP, based on the iteration access patterns of arrays, we group multiple iterations into a block to improve the data locality of regular array accesses. We allocate the data of multiple iterations into different portions of the SPM. In this way, when the CPU executes instructions and accesses data from one portion of the SPM, DMA operations can be performed to transfer data between the off-chip memory and another portion of SPM simultaneously. We perform code transformation to insert DMA instructions to achieve the data pipelining. We have implemented our SPDP technique with the IMPACT compiler, and conduct experiments using a set of loop kernels from DSPstone, Mibench, and Mediabench on the cycle-accurate VLIW simulator of Trimaran. The experimental results show that our technique achieves performance improvement compared with the previous work. Copyright (C) 2010 John Wiley & Sons, Ltd.
引用
收藏
页码:1874 / 1892
页数:19
相关论文
共 38 条
  • [1] Optimizing Data Distribution for Loops on Embedded Multicore with Scratch-Pad Memory
    Gao, Qiuyan
    Zhuge, Qingfeng
    Zhang, Jun
    Zhu, Guanyu
    Sha, Edwin H. -M.
    JOURNAL OF COMPUTERS, 2014, 9 (05) : 1066 - 1076
  • [2] Memory Allocation for Embedded Systems with a Compile-Time-Unknown Scratch-Pad Size
    Nguyen, Nghi
    Dominguez, Angel
    Barua, Rajeev
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2009, 8 (03)
  • [3] DRDU: A data reuse analysis technique for efficient scratch-pad memory management
    Issenin, Ilya
    Brockmeyer, Erik
    Miranda, Miguel
    Dutt, Nikil
    ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2007, 12 (02)
  • [4] Implementation of Stack Data Placement and Run Time Management Using a Scratch-Pad Memory for Energy Consumption Reduction of Embedded Applications
    Gauthier, Lovic
    Ishihara, Tohru
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2011, E94A (12) : 2597 - 2608
  • [5] Data Placement and Duplication for Embedded Multicore Systems With Scratch Pad Memory
    Guo, Yibo
    Zhuge, Qingfeng
    Hu, Jingtong
    Yi, Juan
    Qiu, Meikang
    Sha, Edwin H. -M.
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2013, 32 (06) : 809 - 817
  • [6] Fine-grain Dynamic Instruction Placement for L0 Scratch-Pad Memory
    Park, Jongsoo
    Balfour, James
    Dally, William J.
    PROCEEDINGS OF THE 2010 INTERNATIONAL CONFERENCE ON COMPILERS, ARCHITECTURES AND SYNTHESIS FOR EMBEDDED SYSTEMS (CASES '10), 2010, : 137 - 146
  • [7] Energy efficiency of scratch-pad memory at 65 nm and below: An empirical study
    Takase, Hideki
    Tomiyama, Hiroyuki
    Zeng, Gang
    Takada, Hiroaki
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS, 2008, : 93 - 97
  • [8] Energy efficiency of scratch-pad memory in deep submicron domains: an empirical study
    Takase, Hideki
    Tomiyama, Hiroyuki
    Zeng, Gang
    Takada, Hiroaki
    IEICE ELECTRONICS EXPRESS, 2008, 5 (23) : 1010 - 1016
  • [9] MCAMP: Communication Optimization on Massively Parallel Machines with Hierarchical Scratch-pad Memory
    Hayashizaki, Hiroshige
    Sugawara, Yutaka
    Inaba, Mary
    Hiraki, Kei
    PACT'08: PROCEEDINGS OF THE SEVENTEENTH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, 2008, : 102 - 111
  • [10] Compiler-directed scratch pad memory optimization for embedded multiprocessors
    Kandemir, M
    Kadayif, I
    Choudhary, A
    Ramanujam, J
    Kolcu, I
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2004, 12 (03) : 281 - 287