Dynamic scratch-pad memory management with data pipelining for embedded systems

Cited by: 5

Authors:

Yang, Yanqin [2,3]

Wang, Meng [1]

Yan, Haijin [4]

Shao, Zili [1]

Guo, Minyi [2]

Affiliations:

[1] Hong Kong Polytech Univ, Dept Comp, Kowloon, Hong Kong, Peoples R China

[2] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200030, Peoples R China

[3] E China Normal Univ, Dept Comp Sci & Technol, Shanghai 200062, Peoples R China

[4] Motorola Inc, Chicago, IL USA

Source:

CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE | 2010 / Vol. 22 / Issue 13

Keywords:

scratch-pad memory management; data pipelining; embedded systems;

DOI:

10.1002/cpe.1602

Chinese Library Classification number:

TP31 [Computer Software];

Discipline classification code:

081202 ; 0835 ;

Abstract:

In this paper, we propose an effective data pipelining technique, SPDP (Scratch-Pad Data Pipelining), for dynamic scratch-pad memory (SPM) management with DMA (Direct Memory Access). The basic idea is to overlap the execution of CPU instructions with DMA operations. In SPDP, based on the iteration access patterns of arrays, we group multiple iterations into a block to improve the data locality of regular array accesses, and we allocate the data of multiple iterations to different portions of the SPM. In this way, while the CPU executes instructions and accesses data in one portion of the SPM, DMA operations can simultaneously transfer data between the off-chip memory and another portion of the SPM. We perform code transformation to insert the DMA instructions that achieve this data pipelining. We have implemented the SPDP technique in the IMPACT compiler and conducted experiments using a set of loop kernels from DSPstone, Mibench, and Mediabench on the cycle-accurate VLIW simulator of Trimaran. The experimental results show that our technique improves performance compared with previous work. Copyright (C) 2010 John Wiley & Sons, Ltd.
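The overlap described in the abstract can be illustrated with a minimal double-buffering sketch in C. This is not the authors' SPDP code transformation: `BLOCK`, `spm`, `dma_start`, `dma_wait`, and `pipelined_sum` are hypothetical names, and the DMA engine is simulated with a synchronous `memcpy`, so the sketch shows only the ordering of transfers and computation, not real concurrency.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

#define BLOCK 4  /* iterations grouped into one block (assumed size) */

/* Two SPM portions: the CPU reads one while DMA fills the other. */
static int spm[2][BLOCK];

/* Hypothetical stand-in for starting an asynchronous DMA transfer.
 * Simulated with memcpy; real hardware would return immediately. */
static void dma_start(int *dst, const int *src, size_t n) {
    memcpy(dst, src, n * sizeof *src);
}

/* Hypothetical stand-in for waiting on DMA completion (no-op here). */
static void dma_wait(void) {}

/* Sum n elements of an "off-chip" array, one block at a time.
 * Assumes n is a multiple of BLOCK. */
long pipelined_sum(const int *a, size_t n) {
    assert(n % BLOCK == 0);
    size_t nblocks = n / BLOCK;
    long sum = 0;
    int cur = 0;

    if (nblocks == 0)
        return 0;

    /* Prologue: fetch the first block before the loop starts. */
    dma_start(spm[cur], a, BLOCK);

    for (size_t b = 0; b < nblocks; b++) {
        dma_wait();  /* block b is now resident in spm[cur] */

        /* Start fetching block b+1 into the other SPM portion. */
        if (b + 1 < nblocks)
            dma_start(spm[cur ^ 1], a + (b + 1) * BLOCK, BLOCK);

        /* The CPU computes on the current portion meanwhile. */
        for (size_t i = 0; i < BLOCK; i++)
            sum += spm[cur][i];

        cur ^= 1;  /* swap the roles of the two portions */
    }
    return sum;
}
```

With a real asynchronous DMA engine, the inner loop over `spm[cur]` would run while the transfer into `spm[cur ^ 1]` is still in flight, which is where the performance gain would come from.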


Pages: 1874-1892

Page count: 19

