Partitioning and scheduling DSP applications with maximal memory access hiding

被引:6
|
作者
Wang, Z [1 ]
Sha, EHM
Wang, YK
机构
[1] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA
[2] Univ Texas, Dept Comp Sci, Richardson, TX 75083 USA
关键词
loop pipelining; initial data; maximal overlap; balanced partition scheduling;
D O I
10.1155/S1110865702205041
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents an iteration space partitioning scheme to reduce the CPU idle time due to the long memory access latency. We take into consideration both the data accesses of intermediate and initial data. An algorithm is proposed to find the largest overlap for initial data to reduce the entire memory traffic. In order to efficiently hide the memory latency, another algorithm is developed to balance the ALU and memory schedules. The experiments on DSP benchmarks show that the algorithms significantly outperform the known existing methods.
引用
收藏
页码:926 / 935
页数:10
相关论文
共 50 条
  • [1] Partitioning and Scheduling DSP Applications with Maximal Memory Access Hiding
    Zhong Wang
    Edwin Hsing-Mean Sha
    Yuke Wang
    EURASIP Journal on Advances in Signal Processing, 2002
  • [2] Partitioning and scheduling DSP applications with maximal memory access hiding
    Wang, Zhong
    Sha, Edwin Hsing-Mean
    Wang, Yuke
    1600, Hindawi Publishing Corporation (2002):
  • [3] Optimal two level partitioning and loop scheduling for hiding memory latency for DSP applications
    Wang, Z
    Kirkpatrick, M
    Sha, EHM
    37TH DESIGN AUTOMATION CONFERENCE, PROCEEDINGS 2000, 2000, : 540 - 545
  • [4] Minimization of memory access overhead for multidimensional DSP applications via multilevel partitioning and scheduling
    Wang, JQY
    Sha, EHM
    Passos, NL
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 1997, 44 (09) : 741 - 753
  • [5] LOOP SCHEDULING WITH MEMORY ACCESS REDUCTION UNDER REGISTER CONSTRAINTS FOR DSP APPLICATIONS
    Wang, Meng
    Liu, Duo
    Wang, Yi
    Shao, Zili
    SIPS: 2009 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS, 2009, : 139 - 144
  • [6] Loop scheduling with memory access reduction subject to register constraints for DSP applications
    Wang, Yi
    Jia, Zhiping
    Chen, Renhai
    Wang, Meng
    Liu, Duo
    Shao, Zili
    SOFTWARE-PRACTICE & EXPERIENCE, 2014, 44 (08): : 999 - 1026
  • [7] Iterational Retiming with Partitioning: Loop Scheduling with Complete Memory Latency Hiding
    Xue, Chun Jason
    Hu, Jingtong
    Shao, Zili
    Sha, Edwin
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2010, 9 (03)
  • [8] Efficient variable partitioning and scheduling for DSP processors with multiple memory modules
    Zhuge, QF
    Sha, EHMS
    Xiao, B
    Chantrapornchai, C
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2004, 52 (04) : 1090 - 1099
  • [9] Optimal loop scheduling for hiding memory latency based on two-level partitioning and prefetching
    Wang, Z
    O'Neil, TW
    Sha, EHM
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2001, 49 (11) : 2853 - 2864
  • [10] Memory access scheduling
    Rixner, S
    Dally, WJ
    Kapasi, UJ
    Mattson, P
    Owens, JD
    PROCEEDING OF THE 27TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, 2000, : 128 - 138