Optimal two level partitioning and loop scheduling for hiding memory latency for DSP applications

被引:0
作者
Wang, Z [1 ]
Kirkpatrick, M [1 ]
Sha, EHM [1 ]
机构
[1] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA
来源
37TH DESIGN AUTOMATION CONFERENCE, PROCEEDINGS 2000 | 2000年
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The large latency of memory accesses in modem computers is a key obstacle in achieving high processor utilization. To hide this latency, this paper proposes a new memory management technique that can be applied to computer architectures with three levels of memory. The technique takes advantage of access pattern information that is available at compile time by prefetching certain data elements from the higher level memory. It as well maintains certain data for a period of time to prevent unnecessary data swapping. Data locality is much improved compared with the usual pattern by partitioning the iteration space and reducing execution in each partition. These combined approaches lead to improvements in average execution times of approximately 35% over the one-level partition algorithm and more than 80% over list scheduling and hardware prefetching.
引用
收藏
页码:540 / 545
页数:6
相关论文
共 50 条
[21]   Scheduling loop applications in software distributed shared memory systems [J].
Liang, Tyng-Yeu ;
Shieh, Ce-Kuen ;
Liu, Deh-Cheng .
IEICE Transactions on Information and Systems, 2000, E83-D (09) :1721-1730
[22]   Scheduling loop applications in software distributed shared memory systems [J].
Liang, TY ;
Shieh, CK ;
Liu, DC .
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2000, E83D (09) :1721-1730
[23]   Memory Partitioning-Based Modulo Scheduling for High-level Synthesis [J].
Lu, Tianyi ;
Yin, Shouyi ;
Yao, Xianqing ;
Xie, Zhicong ;
Liu, Leibo .
2017 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2017, :2671-2674
[24]   Joint Modulo Scheduling and Memory Partitioning with Multi-Bank Memory for High-Level Synthesis [J].
Lu, Tianyi ;
Yin, Shouyi ;
Yao, Xianqing ;
Xie, Zhicong ;
Liu, Leibo ;
Wei, Shaojun .
FPGA'17: PROCEEDINGS OF THE 2017 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS, 2017, :290-290
[25]   Identifying the optimal level of parallelism in transactional memory applications [J].
Didona, Diego ;
Felber, Pascal ;
Harmanci, Derin ;
Romano, Paolo ;
Schenker, Joerg .
COMPUTING, 2015, 97 (09) :939-959
[26]   Identifying the optimal level of parallelism in transactional memory applications [J].
Diego Didona ;
Pascal Felber ;
Derin Harmanci ;
Paolo Romano ;
Jörg Schenker .
Computing, 2015, 97 :939-959
[27]   Two-level information hiding for quantum images using optimal LSB [J].
Luo, Gaofeng ;
Zhou, Ri-Gui ;
Mao, Yingling .
QUANTUM INFORMATION PROCESSING, 2019, 18 (10)
[28]   A delay-optimal static scheduling of DSP applications mapped onto multiprocessor architectures [J].
Itradat, Awni ;
Ahmad, M. O. ;
Shatnawi, Ali .
PAR ELEC 2006: INTERNATIONAL SYMPOSIUM ON PARALLEL COMPUTING IN ELECTRICAL ENGINEERING, PROCEEDINGS, 2006, :386-+
[29]   Two-level information hiding for quantum images using optimal LSB [J].
Gaofeng Luo ;
Ri-Gui Zhou ;
Yingling Mao .
Quantum Information Processing, 2019, 18
[30]   A multi-level memory system architecture for high performance DSP applications [J].
Agarwala, S ;
Fuoco, C ;
Anderson, T ;
Comisky, D ;
Mobley, C .
2000 IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN: VLSI IN COMPUTERS & PROCESSORS, PROCEEDINGS, 2000, :408-413