Fast memory bank assignment for fixed-point digital signal processors

被引:10
作者
Cho, J
Paek, Y [1 ]
Whalley, D
机构
[1] Seoul Natl Univ, Sch Elect Engn, Seoul 151744, South Korea
[2] Korea Adv Inst Sci & Technol, Seoul, South Korea
[3] Florida State Univ, Tallahassee, FL 32306 USA
关键词
algorithms; compiler; dependence analysis; DSP; dual memory banks; maximum spanning tree; nonorthogonal architecture;
D O I
10.1145/966137.966140
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Most vendors of digital signal processors (DSPs) support a Harvard architecture, which has two or more memory buses, one for program and one or more for data and allow the processor to access multiple words of data from memory in a single instruction cycle. Also, many existing fixed-point DSPs are known to have an irregular architecture with heterogeneous registers, which contains multiple register files that are distributed and dedicated to different sets of instructions. Although there have been several studies conducted to efficiently assign data to multimemory banks, most of them assumed processors with relatively simple, homogeneous general-purpose registers. Thus, several vendor-provided compilers for DSPs that we examined were unable to efficiently assign data to multiple data memory banks, thereby often failing to generate highly optimized code for their machines. As a consequence, programmers for these DSPs often manually assign program variables to memories so as to fully utilize multimemory banks in their code. This paper reports on our recent attempt to address this problem by presenting an algorithm that helps the compiler to efficiently assign data to multimemory banks. Our algorithm differs from previous work in that it assigns variables to memory banks in separate, decoupled code generation phases, instead of a single, tightly coupled phase. The experimental results have revealed that our decoupled algorithm greatly simplifies our code generation process; thus our compiler runs extremely fast, yet generates target code that is comparable in quality to the code generated by a coupled approach.
引用
收藏
页码:52 / 74
页数:23
相关论文
共 11 条
[1]  
[Anonymous], P INT S MICR MICRO
[2]  
Araujo G., 1998, ACM Transactions on Design Automation of Electronic Systems, V3, P136, DOI 10.1145/290833.290837
[3]   Compiler support for scalable and efficient memory systems [J].
Barua, R ;
Lee, W ;
Amarasinghe, S ;
Agarwal, A .
IEEE TRANSACTIONS ON COMPUTERS, 2001, 50 (11) :1234-1247
[4]  
PANDA PR, 1999, P INT C COMP AID DES, P477
[5]   SHORTEST CONNECTION NETWORKS AND SOME GENERALIZATIONS [J].
PRIM, RC .
BELL SYSTEM TECHNICAL JOURNAL, 1957, 36 (06) :1389-1401
[6]   Simultaneous reference allocation in code generation for dual data memory bank ASIPs [J].
Sudarsanam, A ;
Malik, S .
ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2000, 5 (02) :242-264
[7]  
[No title captured]
[8]  
[No title captured]
[9]  
[No title captured]
[10]  
[No title captured]