Design and Implementation of Memory Access Fast Switching Structure in Cluster-Based Reconfigurable Array Processor

被引:0
作者
Shan R. [1 ]
Jiang L. [2 ]
Deng J. [2 ]
Li X. [3 ]
Shen X. [1 ]
机构
[1] School of Micro-electronics, Xidian University, Xi'an
[2] School of Electronic Engineering, Xi'an University of Posts and Telecommunication, Xi'an
[3] School of Computer Science & Technology, Xi'an University of Posts and Telecommunication, Xi'an
来源
Journal of Beijing Institute of Technology (English Edition) | 2017年 / 26卷 / 04期
基金
中国国家自然科学基金;
关键词
Array processor; Distributed memory; Memory access; Switching structure;
D O I
10.15918/j.jbit1004-0579.201726.0409
中图分类号
学科分类号
摘要
Memory access fast switching structures in cluster are studied, and three kinds of fast switching structures (FS, LR2SS, and LAPS) are proposed. A mixed simulation test bench is constructed and used for statistic of data access delay among these three structures in various cases. Finally these structures are realized on Xilinx FPGA development board and DCT, FFT, SAD, IME, FME, and de-blocking filtering algorithms are mapped onto the structures. Compared with available architectures, our proposed structures have lower data access delay and lower area. © 2017 Editorial Department of Journal of Beijing Institute of Technology.
引用
收藏
页码:494 / 504
页数:10
相关论文
共 21 条
[1]  
Shi C., Yang J., Han Y., Et al., A 1000 fps vision chip based on a dynamically reconfigurable hybrid architecture comprising a PE array processor and self-organizing map neural network, IEEE Journal of Solid-State Circuits, 49, 9, pp. 2067-2082, (2014)
[2]  
Yang C., Liu L., Wang Y., Et al., Configuration approaches to enhance computing efficiency of coarse-grained reconfigurable array, Journal of Circuits System & Computers, 24, 3, pp. 426-429, (2015)
[3]  
Patel K., Bleakley C.J., Coarse grained reconfigurable array based architecture for low power real-time seizure detection, Journal of Signal Processing Systems, 82, 1, pp. 55-68, (2016)
[4]  
Tang C., Liu D., Xing Z., Et al., Memory access analysis of many-core system with abundant bandwidth, IEEE International Symposium on Embedded Multicore/many-Core Systems-On-Chip, (2015)
[5]  
Chen Y., Liu L.B., Yin S.Y., Et al., Efficient and flexible memory architecture to alleviate data and context bandwidth bottlenecks of coarse-grained reconfigurable arrays, Science China Physics, Mechanics & Astronomy, 57, 12, pp. 2214-2227, (2014)
[6]  
Liu Y., Zhang W., Scratchpad memory architectures and allocation algorithms for hard real-time multicore processors, Journal of Computing Science & Engineering, 9, 2, pp. 51-72, (2015)
[7]  
Chakraborty P., Panda P.R., Sen S., Partitioning and data mapping in reconfigurable cache and scratchpad memory-based architectures, Acm Transactions on Design Automation of Electronic Systems, 22, 1, pp. 1-25, (2016)
[8]  
Nouri S., Hussain W., Nurmi J., Implementation of IEEE-802. 11a/g receiver blocks on a coarse-grained reconfigurable array, Design and Architectures for Signal and Image Processing, (2015)
[9]  
Majzoub S., Diab H., MorphoSys reconfigurable hardware for cryptography: the two fish case, The Journal of Supercomputing, 59, 1, pp. 22-41, (2012)
[10]  
Bell S., Edwards B., Amann J., Et al., TILE64-processor: a 64-Core SoC with mesh interconnect, IEEE International Solid-state Circuits Conference, (2008)