Hardware Implementation of Reconfigurable 1D Convolution

被引:0
作者
Lei Rao
Bin Zhang
Jizhong Zhao
机构
[1] Xi’an Jiaotong University,
来源
Journal of Signal Processing Systems | 2016年 / 82卷
关键词
Convolution; Field programmable gate array; Reconfigurable architecture;
D O I
暂无
中图分类号
学科分类号
摘要
Convolution has been extensively used in image processing and computer vision, including image enhancement, smoothing, and structure extraction. However, convolution operation typically requires a significant amount of computing resources. A novel one-dimensional (1D) convolution processor with reconfigurable architecture is implemented in this study. This processor is a combination of a line buffer, controller units, as well as a reconfigurable and separable convolution module. The use of a reconfigurable architecture and separable convolution approach improves the flexibility and performance of the convolution processor. The reconfigurable and separable convolution array, which is the main component of the processor, can simultaneously execute convolution operation with different kernels, with a maximum kernel size of up to 24 × 24. Experimental results show that the maximum frames rate of the processor is approximately 194 frames per second (fps), which exceeds the real-time requirement. Synthesis results show that the processor occupies 13.39 mm 2 at a 204 MHz system clock and consumes a power of 419 mW at maximum kernel size at a 120 MHz system clock in SMIC 0.18 μm CMOS technology. Verification experiments on field programmable gate arrays (FPGAs) demonstrate that the processor is suitable for real-time image processing applications even for high-resolution images.
引用
收藏
页码:1 / 16
页数:15
相关论文
共 55 条
[1]  
Chan C(2014)Postreconstruction nonlocal means filtering of whole-body PET with an anatomical prior IEEE Transactions on Medical Imaging 33 636-650
[2]  
Fulton R(2014)A new approach for the extraction of aboveground circular structures from Near-Nadir VHR satellite imagery IEEE Transactions on Geoscience and Remote Sensing 52 3125-3140
[3]  
Barnett R(2014)A 0.18-μm CMOS SoC for a 100-m-range 10-frame/s 200 × 96-pixel time-of-flight depth sensor IEEE Journal of Solid-State Circuits 49 315-330
[4]  
Feng DD(2013)Single-channel transient interference suppression with diffusion maps IEEE Transactions on Audio, Speech, and Language Processing 21 132-144
[5]  
Meikle S(2014)Compressive circulant matrix based analog to information conversion IEEE Signal Processing Letters 21 428-431
[6]  
Ok AO(2014)Determination of displacement from an image sequence based on time-reversal invariance IEEE Transactions on Geoscience and Remote Sensing 52 2575-2592
[7]  
Niclass C(2013)Multicasting mesh AER: A scalable assembly approach for reconfigurable neuromorphic structured AER systems Application to convNets, IEEE Transactions on Biomedical Circuits and Systems 7 82-102
[8]  
Soga M(2004)An optimized architecture for implementing image convolution with reconfigurable hardware Proceedings of the 2004 World Automation Congress 16 131-136
[9]  
Matsubara H(2007)A multiwindow partial buffering scheme for FPGA-Based 2-D convolvers IEEE Transactions on Circuits and Systems II: Express Briefs 54 200-204
[10]  
Ogawa M(2006)Area-efficient 2-D shift-variant convolvers for FPGA-based digital image processing IEEE Transactions on Circuits and Systems II: Express Briefs 53 105-109