Parallel embedded processor architecture for FPGA-based image processing using parallel software skeletons

被引:0
作者
Hanen Chenini
Jean Pierre Dérutin
Romuald Aufrère
Roland Chapuis
机构
[1] Blaise Pascal University,Institut Pascal
[2] Blaise Pascal University,UMR 6602 CNRS
关键词
Unify Modeling Language; Processing Node; Direct Memory Access; Image Processing Application; Hardware Description Language;
D O I
10.1186/1687-6180-2013-153
中图分类号
学科分类号
摘要
Today, the problem of designing suitable multiprocessor architecture tailored for a target application field raises the need for a fast and efficient multiprocessor system-on-chip (MPSoC) design environment. Additionally, the implementation of image processing applications on MPSoC system will need to exploit the parallelism and the pipelining in algorithms with the hope of delivering significant reduction in execution times. To take advantage of parallelization on homogeneous MPSoCs and to reduce the programming effort, the proposed design methodology offers more opportunities for accelerating the parallelization of sequential processing image algorithms on pipeline architecture. Our approach provides rapid prototyping tool as a graphic programming environment (CubeGen). Further, it offers a set of parallel software skeletons as a communication library, providing a software abstraction to enable quick implementation of complex image processing applications on field-programmable gate array (FPGA) platform. The design of homogeneous network of communicating processor is presented from the hardware and software specification down to synthesizable hardware description. Then, we extend our approach to support more complex applications by implementing a soft multiprocessor for 'multihypotheses model-driven approach for road recognition’ and show the impact of various configuration choices (hardware and software) to match the specific application needs. Using the images of a real road scene, the performance results of the road recognition algorithm on a Xilinx Virtex-6 FPGA platform not only achieve the desired latency but also further improve the tracking performance which depends mainly on the number of hypotheses.
引用
收藏
相关论文
共 45 条
  • [1] Chenini H(2012)Fast prototyping of embedded image processing application on homogenous system - a parallel particle filter tracking method on Homogeneous Network of Communicating Processors(HNCP) VISAPP 2 122-133
  • [2] Dérutin JP(1997)Hardware/software co-design IEEE MICRO 85 349-365
  • [3] Chateau T(2009)Performance evaluation of UML2-modeled embedded streaming applications with system-level simulation EURASIP J. Embedded Syst 2009 6:3-6:3
  • [4] Michell GD(2006)Rapid prototyping for heterogeneous multicomponent systems: an MPEG-4 stream over an UMTS communication link EURASIP J. Adv. Signal Process 2006 64369-63
  • [5] Gupta RK(2002)Multiprocessor SoC platforms: a component-based design approach IEEE Design Test Comput 19 52-320
  • [6] Arpinen T(2006)UML-based multiprocessor SoC design framework ACM Trans. Embed. Comput. Syst 5 281-15
  • [7] Salminen E(2007)A SystemC-based design methodology for digital signal processing systems EURASIP J. Embedded Syst 2007 15-1154
  • [8] Hämäläinen TD(2002)Towards a general framework for FPGAs based image processing using hardware skeletons Parallel Comput 28 1141-107
  • [9] Hännikäinen M(2001)A model-driven approach for real-time road recognition Mach. Vis. Appl 13 95-undefined
  • [10] Raulet M(undefined)undefined undefined undefined undefined-undefined