A Scalable Massively Parallel Processor for Real-Time Image Processing

被引:7
作者
Kurafuji, Takashi [1 ]
Haraguchi, Masaru [1 ]
Nakajima, Masami [1 ]
Nishijima, Tetsu [1 ]
Tanizaki, Tetsushi [1 ]
Yamasaki, Hiroyuki [1 ]
Sugimura, Takeaki [1 ]
Imai, Yuta
Ishizaki, Masakatsu [1 ]
Kumaki, Takeshi
Murata, Kan [1 ]
Yoshida, Kanako [1 ]
Shimomura, Eisuke [1 ]
Noda, Hideyuki [1 ]
Okuno, Yoshihiro [1 ]
Kamijo, Shunsuke [3 ]
Koide, Tetsushi [2 ]
Mattausch, Hans Juergen [2 ]
Arimoto, Kazutami [1 ]
机构
[1] Renesas Elect Corp, Itami, Hyogo 6640005, Japan
[2] Hiroshima Univ, Higashi Ku, Hiroshima 7398511, Japan
[3] Univ Tokyo, Tokyo 1130033, Japan
关键词
Image processor; power efficiency; area efficiency; SIMD; scalable architecture; fine grained processing element;
D O I
10.1109/JSSC.2011.2159528
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper describes a high performance scalable massively parallel single-instruction multiple-data (SIMD) processor and power/area efficient real-time image processing. The SIMD processor combines 4-bit processing elements (PEs) with SRAM on a small area and thus enables at the same time a high performance of 191 GOPS, a high power efficiency of 310 GOPS/W, and a high area efficiency of 31.6 GOPS/mm(2). The applied pipeline architecture is optimized to reduce the number of controller overhead cycles so that the SIMD parallel processing unit can be utilized during up to 99% of the operating time of typical application programs. The processor can be also optimized for low cost, low power, and high performance multimedia system-on-a-chip (SoC) solutions. A combination of custom and automated implementation techniques enables scalability in the number of PEs. The processor has two operating modes, a normal frequency (NF) mode for higher power efficiency and a double frequency (DF) mode for higher performance. The combination of high area efficiency, high power efficiency, high performance, and the flexibility of the SIMD processor described in this paper expands the application of real-time image processing technology to a variety of electronic devices.
引用
收藏
页码:2363 / 2373
页数:11
相关论文
共 10 条
  • [1] Abbo A., 2007, 2007 IEEE International Solid-State Circuits Conference (IEEE Cat. No.07CH37858), P270, DOI 10.1109/ISSCC.2007.373398
  • [2] CHENG CC, 2008, 2008 IEEE INT SOL ST, P306
  • [3] Traffic Monitoring and Accident Detection at Intersections
    Kamijo, Shunsuke
    Matsushita, Yasuyuki
    Ikeuchi, Katsushi
    Sakauchi, Masao
    [J]. IEEE Transactions on Intelligent Transportation Systems, 2000, 1 (02) : 108 - 117
  • [4] KIM JY, 2009, 2009 ISSCC, P150
  • [5] Kurafuji Takashi, 2010, 2010 IEEE International Solid-State Circuits Conference (ISSCC), P334, DOI 10.1109/ISSCC.2010.5433910
  • [6] Kyo S, 2003, ISSCC DIG TECH PAP I, V46, P48
  • [7] The design and implementation of the massively parallel processor based on the matrix architecture
    Noda, Hideyuki
    Nakajima, Masami
    Dosaka, Katsumi
    Nakata, Kiyoshi
    Higashida, Motoki
    Yamamoto, Osamu
    Mizumoto, Katsuya
    Tanizaki, Tetsushi
    Gyohten, Takayuki
    Okuno, Yoshihiro
    Kondo, Hiroyuki
    Shimazu, Yukihiko
    Arimoto, Kazutami
    [J]. IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2007, 42 (01) : 183 - 192
  • [8] SATO Y, 2009, ISPACS 2009 DEC, P485
  • [9] SUGIMURA T, 2008, ISPACS 2008 DEC, P146
  • [10] YAMASAKI H, 2010, P COOL CHIPS 13 APR, P395