ViP: A Hierarchical Parallel Vision Processor for Hybrid Vision Chip

被引:6
作者
Zheng, Xuemin [1 ,2 ,3 ]
Cheng, Li [1 ,2 ,3 ]
Zhao, Mingxin [1 ,2 ,3 ]
Luo, Qian [1 ,2 ,3 ]
Li, Honglong [1 ,2 ,3 ]
Dou, Runjiang [1 ,2 ,3 ]
Yu, Shuangming [1 ,2 ,3 ]
Wu, Nanjian [1 ,2 ,3 ]
Liu, Liyuan [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Inst Semicond, State Key Lab Superlattices & Microstruct, Beijing 100083, Peoples R China
[2] Chinese Acad Sci, Ctr Excellence Brain Sci & Intelligence Technol, Beijing 100083, Peoples R China
[3] Univ Chinese Acad Sci, Ctr Mat Sci & Optoelect Engn, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
Convolution; Parallel processing; Artificial neural networks; Kernel; Reduced instruction set computing; Computer architecture; Hardware; Hierarchical parallel; vision chip; computer vision; neural network;
D O I
10.1109/TCSII.2022.3156945
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Nowadays, the vision chip bridging sensing and processing has been extensively employed in high-speed image processing, owing to its excellent performance, low power consumption, and economical cost. However, there is a dilemma in designing processors to support conventional computer vision algorithms and neural networks since the two algorithms have a non-trivial trade-off in proposing a unified architecture. By analyzing computation properties, we propose a novel hierarchical parallel vision processor (ViP) for hybrid vision chips to accelerate both traditional computer vision (CV) and neural network (NN). The ViP architecture includes three parallelism levels: PE for pixel-centric, computing core (CC) for block, and vision core (VC) for global. PEs contain dedicated computing units and data paths for convolution operations without degrading its flexibility. Each CC is driven by customized SIMD instructions and can be dynamically connected for meeting block parallelism requirements. ViP is fabricated in 65nm CMOS technology and achieves a peak performance of 614.4 GOPS and an energy efficiency of 640 GOPS/W at 200 MHz clock frequency. Notably, several experiments on CV and NN are performed, illustrating an ultra-low latency in executing hybrid algorithms.
引用
收藏
页码:2957 / 2961
页数:5
相关论文
共 50 条
  • [41] A Vision Chip for Color Segmentation and Pattern Matching
    Ralph Etienne-Cummings
    Philippe Pouliquen
    M. Anthony Lewis
    EURASIP Journal on Advances in Signal Processing, 2003
  • [42] Development of higher order autocorrelation vision chip
    Yamamoto, K
    Kubozono, M
    Ishii, I
    2005 IEEE/RSJ International Conference on Intelligent Robots and Systems, Vols 1-4, 2005, : 3258 - 3263
  • [43] A vision chip for color segmentation and pattern matching
    Etienne-Cummings, R
    Pouliquen, P
    Lewis, MA
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2003, 2003 (07) : 703 - 712
  • [44] Vision chip architecture with light adaptation mechanism
    Tetsuya Yagi
    Haruo Kobayashi
    Takashi Matsumoto
    Koji Tanaka
    Artificial Life and Robotics, 1998, 2 (1) : 12 - 18
  • [45] A 1000 frames/s Vision Chip Using Scalable Pixel-Neighborhood-Level Parallel Processing
    Schmitz, Joseph A.
    Gharzai, Mahir K.
    Balkir, Sina
    Hoffman, Michael W.
    White, Daniel J.
    Schemm, Nathan
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2017, 52 (02) : 556 - 568
  • [46] Investigation of Hierarchical Spectral Vision Transformer Architecture for Classification of Hyperspectral Imagery
    Liu, Wei
    Prasad, Saurabh
    Crawford, Melba
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [47] A Manycore Vision Processor for Real-Time Smart Cameras
    Silva, Bruno A. da
    Lima, Arthur M.
    Arias-Garcia, Janier
    Huebner, Michael
    Yudi, Jones
    SENSORS, 2021, 21 (21)
  • [48] Parallel implementation of vision algorithms on distributed systems
    You, J
    Hungenahally, S
    1998 SECOND INTERNATIONAL CONFERENCE ON KNOWLEDGE-BASED INTELLIGENT ELECTRONIC SYSTEMS, KES '98, PROCEEDINGS, VOL, 3, 1998, : 354 - 362
  • [49] Optimal parallel algorithms for computer vision problems
    Wu, CH
    Horng, SJ
    Tsai, HR
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2002, 62 (06) : 1021 - 1041
  • [50] Parallel processing in apple grading with machine vision
    Xu, J
    Wang, MH
    ACTUAL TASKS ON AGRICULTURAL ENGINEERING, PROCEEDINGS, 1998, 26 : 311 - 320