Exploring the Tradeoffs of Application-Specific Processing

被引:1
作者
Schabel, Joshua C. [1 ]
Franzon, Paul D. [1 ]
机构
[1] North Carolina State Univ, Dept Elect & Comp Engn, Raleigh, NC 27695 USA
关键词
ASIP; SIMD; CGRA; processing-in-memory; processing-near-memory; HTM; sparsey; artificial neural networks; ARCHITECTURE; SPECIALIZATION; DESIGN;
D O I
10.1109/JETCAS.2018.2849939
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Non-traditional processing schemes continue to grow in popularity as a means to achieve high performance with greater energy-efficiency. Data-centric processing is one such scheme that targets functional-specialization and memory bandwidth limitations, opening up small processors to wide memory IO. These functional-specific accelerators prove to be an essential component to achieve energy-efficiency and performance, but purely application-specific integrated circuit accelerators have expensive design overheads with limited reusability. We propose an architecture that combines existing processing schemes utilizing CGRAs for dynamic data path configuration as a means to add flexibility and reusability to data-centric acceleration. While flexibility adds a large energy overhead, performance can be regained through intelligent mappings to the CGRA for the functions of interest, while reusability can he gained through incrementally adding general purpose functionality to the processing elements. Building upon previous work accelerating sparse encoded neural networks, we present a CGRA architecture for mapping functional accelerators operating at 500 MHz in 32 nm. This architecture achieves a latency-per-function within 2x of its function-specific counterparts with energy-per-operation increases between 21-188 x, and energy-per-area increases between 1.8-3.6x.
引用
收藏
页码:531 / 542
页数:12
相关论文
共 50 条
  • [41] Application-specific coarse-grained reconfigurable array: architecture and design methodology
    Zhou, Li
    Liu, Dongpei
    Zhang, Jianfeng
    Liu, Hengzhu
    INTERNATIONAL JOURNAL OF ELECTRONICS, 2015, 102 (06) : 897 - 910
  • [42] Application-specific workload shaping in multimedia-enabled personal mobile devices
    Raman, Balaji
    Chakraborty, Samarjit
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2008, 7 (02)
  • [43] Mobile Ecosystem Driven Application-Specific Low-Power Control Microarchitecture
    Bournoutian, Garo
    Orailoglu, Alex
    2015 33RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD), 2015, : 720 - 727
  • [44] Automatic Design of Application-Specific Reconfigurable Processor Extensions with UPaK Synthesis Kernel
    Wolinski, Christophe
    Kuchcinski, Krzysztof
    Raffin, Erwan
    ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2009, 15 (01)
  • [45] Automatic Tool-Flow for Mapping Applications to an Application-Specific CGRA Architecture
    Fricke, Florian
    Werner, Andre
    Shahin, Keyvan
    Werner, Florian
    Huebner, Michael
    2019 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2019, : 147 - 154
  • [46] Design application-specific digital controllers using field programmable gate arrays
    Niu, JY
    Fleming, PJ
    Thompson, HA
    INTELLIGENT CONTROL SYSTEMS AND SIGNAL PROCESSING 2003, 2003, : 359 - 364
  • [47] Optimizing the H.264/AVC Video Encoder Application Structure for Reconfigurable and Application-Specific Platforms
    Shafique, Muhammad
    Bauer, Lars
    Henkel, Joerg
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2010, 60 (02): : 183 - 210
  • [48] Optimizing the H.264/AVC Video Encoder Application Structure for Reconfigurable and Application-Specific Platforms
    Muhammad Shafique
    Lars Bauer
    Jörg Henkel
    Journal of Signal Processing Systems, 2010, 60 : 183 - 210
  • [49] Application-Specific Service Technologies for Commodity Operating Systems in Real-Time Environments
    West, Richard
    Parmer, Gabriel
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2011, 10 (03)
  • [50] An Application Specific Vector Processor for Efficient Massive MIMO Processing
    Attari, Mohammad
    Ferreira, Lucas
    Liu, Liang
    Malkowsky, Steffen
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2022, 69 (09) : 3804 - 3815