A Hardware Instruction Generation Mechanism for Energy-Efficient Computational Memories

被引:0
|
作者
De La Fuente, Leo [1 ]
Christmann, Jean-Frederic [1 ]
Pezzin, Manuel [1 ]
Remars, Matthias [1 ]
Sentieys, Olivier [2 ]
机构
[1] Univ Grenoble Alpes, CEA, List, F-38000 Grenoble, France
[2] Univ Rennes, Inria, Rennes, France
关键词
near-memory computing; macro-instruction; matrix multiplication; GeMM; embedded systems;
D O I
10.1109/ISCAS58744.2024.10557870
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In the Computing-In-Memory (CIM) approach, computations are directly performed within the data storage unit, which often results in energy reduction. This makes it particularly well fitted for embedded systems, highly constrained in energy efficiency. It is commonly admitted that this energy reduction comes from less data transfers between the CPU and the main memory. Nevertheless, preparing and sending instructions to the computational memory also consumes energy and time, hence limiting overall performance. In this paper, we present a hardware instruction generation mechanism integrated in computational memories and evaluate its benefit for Integer General Matrix Multiplication (IGeMM) operations. The proposed mechanism is implemented in the computational memory controller and translates macro-instructions into corresponding micro-instructions needed to execute the kernel on stored data. We modified an existing near-memory computing architecture and extracted corresponding energy consumption figures using post-layout simulations for the complete SoC. Our proposed architecture, NEar memory computing Macro-Instruction Kernel Accelerator (NeMIKA), provides an 8.2x speed-up and a 4.6x energy consumption reduction compared to a state-of-the-art CIM accelerator based on micro-instructions, while inducing an area overhead of only 0.1%.
引用
收藏
页数:5
相关论文
共 50 条
  • [21] An energy-efficient partitioned instruction cache architecture for embedded processors
    Kim, CH
    Chung, SW
    Jhon, CS
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2006, E89D (04): : 1450 - 1458
  • [22] Loop Instruction Caching for Energy-Efficient Embedded Multitasking Processors
    Gu, Ji
    Ishihara, Tohru
    Lee, Kyungsoo
    2012 IEEE 10TH SYMPOSIUM ON EMBEDDED SYSTEMS FOR REAL-TIME MULTIMEDIA (ESTIMEDIA), 2012, : 97 - 106
  • [23] Energy-efficient instruction dispatch buffer design for superscalar processors
    Kucuk, G
    Ghose, K
    Ponomarev, DV
    Kogge, PM
    ISLPED'01: PROCEEDINGS OF THE 2001 INTERNATIONAL SYMPOSIUM ON LOWPOWER ELECTRONICS AND DESIGN, 2001, : 237 - 242
  • [24] Energy-efficient Hardware Acceleration of Shallow Machine Learning Applications
    Zeng, Ziqing
    Sapatnekar, Sachin S.
    2023 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2023,
  • [25] Domino: Graph Processing Services on Energy-efficient Hardware Accelerator
    Xu, Chongchong
    Wang, Chao
    Gong, Lei
    Jin, Lihui
    Li, Xi
    Zhou, Xuehai
    2018 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES (IEEE ICWS 2018), 2018, : 274 - 281
  • [26] Energy-Efficient Synchronous Counter Design with Minimum Hardware Overhead
    Katreepalli, Raghava
    Haniotakis, Themistoklis
    2017 INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), 2017, : 1423 - 1427
  • [27] An Energy-Efficient Hardware Accelerator for Hierarchical Deep Reinforcement Learning
    Shiri, Aidin
    Prakash, Bharat
    Mazumder, Arnab Neelim
    Waytowich, Nicholas R.
    Oates, Tim
    Mohsenin, Tinoosh
    2021 IEEE 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE CIRCUITS AND SYSTEMS (AICAS), 2021,
  • [28] Energy-Efficient Hardware Prefetching for CMPs using Heterogeneous Interconnects
    Flores, Antonio
    Aragon, Juan L.
    Acacio, Manuel E.
    PROCEEDINGS OF THE 18TH EUROMICRO CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING, 2010, : 147 - 154
  • [29] Energy-Efficient Deployment of Machine Learning Workloads on Neuromorphic Hardware
    Chandarana, Peyton
    Mohammadi, Mohammadreza
    Seekings, James
    Zand, Ramtin
    2022 IEEE 13TH INTERNATIONAL GREEN AND SUSTAINABLE COMPUTING CONFERENCE (IGSC), 2022, : 149 - 155
  • [30] Runtime Reconfigurable Hardware Accelerator for Energy-Efficient Transposed Convolutions
    Marrazzo, Emanuel
    Spagnolo, Fanny
    Perri, Stefania
    PRIME 2022: 17TH INTERNATIONAL CONFERENCE ON PHD RESEARCH IN MICROELECTRONICS AND ELECTRONICS, 2022, : 49 - 52