A High-Performance, Energy-Efficient Modular DMA Engine Architecture

被引:4
作者
Benz, Thomas [1 ]
Rogenmoser, Michael [1 ]
Scheffler, Paul [1 ]
Riedel, Samuel [1 ]
Ottaviano, Alessandro [1 ]
Kurth, Andreas [1 ]
Hoefler, Torsten [2 ]
Benini, Luca [3 ,4 ]
机构
[1] Swiss Fed Inst Technol, Integrated Syst Lab IIS, CH-8092 Zurich, Switzerland
[2] Swiss Fed Inst Technol, Scalable Parallel Comp Lab SPCL, CH-8092 Zurich, Switzerland
[3] Swiss Fed Inst Technol, Integrated Syst Lab IIS, Zurich, Switzerland
[4] Univ Bologna, Dept Elect Elect & Informat Engn DEI, I-40126 Bologna, Italy
关键词
DMA; DMAC; direct memory access; memory systems; high-performance; energy-efficiency; edge AI; AXI; TileLink;
D O I
10.1109/TC.2023.3329930
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Data transfers are essential in today's computing systems as latency and complex memory access patterns are increasingly challenging to manage. Direct memory access engines (DMAES) are critically needed to transfer data independently of the processing elements, hiding latency and achieving high throughput even for complex access patterns to high-latency memory. With the prevalence of heterogeneous systems, DMAEs must operate efficiently in increasingly diverse environments. This work proposes a modular and highly configurable open-source DMAE architecture called intelligent DMA (iDMA), split into three parts that can be composed and customized independently. The front-end implements the control plane binding to the surrounding system. The mid-end accelerates complex data transfer patterns such as multi-dimensional transfers, scattering, or gathering. The back-end interfaces with the on-chip communication fabric (data plane). We assess the efficiency of iDMA in various instantiations: In high-performance systems, we achieve speedups of up to 15.8$\boldsymbol{\times}$x with only 1% additional area compared to a base system without a DMAE. We achieve an area reduction of 10% while improving ML inference performance by 23% in ultra-low-energy edge AI systems over an existing DMAE solution. We provide area, timing, latency, and performance characterization to guide its instantiation in various systems.
引用
收藏
页码:263 / 277
页数:15
相关论文
共 50 条
  • [31] A Reconfigurable Spatial Architecture for Energy-Efficient Inception Neural Networks
    Luo, Lichuan
    Kang, Wang
    Liu, Junzhan
    Zhang, He
    Zhang, Youguang
    Liu, Dijun
    Ouyang, Peng
    IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2023, 13 (01) : 7 - 20
  • [32] Large-Memory Nodes for Energy Efficient High-Performance Computing
    Zivanovic, Darko
    Radulovic, Milan
    Llort, German
    Zaragoza, David
    Strassburg, Janko
    Carpenter, Paul M.
    Radojkovic, Petar
    Ayguade, Eduard
    MEMSYS 2016: PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON MEMORY SYSTEMS, 2016, : 3 - 9
  • [33] Design and Verification of a Scalable Enhanced High Performance DMA Architecture for Complex SoC
    Zhao, Hualong
    MECHATRONICS ENGINEERING, COMPUTING AND INFORMATION TECHNOLOGY, 2014, 556-562 : 4303 - 4308
  • [34] Energy-Efficient and High Performance Approximate Multiplier Using Compressors Based on Input Reordering
    Liu, Zhenhao
    Guo, Yi
    Sun, Xiaoting
    Kimura, Shinji
    PROCEEDINGS OF TENCON 2018 - 2018 IEEE REGION 10 CONFERENCE, 2018, : 0545 - 0550
  • [35] Thermal performance of energy-efficient buildings for sustainable development
    Dhanasingh Sivalinga Vijayan
    Arvindan Sivasuriyan
    Parthiban Patchamuthu
    Revathy jayaseelan
    Environmental Science and Pollution Research, 2022, 29 : 51130 - 51142
  • [36] Thermal performance of energy-efficient buildings for sustainable development
    Vijayan, Dhanasingh Sivalinga
    Sivasuriyan, Arvindan
    Patchamuthu, Parthiban
    Jayaseelan, Revathy
    ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2022, 29 (34) : 51130 - 51142
  • [37] Performance study of an energy-efficient display case refrigerator
    Tao, WH
    Huang, CM
    Hsu, CL
    Lin, JY
    CHEMICAL ENGINEERING COMMUNICATIONS, 2004, 191 (04) : 550 - 565
  • [38] Toward Performance and Energy-Efficient Edge-of-Things
    Akhunzada, Adnan
    Zeadally, Sherali
    ul Islam, Saif
    IT PROFESSIONAL, 2023, 25 (01) : 17 - 23
  • [39] Mining building performance data for energy-efficient operation
    Ahmed, Ammar
    Korres, Nicholas E.
    Ploennigs, Joern
    Elhadi, Haithum
    Menzel, Karsten
    ADVANCED ENGINEERING INFORMATICS, 2011, 25 (02) : 341 - 354
  • [40] HREN: A Hybrid Reliable and Energy-Efficient Network-on-Chip Architecture
    Bhamidipati, Padmaja
    Karanth, Avinash
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2022, 10 (02) : 537 - 548