CMDS: Cross-layer Dataflow Optimization for DNN Accelerators Exploiting Multi-bank Memories

Cited by: 1
Authors
Shi, Man [1]
Colleman, Steven [1]
VanDeMieroop, Charlotte [1]
Joseph, Antony [2]
Meijer, Maurice [2]
Dehaene, Wim [1]
Verhelst, Marian [1]
Affiliations
[1] Katholieke Univ Leuven, MICAS ESAT, Leuven, Belgium
[2] NXP Semicond, Bengaluru, India
Funding
European Research Council
Keywords
Deep neural networks; Cross-layer; Data Layout; Dataflow Optimization; PROCESSOR;
DOI
10.1109/ISQED57927.2023.10129330
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Deep neural networks (DNNs) use a wide range of network topologies to achieve high accuracy across diverse applications. This model diversity makes it impossible to identify a single "dataflow" (execution schedule) that performs optimally across all possible layers and network topologies. Several frameworks support exploring the best dataflow for a given DNN layer and hardware. However, switching the dataflow from one layer to the next within one DNN model can cause hardware inefficiencies stemming from memory data layout mismatches between the layers. Existing frameworks treat each layer independently and typically model memories as black boxes (one large, monolithic, wide memory), which ignores the data layout and cannot handle the data layout dependencies of sequential layers. These frameworks are therefore not capable of cross-layer dataflow optimization. This work aims at cross-layer dataflow optimization, taking the data dependencies and data layout reshuffling overheads between layers into account. Additionally, we propose exploiting the multi-bank memories typically present in modern DNN accelerators to reshuffle data efficiently and thereby support more dataflows at low overhead. These innovations are supported by the Cross-layer Memory-aware Dataflow Scheduler (CMDS). CMDS can model DNN execution energy and latency while considering the different data layout requirements that arise from the varied optimal dataflows of the layers. Compared with the state of the art (SOTA), which performs layer-optimized, memory-unaware scheduling, CMDS achieves up to 5.5x energy reduction and 1.35x latency reduction with negligible hardware cost.
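To make the contrast between layer-by-layer and cross-layer scheduling concrete, the following is a minimal sketch and not the CMDS algorithm itself. It assumes every layer has a known cost per candidate dataflow and that switching dataflows between consecutive layers incurs one fixed reshuffling penalty. The function names, the dataflow labels "A" and "B", and all cost values are hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Tuple


def layer_wise_schedule(layer_costs: List[Dict[str, float]],
                        reshuffle_cost: float) -> Tuple[float, List[str]]:
    """Pick the cheapest dataflow for each layer independently, then add
    the reshuffling penalty for every switch this choice causes."""
    picks = [min(costs, key=costs.get) for costs in layer_costs]
    total = sum(costs[d] for costs, d in zip(layer_costs, picks))
    switches = sum(1 for a, b in zip(picks, picks[1:]) if a != b)
    return total + reshuffle_cost * switches, picks


def cross_layer_schedule(layer_costs: List[Dict[str, float]],
                         reshuffle_cost: float) -> Tuple[float, List[str]]:
    """Dynamic programming over layers: best[d] holds the cheapest total
    cost (and the dataflow sequence) of a schedule ending in dataflow d."""
    best = {d: (c, [d]) for d, c in layer_costs[0].items()}
    for costs in layer_costs[1:]:
        nxt = {}
        for d, c in costs.items():
            candidates = []
            for prev, (prev_cost, path) in best.items():
                # Assumption: a layout mismatch costs a fixed penalty.
                penalty = 0.0 if prev == d else reshuffle_cost
                candidates.append((prev_cost + penalty + c, path + [d]))
            nxt[d] = min(candidates, key=lambda t: t[0])
        best = nxt
    return min(best.values(), key=lambda t: t[0])


# Hypothetical per-layer costs (arbitrary units) for two dataflows.
example_costs = [
    {"A": 1.0, "B": 3.0},
    {"A": 4.0, "B": 3.5},
    {"A": 1.0, "B": 3.0},
]
print(layer_wise_schedule(example_costs, reshuffle_cost=2.0))
print(cross_layer_schedule(example_costs, reshuffle_cost=2.0))
```

In this toy setting the layer-wise choice switches dataflow twice and pays the penalty both times, while the cross-layer search keeps one dataflow throughout because the saved reshuffling outweighs the slightly worse middle layer. A real scheduler would replace the fixed penalty with a layout-dependent reshuffling model.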
Pages: 172-179
Number of pages: 8