CMDS: Cross-layer Dataflow Optimization for DNN Accelerators Exploiting Multi-bank Memories

被引:1
|
作者
Shi, Man [1 ]
Colleman, Steven [1 ]
VanDeMieroop, Charlotte [1 ]
Joseph, Antony [2 ]
Meijer, Maurice [2 ]
Dehaene, Wim [1 ]
Verhelst, Marian [1 ]
机构
[1] Katholieke Univ Leuven, MICAS ESAT, Leuven, Belgium
[2] NXP Semicond, Bengaluru, India
基金
欧洲研究理事会;
关键词
Deep neural networks; Cross-layer; Data Layout; Dataflow Optimization; PROCESSOR;
D O I
10.1109/ISQED57927.2023.10129330
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Deep neural networks (DNN) use a wide range of network topologies to achieve high accuracy within diverse applications. This model diversity makes it impossible to identify a single "dataflow" (execution schedule) to perform optimally across all possible layers and network topologies. Several frameworks support the exploration of the best dataflow for a given DNN layer and hardware. However, switching the dataflow from one layer to the next layer within one DNN model can result in hardware inefficiencies stemming from memory data layout mismatch among the layers. Unfortunately, all existing frameworks treat each layer independently and typically model memories as black boxes (one large monolithic wide memory), which ignores the data layout and can not deal with the data layout dependencies of sequential layers. These frameworks are not capable of doing dataflow cross-layer optimization. This work, hence, aims at cross-layer dataflow optimization, taking the data dependency and data layout reshuffling overheads among layers into account. Additionally, we propose to exploit the multi-bank memories typically present in modern DNN accelerators towards efficiently reshuffling data to support more dataflow at low overhead. These innovations are supported through the Cross-layer Memory-aware Dataflow Scheduler (CMDS). CMDS can model DNN execution energy/latency while considering the different data layout requirements due to the varied optimal dataflow of layers. Compared with the state-of-the-art (SOTA), which performs layer-optimized memory-unaware scheduling, CMDS achieves up to 5:5x energy reduction and 1:35x latency reduction with negligible hardware cost.
引用
收藏
页码:172 / 179
页数:8
相关论文
共 50 条
  • [21] A cross-layer protocol for exploiting cooperative diversity in multi-hop wireless ad hoc networks
    Wu, Qilin
    Zhou, Xianzhong
    Ge, Fangzhen
    WIRELESS NETWORKS, 2017, 23 (05) : 1591 - 1610
  • [22] Cross-Layer Optimization for Multi-Hop Wireless Networks With Successive Interference Cancellation
    Jiang, Canming
    Shi, Yi
    Qin, Xiaoqi
    Yuan, Xu
    Hou, Y. Thomas
    Lou, Wenjing
    Kompella, Sastry
    Midkiff, Scott F.
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2016, 15 (08) : 5819 - 5831
  • [23] Cross-layer Routing Optimization for Centralized Multi-hop Cognitive Radio Networks
    Salah, Ahmed
    Abd El-Atty, Reba
    Rizk, Rawya Y.
    2015 11TH INTERNATIONAL COMPUTER ENGINEERING CONFERENCE (ICENCO), 2015, : 25 - 31
  • [24] Cross-layer optimization of video stream in multi-channel wireless multihop networks
    Huang X.
    Feng S.-L.
    Ke F.
    Zhuang H.-C.
    Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2010, 38 (06): : 35 - 41
  • [25] Optimization on Distributed Cross-layer Design for MRMC Wireless Multi-hop Networks
    Li, Kewei
    Wang, Furong
    Xie, Xu
    Wang, Hao
    2009 5TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-8, 2009, : 2781 - 2784
  • [26] A fuzzy logic approach to cross-layer route optimization in multi-hop CRNs
    Murawski, Robert
    Ekici, Eylem
    Thomas, Ryan W.
    DEFENSE TRANSFORMATION AND NET-CENTRIC SYSTEMS 2010, 2010, 7707
  • [27] Cross-layer optimization for multi-camera streaming over ad hoc networks
    Yang, Yixin
    Sun, Lifeng
    Liang, Ke
    Zhong, Yuzhou
    PROCEEDINGS OF THE 10TH IASTED INTERNATIONAL CONFERENCE ON INTERNET AND MULTIMEDIA SYSTEMS AND APPLICATIONS, 2006, : 188 - +
  • [28] Design and Optimization of a Cross-Layer Routing Protocol for Multi-Hop Wireless Sensor Networks
    Gao, Fei
    Wen, Hongli
    Zhao, Lifen
    Chen, Yuebin
    2013 INTERNATIONAL CONFERENCE ON SENSOR NETWORK SECURITY TECHNOLOGY AND PRIVACY COMMUNICATION SYSTEM (SNS & PCS), 2013, : 5 - 8
  • [29] Dynamic Session Control Over IMS for Cross-Layer Optimization of Multi-Stream Video
    Hyungkeuk Lee
    Suyoung Park
    Sanghoon Lee
    Wireless Personal Communications, 2009, 51 : 517 - 534
  • [30] A Cross-layer Optimization for a LP-based Multi-Reader Coordination in RFID Systems
    Choi, Jinchul
    Lee, Cheawoo
    2010 IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE GLOBECOM 2010, 2010,