CMDS: Cross-layer Dataflow Optimization for DNN Accelerators Exploiting Multi-bank Memories

被引:1
|
作者
Shi, Man [1 ]
Colleman, Steven [1 ]
VanDeMieroop, Charlotte [1 ]
Joseph, Antony [2 ]
Meijer, Maurice [2 ]
Dehaene, Wim [1 ]
Verhelst, Marian [1 ]
机构
[1] Katholieke Univ Leuven, MICAS ESAT, Leuven, Belgium
[2] NXP Semicond, Bengaluru, India
基金
欧洲研究理事会;
关键词
Deep neural networks; Cross-layer; Data Layout; Dataflow Optimization; PROCESSOR;
D O I
10.1109/ISQED57927.2023.10129330
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Deep neural networks (DNN) use a wide range of network topologies to achieve high accuracy within diverse applications. This model diversity makes it impossible to identify a single "dataflow" (execution schedule) to perform optimally across all possible layers and network topologies. Several frameworks support the exploration of the best dataflow for a given DNN layer and hardware. However, switching the dataflow from one layer to the next layer within one DNN model can result in hardware inefficiencies stemming from memory data layout mismatch among the layers. Unfortunately, all existing frameworks treat each layer independently and typically model memories as black boxes (one large monolithic wide memory), which ignores the data layout and can not deal with the data layout dependencies of sequential layers. These frameworks are not capable of doing dataflow cross-layer optimization. This work, hence, aims at cross-layer dataflow optimization, taking the data dependency and data layout reshuffling overheads among layers into account. Additionally, we propose to exploit the multi-bank memories typically present in modern DNN accelerators towards efficiently reshuffling data to support more dataflow at low overhead. These innovations are supported through the Cross-layer Memory-aware Dataflow Scheduler (CMDS). CMDS can model DNN execution energy/latency while considering the different data layout requirements due to the varied optimal dataflow of layers. Compared with the state-of-the-art (SOTA), which performs layer-optimized memory-unaware scheduling, CMDS achieves up to 5:5x energy reduction and 1:35x latency reduction with negligible hardware cost.
引用
收藏
页码:172 / 179
页数:8
相关论文
共 50 条
  • [31] Cross-layer Optimization for Wireless Mesh Networks with Multi-antenna Beam-forming
    Yazdanpanah, Mina
    Assi, Chadi
    Shayan, Yousef
    2010 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC 2010), 2010,
  • [32] Cross-layer optimization for dynamic rate allocation in a multi-user video streaming system
    Yaacoub, C.
    Farah, J.
    Rachkidy, N.
    Pesquet-Popescu, B.
    NEW TECHNOLOGIES, MOBILITY AND SECURITY, 2007, : 619 - 619
  • [33] Cross-layer optimization of reliable transmissions over IEEE 802.11 multi-hop networks
    Catalan, Marisa
    Calveras, Anna
    Galvez, Sergio
    2006 3RD INTERNATIONAL SYMPOSIUM ON WIRELESS COMMUNICATION SYSTEMS, VOLS 1-2, 2006, : 650 - +
  • [34] Dynamic Session Control Over IMS for Cross-Layer Optimization of Multi-Stream Video
    Lee, Hyungkeuk
    Park, Suyoung
    Lee, Sanghoon
    WIRELESS PERSONAL COMMUNICATIONS, 2009, 51 (03) : 517 - 534
  • [35] DC Programming and DCA Based Cross-Layer Optimization in Multi-hop TDMA Networks
    Hoai An Le Thi
    Quang Thuan Nguyen
    Khoa Tran Phan
    Tao Pham Dinh
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS (ACIIDS 2013), PT II, 2013, 7803 : 398 - 408
  • [36] Cross-layer rate control and power allocation optimization in multi-channel wireless networks
    Department of Electronics and Information Engineering, Huazhong University of Science and Technology, Wuhan, Hubei 430074, China
    Tien Tzu Hsueh Pao, 2009, 6 (1203-1209):
  • [37] Distributed Stochastic Cross-layer Optimization for Multi-hop Wireless Networks with Cooperative Communications
    Yang, Shusen
    Sheng, Zhengguo
    McCann, Julie A.
    Leung, Kin K.
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2014, 13 (10) : 2269 - 2282
  • [38] Cooperative multi-hop wireless sensor-actuator networks: Exploiting actuator cooperation and cross-layer optimizations
    Munir, Muhammad Farukh
    Papadogiannis, Agisilaos
    Filali, Fethi
    WCNC 2008: IEEE WIRELESS COMMUNICATIONS & NETWORKING CONFERENCE, VOLS 1-7, 2008, : 2881 - +
  • [39] NUM-Based Distributed Cross-Layer Optimization Algorithm for Multi-Hop Satellite Networks
    Wang, Changlong
    Chen, Jianzhou
    Zhao, Pei
    Liu, Lixiang
    PROCESSING OF 2014 INTERNATIONAL CONFERENCE ON MULTISENSOR FUSION AND INFORMATION INTEGRATION FOR INTELLIGENT SYSTEMS (MFI), 2014,
  • [40] Cross-Layer Optimization for Multi-View Video Streaming Service over Wireless Broadband Internet
    Son, Junghyun
    Suh, Doug Young
    Park, Gwang-Hoon
    Kim, Kyuheon
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2011, E94B (07) : 2104 - 2113