CMDS: Cross-layer Dataflow Optimization for DNN Accelerators Exploiting Multi-bank Memories

被引:1
|
作者
Shi, Man [1 ]
Colleman, Steven [1 ]
VanDeMieroop, Charlotte [1 ]
Joseph, Antony [2 ]
Meijer, Maurice [2 ]
Dehaene, Wim [1 ]
Verhelst, Marian [1 ]
机构
[1] Katholieke Univ Leuven, MICAS ESAT, Leuven, Belgium
[2] NXP Semicond, Bengaluru, India
基金
欧洲研究理事会;
关键词
Deep neural networks; Cross-layer; Data Layout; Dataflow Optimization; PROCESSOR;
D O I
10.1109/ISQED57927.2023.10129330
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Deep neural networks (DNN) use a wide range of network topologies to achieve high accuracy within diverse applications. This model diversity makes it impossible to identify a single "dataflow" (execution schedule) to perform optimally across all possible layers and network topologies. Several frameworks support the exploration of the best dataflow for a given DNN layer and hardware. However, switching the dataflow from one layer to the next layer within one DNN model can result in hardware inefficiencies stemming from memory data layout mismatch among the layers. Unfortunately, all existing frameworks treat each layer independently and typically model memories as black boxes (one large monolithic wide memory), which ignores the data layout and can not deal with the data layout dependencies of sequential layers. These frameworks are not capable of doing dataflow cross-layer optimization. This work, hence, aims at cross-layer dataflow optimization, taking the data dependency and data layout reshuffling overheads among layers into account. Additionally, we propose to exploit the multi-bank memories typically present in modern DNN accelerators towards efficiently reshuffling data to support more dataflow at low overhead. These innovations are supported through the Cross-layer Memory-aware Dataflow Scheduler (CMDS). CMDS can model DNN execution energy/latency while considering the different data layout requirements due to the varied optimal dataflow of layers. Compared with the state-of-the-art (SOTA), which performs layer-optimized memory-unaware scheduling, CMDS achieves up to 5:5x energy reduction and 1:35x latency reduction with negligible hardware cost.
引用
收藏
页码:172 / 179
页数:8
相关论文
共 50 条
  • [41] Adaptive Cross-Layer Resource Optimization in Heterogeneous Wireless Networks with Multi-Homing User Equipments
    Wu, Weihua
    Yang, Qinghai
    Li, Bingbing
    Kwak, Kyung Sup
    JOURNAL OF COMMUNICATIONS AND NETWORKS, 2016, 18 (05) : 784 - 795
  • [42] A Fast Convergent Cross-layer Resource Optimization Allocation Algorithm in Wireless Multi-hop Networks
    Feng Wei
    Xu Yongxin
    Liu Hao
    Xu Xiaorong
    Yao Yingbiao
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2019, 41 (05) : 1217 - 1224
  • [43] Mixed Timescale Cross-layer Optimization for Multi-antenna D2D Networks
    Liu, An
    Lau, Vincent
    Zhuang, Fuxin
    Chen, Junting
    2015 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2015,
  • [44] An MILP-Based Cross-Layer Optimization for a Multi-Reader Arbitration in the UHF RFID System
    Choi, Jinchul
    Lee, Chaewoo
    SENSORS, 2011, 11 (03) : 2347 - 2368
  • [45] MACRO: Multi-agent Reinforcement Learning-based Cross-layer Optimization of Operational Amplifier
    Chen, Zihao
    Meng, Songlei
    Yang, Fan
    Shang, Li
    Zeng, Xuan
    29TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC 2024, 2024, : 423 - 428
  • [46] Cross-layer optimization for LDPC-coded multi-rate multiuser systems with QoS constraints
    Li, Kai
    Wang, Xiaodong
    2006 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY, VOLS 1-6, PROCEEDINGS, 2006, : 2874 - +
  • [47] Cross-layer optimization of adaptive multi-rate wireless networks using truncated Chase combining HARQ
    Ramis, Jaume
    Femenias, Guillem
    Riera-Palou, Felip
    Carrasco, Loren
    2010 IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE GLOBECOM 2010, 2010,
  • [48] Cross-Layer Optimization Spatial Multi-Channel Directional Neighbor Discovery with Random Reply in mmWave FANET
    Song, Yifei
    Zeng, Liang
    Liu, Zeyu
    Song, Zhe
    Zeng, Jie
    An, Jianping
    ELECTRONICS, 2022, 11 (10)
  • [49] A Cross-layer Approach to the Downlink Performance Analysis and Optimization of Distributed Antenna Systems in Multi-cell Environments
    Samano-Robles, Ramiro
    Gameiro, Atilio
    2009 1ST INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATION, VEHICULAR TECHNOLOGY, INFORMATION THEORY AND AEROSPACE & ELECTRONIC SYSTEMS TECHNOLOGY, VOLS 1 AND 2, 2009, : 107 - 111
  • [50] A cross-layer optimization based integrated routing and grooming algorithm for green multi-granularity transport networks
    Wang, Xingwei
    Cheng, Hui
    Li, Keqin
    Li, Jie
    Sun, Jiajia
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2013, 73 (06) : 807 - 822