A High-Performance and Energy-Efficient Photonic Architecture for Multi-DNN Acceleration

被引:0
|
作者
Li, Yuan [1 ]
Louri, Ahmed [1 ]
Karanth, Avinash [2 ]
机构
[1] George Washington Univ, Dept Elect & Comp Engn, Washington, DC 20052 USA
[2] Ohio Univ, Sch Elect Engn & Comp Sci, Athens, OH 45701 USA
基金
美国国家科学基金会;
关键词
Accelerator; dataflow; deep neural network; silicon photonics;
D O I
10.1109/TPDS.2023.3327535
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Large-scale deep neural network (DNN) accelerators are poised to facilitate the concurrent processing of diverse DNNs, imposing demanding challenges on the interconnection fabric. These challenges encompass overcoming performance degradation and energy increase associated with system scaling while also necessitating flexibility to support dynamic partitioning and adaptable organization of compute resources. Nevertheless, conventional metallic-based interconnects frequently confront inherent limitations in scalability and flexibility. In this paper, we leverage silicon photonic interconnects and adopt an algorithm-architecture co-design approach to develop MDA, a DNN accelerator meticulously crafted to empower high-performance and energy-efficient concurrent processing of diverse DNNs. Specifically, MDA consists of three novel components: 1) a resource allocation algorithm that assigns compute resources to concurrent DNNs based on their computational demands and priorities; 2) a dataflow selection algorithm that determines off-chip and on-chip dataflows for each DNN, with the objectives of minimizing off-chip and on-chip memory accesses, respectively; 3) a flexible silicon photonic network that can be dynamically segmented into sub-networks, each interconnecting the assigned compute resources of a certain DNN while adapting to the communication patterns dictated by the selected on-chip dataflow. Simulation results show that the proposed MDA accelerator outperforms other state-of-the-art multi-DNN accelerators, including PREMA, AI-MT, Planaria, and HDA. MDA accelerator achieves a speedup of 3.6, accompanied by substantial improvements of 7.3x, 12.7x, and 9.2x in energy efficiency, service-level agreement (SLA) satisfaction rate, and fairness, respectively.
引用
收藏
页码:46 / 58
页数:13
相关论文
共 50 条
  • [31] Survey of Novel Architectures for Energy Efficient High-Performance Mobile Computing Platforms
    O'Connor, Owen
    Elfouly, Tarek
    Alouani, Ali
    ENERGIES, 2023, 16 (16)
  • [32] High-performance Reconfigurable DNN Accelerator on a Bandwidth-limited Embedded System
    Hu, Xianghong
    Huang, Hongmin
    Li, Xueming
    Zheng, Xin
    Ren, Qinyuan
    He, Jingyu
    Xiong, Xiaoming
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2023, 22 (06)
  • [33] Two-layer integrated photonic architectures with multiport photodetectors for high-fidelity and energy-efficient matrix multiplications
    Tang, Rui
    Okano, Makoto
    Toprasertpong, Kasidit
    Takagi, Shinichi
    Englund, Dirk
    Takenaka, Mitsuru
    OPTICS EXPRESS, 2022, 30 (19) : 33940 - 33954
  • [34] An energy-efficient coarse grained spatial architecture for convolutional neural networks AlexNet
    Zhao, Boya
    Wang, Mingjiang
    Liu, Ming
    IEICE ELECTRONICS EXPRESS, 2017, 14 (15):
  • [35] A Heterogeneous and Reconfigurable Embedded Architecture for Energy-Efficient Execution of Convolutional Neural Networks
    Luebeck, Konstantin
    Bringmann, Oliver
    ARCHITECTURE OF COMPUTING SYSTEMS - ARCS 2019, 2019, 11479 : 267 - 280
  • [36] Hybrid and Heterogeneous Photonic Integrated Circuits for High-Performance Applications
    Heck, Martijn J. R.
    INTEGRATED OPTICS: DEVICES, MATERIALS, AND TECHNOLOGIES XIX, 2015, 9365
  • [37] HeTraX: Energy Efficient 3D Heterogeneous Manycore Architecture for Transformer Acceleration
    Dhingra, Pratyush
    Doppa, Janardhan Rao
    Pande, Partha Pratim
    PROCEEDINGS OF THE 29TH ACM/IEEE INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN, ISLPED 2024, 2024,
  • [38] Large-Scale Integrated Photonic Device Platform for Energy-Efficient AI/ML Accelerators
    Tossoun, Bassem
    Xiao, Xian
    Cheung, Stanley
    Yuan, Yuan
    Peng, Yiwei
    Srinivasan, Sudharsanan
    Giamougiannis, George
    Huang, Zhihong
    Singaraju, Prerana
    London, Yanir
    Hejda, Matej
    Sundararajan, Sri Priya
    Hu, Yingtao
    Gong, Zheng
    Baek, Jongseo
    Descos, Antoine
    Kapusta, Morten
    Bohm, Fabian
    Van Vaerenbergh, Thomas
    Fiorentino, Marco
    Kurczveil, Geza
    Liang, Di
    Beausoleil, Raymond G.
    IEEE JOURNAL OF SELECTED TOPICS IN QUANTUM ELECTRONICS, 2025, 31 (03)
  • [39] Silicon Photonic 2.5D Multi-Chip Module Transceiver for High-Performance Data Centers
    Abrams, Nathan C.
    Cheng, Qixiang
    Glick, Madeleine
    Jezzini, Moises
    Morrissey, Padraic
    O'Brien, Peter
    Bergman, Keren
    JOURNAL OF LIGHTWAVE TECHNOLOGY, 2020, 38 (13) : 3346 - 3357
  • [40] Energy-Efficient Photonics in Future High-Connectivity Computing Systems
    Krishnamoorthy, A. V.
    Schwetman, H.
    Zheng, X.
    Ho, R.
    JOURNAL OF LIGHTWAVE TECHNOLOGY, 2015, 33 (04) : 889 - 900