A High-Performance and Energy-Efficient Photonic Architecture for Multi-DNN Acceleration

被引:0
|
作者
Li, Yuan [1 ]
Louri, Ahmed [1 ]
Karanth, Avinash [2 ]
机构
[1] George Washington Univ, Dept Elect & Comp Engn, Washington, DC 20052 USA
[2] Ohio Univ, Sch Elect Engn & Comp Sci, Athens, OH 45701 USA
基金
美国国家科学基金会;
关键词
Accelerator; dataflow; deep neural network; silicon photonics;
D O I
10.1109/TPDS.2023.3327535
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Large-scale deep neural network (DNN) accelerators are poised to facilitate the concurrent processing of diverse DNNs, imposing demanding challenges on the interconnection fabric. These challenges encompass overcoming performance degradation and energy increase associated with system scaling while also necessitating flexibility to support dynamic partitioning and adaptable organization of compute resources. Nevertheless, conventional metallic-based interconnects frequently confront inherent limitations in scalability and flexibility. In this paper, we leverage silicon photonic interconnects and adopt an algorithm-architecture co-design approach to develop MDA, a DNN accelerator meticulously crafted to empower high-performance and energy-efficient concurrent processing of diverse DNNs. Specifically, MDA consists of three novel components: 1) a resource allocation algorithm that assigns compute resources to concurrent DNNs based on their computational demands and priorities; 2) a dataflow selection algorithm that determines off-chip and on-chip dataflows for each DNN, with the objectives of minimizing off-chip and on-chip memory accesses, respectively; 3) a flexible silicon photonic network that can be dynamically segmented into sub-networks, each interconnecting the assigned compute resources of a certain DNN while adapting to the communication patterns dictated by the selected on-chip dataflow. Simulation results show that the proposed MDA accelerator outperforms other state-of-the-art multi-DNN accelerators, including PREMA, AI-MT, Planaria, and HDA. MDA accelerator achieves a speedup of 3.6, accompanied by substantial improvements of 7.3x, 12.7x, and 9.2x in energy efficiency, service-level agreement (SLA) satisfaction rate, and fairness, respectively.
引用
收藏
页码:46 / 58
页数:13
相关论文
共 50 条
  • [41] High-Performance Fully Integrated Silicon Photonic Microwave Mixer Subsystems
    Bottenfield, Christian G.
    Ralph, Stephen E.
    JOURNAL OF LIGHTWAVE TECHNOLOGY, 2020, 38 (19) : 5536 - 5545
  • [42] Advanced Photonic BiCMOS Technology with high-performance Ge Photo Detectors
    Lischke, S.
    Knoll, D.
    Mai, C.
    Zimmermann, L.
    OPTICAL SENSING, IMAGING, AND PHOTON COUNTING: FROM X-RAYS TO THZ 2019, 2019, 11088
  • [43] High-Performance Passive Silicon Photonic Waveguide Devices: Progress and Challenges
    Liu Dajian
    Zhao Weike
    Zhang Long
    Song Lijia
    Guo Jingshu
    Xie Yiwei
    Li Huan
    Yu Zejie
    Liu Liu
    Shi Yaocheng
    Dai Daoxin
    ACTA OPTICA SINICA, 2022, 42 (17)
  • [44] TALIPOT: Energy-Efficient DNN Booster Employing Hybrid Bit Parallel-Serial Processing in MSB-First Fashion
    Karadeniz, Mahmut Burak
    Altun, Mustafa
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 41 (08) : 2714 - 2727
  • [45] Energy-Efficient Architecture for FPGA-based Deep Convolutional Neural Networks with Binary Weights
    Duan, Yunzhi
    Li, Shuai
    Zhang, Ruipeng
    Wang, Qi
    Chen, Jienan
    Sobelman, Gerald E.
    2018 IEEE 23RD INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2018,
  • [46] Enabling High-Performance DNN Inference Accelerators using Non-Volatile Analog Memory (Invited)
    Chen, An
    Ambrogio, Stefano
    Narayanan, Pritish
    Tsai, Hsinyu
    Mackin, Charles
    2020 IEEE ELECTRON DEVICES TECHNOLOGY AND MANUFACTURING CONFERENCE (EDTM 2020), 2020,
  • [47] TM-Pass Polarizer for Ultradense High-Performance Photonic Integrated Circuits
    Dhingra, Nikhil
    Dell'Olio, Francesco
    2020 IEEE PHOTONICS CONFERENCE (IPC), 2020,
  • [48] Ultralow-power silicon photonic interconnect for high-performance computing systems
    Li, Guoliang
    Zheng, Xuezhe
    Lexau, Jon
    Luo, Ying
    Thacker, Hiren
    Pinguet, Thierry
    Dong, Po
    Feng, Dazeng
    Liao, Shirong
    Shafiiha, Roshanak
    Asghari, Mehdi
    Yao, Jin
    Shi, Jing
    Shubin, Ivan N.
    Patil, Dinesh
    Liu, Frankie
    Raj, Kannan
    Ho, Ron
    Cunningham, John E.
    Krishnamoorthy, Ashok V.
    OPTOELECTRONIC INTERCONNECTS AND COMPONENT INTEGRATION IX, 2010, 7607
  • [49] High-Performance Winograd Based Accelerator Architecture for Convolutional Neural Network
    Vardhana, M.
    Pinto, Rohan
    IEEE COMPUTER ARCHITECTURE LETTERS, 2025, 24 (01) : 21 - 24
  • [50] Two-level reconfigurable architecture for high-performance signal processing
    Johnsson, D
    Bengtsson, J
    Svensson, B
    ERSA '04: THE 2004 INTERNATIONAL CONFERENCE ON ENGINEERING OF RECONFIGURABLE SYSTEMS AND ALGORITHMS, 2004, : 177 - 183