Neuromorphic Silicon Photonics and Hardware-Aware Deep Learning for High-Speed Inference

Cited by: 48
Authors
Moralis-Pegios, Miltiadis [1, 2]
Mourgias-Alexandris, George [1, 2]
Tsakyridis, Apostolos [1, 2]
Giamougiannis, George [1, 2]
Totovic, Angelina [1, 2]
Dabos, George [1, 2]
Passalis, Nikolaos [1, 2]
Kirtas, Manos [1, 2]
Rutirawut, T. [3]
Gardes, F. Y. [3]
Tefas, Anastasios [1, 2]
Pleros, Nikos [1, 2]
Affiliations
[1] Aristotle Univ Thessaloniki, Dept Informat, Thessaloniki 57001, Greece
[2] Aristotle Univ Thessaloniki, Ctr Interdisciplinary Res & Innovat, Thessaloniki 57001, Greece
[3] Univ Southampton, Optoelect Res Ctr, Southampton SO17 1BJ, Hants, England
Funding
European Union Horizon 2020
Keywords
Photonics; Adaptive optics; Neuromorphics; Optical modulation; Layout; Computer architecture; High-speed optical techniques; Neural networks; neuromorphic computing; neuromorphic photonics; optical neural network accelerators; TRANSMITTER; NEURON; CHIP;
DOI
10.1109/JLT.2022.3171831
Chinese Library Classification
TM [Electrical engineering]; TN [Electronic and communication technology]
Subject classification codes
0808 ; 0809 ;
Abstract
The relentless growth of Artificial Intelligence (AI) workloads has fueled the drive towards non-von Neumann architectures and custom computing hardware. Neuromorphic photonic engines aspire to synergize the low-power and high-bandwidth credentials of light-based deployments with novel architectures, towards surpassing the computing performance of their electronic counterparts. In this paper, we review recent progress in integrated photonic neuromorphic architectures and analyze the architectural and photonic hardware-based factors that limit their performance. Subsequently, we present our approach towards transforming silicon coherent neuromorphic layouts into high-speed and high-accuracy Deep Learning (DL) engines by combining robust architectures with hardware-aware DL training. Circuit robustness is ensured through a crossbar layout that circumvents insertion loss and fidelity constraints of state-of-the-art linear optical designs. Concurrently, we employ DL training models adapted to the underlying photonic hardware, incorporating noise and bandwidth limitations together with the supported activation function directly into Neural Network (NN) training. We experimentally validate the high-speed and high-accuracy advantages of hardware-aware DL models when combined with robust architectures through a SiPho prototype implementing a single column of a 4:4 photonic crossbar. This was utilized as the penultimate hidden layer of a NN, revealing up to 5.93% accuracy improvement at 5 GMAC/sec/axon when noise-aware training is enforced and allowing accuracies of 99.15% and 79.8% for the MNIST and CIFAR-10 classification tasks. Channel-aware training was then demonstrated by integrating the frequency response of the photonic hardware in NN training, with its experimental validation with the MNIST dataset revealing an accuracy increase of 12.93% at a record-high rate of 25 GMAC/sec/axon.
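The idea of noise-aware training mentioned in the abstract can be illustrated with a minimal sketch: noise is injected into each weighted sum during the forward pass, so that the weights learned by a standard optimizer come to tolerate it. Everything specific here is an assumption for illustration and not taken from the paper: the additive Gaussian noise model, the sigmoid used as a stand-in for the hardware-supported activation, and the hypothetical names `noisy_layer`, `noise_std`, `x`, and `W`.

```python
import math
import random


def noisy_layer(inputs, weights, noise_std, rng):
    """Forward pass of one dense layer with noise added to each weighted sum.

    Assumptions (not from the paper): additive Gaussian noise and a sigmoid
    placeholder standing in for the hardware-supported activation function.
    """
    outputs = []
    for row in weights:  # one row of weights per neuron
        z = sum(w * x for w, x in zip(row, inputs))  # multiply-accumulate
        z += rng.gauss(0.0, noise_std)  # injected noise (assumed model)
        outputs.append(1.0 / (1.0 + math.exp(-z)))  # placeholder activation
    return outputs


rng = random.Random(0)  # seeded so the sketch is reproducible
x = [0.2, 0.5, 0.1, 0.9]  # hypothetical inputs to a four-input neuron
W = [[0.4, -0.3, 0.8, 0.1]]  # hypothetical weights, single neuron

clean = noisy_layer(x, W, 0.0, rng)  # noise disabled
noisy = noisy_layer(x, W, 0.1, rng)  # noise enabled, as during training
```

In a training loop, the noisy forward pass would replace the clean one, with `noise_std` chosen to match noise measured on the target hardware; that calibration step is outside the scope of this sketch.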
Pages: 3243-3254
Page count: 12