Reconfigurable Activation Functions in Integrated Optical Neural Networks

被引:17
作者
Rausell Campo, Jose Roberto [1 ]
Perez-Lopez, Daniel [1 ,2 ]
机构
[1] Univ Politecn Valencia, Inst Telecommun & Multimedia Applicat ITEAM, Valencia 46022, Spain
[2] Univ Politecn Valencia, iPron Programmable Photon SL, Ed 9B, Valencia 46022, Spain
基金
欧盟地平线“2020”;
关键词
Optical interferometry; Nonlinear optics; Optical imaging; Optical modulation; Optical signal processing; Optical bistability; Adaptive optics; Complex-valued neural networks; electro-optic modulation; machine learning; nonlinear optics; optical activation functions; optical neural networks;
D O I
10.1109/JSTQE.2022.3169833
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The implementation of nonlinear activation functions is one of the key challenges that optical neural networks face. To the date, different approaches have been proposed, including switching to digital implementations, electro-optical or all optical. In this article, we compare the response of different electro-optic architectures where part of the input optical signal is converted into the electrical domain and used to self-phase modulate the intensity of the remaining optical signal. These architectures are made up of Mach Zehnder Interferometers (MZI) and microring resonators (MRR). We have compared the corresponding transfer functions with commonly used activation functions in state-of-the-art machine learning models and carried out an in-depth analysis of the capabilities of those architectures to generate the proposed activation functions. We demonstrate that a ring assisted MZI and a two-ring assisted MZI present the highest expressivity among the proposed structures. To the best of our knowledge, this is the first time that a quantified analysis of the capabilities of optical devices to mimic state-of-the-art activation functions is presented. The obtained activation functions are benchmarked on two machine learning examples: classification task using the Iris dataset, and image recognition using the MNIST dataset. We use complex-valued feed-forward neural networks and get test accuracies of 97% and 95% respectively.
引用
收藏
页数:13
相关论文
共 61 条
[1]  
[Anonymous], 2015, Tensor Flow: Largescale Machine Learning on Heterogeneous Systems Software
[2]  
[Anonymous], 2005, Proceedings of International Joint Conference on Neural Networks
[3]   Silicon-Germanium Avalanche Receivers With fJ/bit Energy Consumption [J].
Benedikovic, Daniel ;
Virot, Leopold ;
Aubin, Guy ;
Hartmann, Jean-Michel ;
Amar, Farah ;
Le Roux, Xavier ;
Alonso-Ramos, Carlos ;
Cassan, Eric ;
Marris-Morini, Delphine ;
Boeuf, Frederic ;
Fedeli, Jean-Marc ;
Szelag, Bertrand ;
Vivien, Laurent .
IEEE JOURNAL OF SELECTED TOPICS IN QUANTUM ELECTRONICS, 2022, 28 (02)
[4]  
Boeck R, 2011, Silicon Ring Resonator Add-Drop Multiplexers
[5]   Silicon microring resonators [J].
Bogaerts, Wim ;
De Heyn, Peter ;
Van Vaerenbergh, Thomas ;
De Vos, Katrien ;
Selvaraja, Shankar Kumar ;
Claes, Tom ;
Dumon, Pieter ;
Bienstman, Peter ;
Van Thourhout, Dries ;
Baets, Roel .
LASER & PHOTONICS REVIEWS, 2012, 6 (01) :47-73
[6]   Challenges in the Path Toward a Scalable Silicon Photonics Implementation of Deep Neural Networks [J].
Burgos, Carlos Mauricio Villegas ;
Vamivakas, Nickolas .
IEEE JOURNAL OF QUANTUM ELECTRONICS, 2019, 55 (05)
[7]  
Cabanillas J. M. Fargas, PROC OSA ADV PHOTON, V2021
[8]  
Clevert D.-A., 2016, C TRACK P, P1
[9]   Adaptive sigmoid-like and PReLU activation functions for all-optical perceptron [J].
Crnjanski, Jasna ;
Krstic, Marko ;
Totovic, Angelina ;
Pleros, Nikos ;
Gvozdic, Dejan .
OPTICS LETTERS, 2021, 46 (09) :2003-2006
[10]   Photonic Integrated Reconfigurable Linear Processors as Neural Network Accelerators [J].
De Marinis, Lorenzo ;
Cococcioni, Marco ;
Liboiron-Ladouceur, Odile ;
Contestabile, Giampiero ;
Castoldi, Piero ;
Andriolli, Nicola .
APPLIED SCIENCES-BASEL, 2021, 11 (13)