Fully forward mode training for optical neural networks

被引:9
|
作者
Xue, Zhiwei [1 ,2 ,3 ,4 ]
Zhou, Tiankuang [1 ,2 ,3 ]
Xu, Zhihao [1 ,2 ,3 ,4 ]
Yu, Shaoliang [5 ]
Dai, Qionghai [2 ,3 ,6 ]
Fang, Lu [1 ,2 ,3 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China
[2] Tsinghua Univ, Beijing Natl Res Ctr Informat Sci & Technol, Beijing, Peoples R China
[3] Tsinghua Univ, Inst Brain & Cognit Sci, Beijing, Peoples R China
[4] Tsinghua Univ, Shenzhen Int Grad Sch, Shenzhen, Peoples R China
[5] Zhejiang Lab, Res Ctr Intelligent Optoelect Comp, Hangzhou, Peoples R China
[6] Tsinghua Univ, Dept Automat, Beijing, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
INVERSE DESIGN; ARTIFICIAL-INTELLIGENCE; BACKPROPAGATION; TIME;
D O I
10.1038/s41586-024-07687-4
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Optical computing promises to improve the speed and energy efficiency of machine learning applications1-6. However, current approaches to efficiently train these models are limited by in silico emulation on digital computers. Here we develop a method called fully forward mode (FFM) learning, which implements the compute-intensive training process on the physical system. The majority of the machine learning operations are thus efficiently conducted in parallel on site, alleviating numerical modelling constraints. In free-space and integrated photonics, we experimentally demonstrate optical systems with state-of-the-art performances for a given network size. FFM learning shows training the deepest optical neural networks with millions of parameters achieves accuracy equivalent to the ideal model. It supports all-optical focusing through scattering media with a resolution of the diffraction limit; it can also image in parallel the objects hidden outside the direct line of sight at over a kilohertz frame rate and can conduct all-optical processing with light intensity as weak as subphoton per pixel (5.40 x 1018- operations-per-second-per-watt energy efficiency) at room temperature. Furthermore, we prove that FFM learning can automatically search non-Hermitian exceptional points without an analytical model. FFM learning not only facilitates orders-of-magnitude-faster learning processes, but can also advance applied and theoretical fields such as deep neural networks, ultrasensitive perception and topological photonics. We present fully forward mode learning, which conducts machine learning operations on site, leading to faster learning and promoting advancement in numerous fields.
引用
收藏
页码:280 / 286
页数:17
相关论文
共 50 条
  • [41] Scientific Computing with Diffractive Optical Neural Networks
    Chen, Ruiyang
    Tang, Yingheng
    Ma, Jianzhu
    Gao, Weilu
    ADVANCED INTELLIGENT SYSTEMS, 2023, 5 (12)
  • [42] Role of all-optical neural networks
    Matuszewski, M.
    Prystupiuk, A.
    Opala, A.
    PHYSICAL REVIEW APPLIED, 2024, 21 (01)
  • [43] Lake Level Prediction using Feed Forward and Recurrent Neural Networks
    Hrnjica, Bahrudin
    Bonacci, Ognjen
    WATER RESOURCES MANAGEMENT, 2019, 33 (07) : 2471 - 2484
  • [44] Training Deep Neural Networks with Constrained Learning Parameters
    Date, Prasanna
    Carothers, Christopher D.
    Mitchell, John E.
    Hendler, James A.
    Magdon-Ismail, Malik
    2020 INTERNATIONAL CONFERENCE ON REBOOTING COMPUTING (ICRC 2020), 2020, : 107 - 115
  • [45] A Fully Analog Memristor-Based Neural Network with Online Gradient Training
    Rosenthal, Eyal
    Greshnikov, Sergey
    Soudry, Daniel
    Kvatinsky, Shahar
    2016 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2016, : 1394 - 1397
  • [46] On the Artificial Neural Networks used for the Forward Problem for the Electrical Impedance Equation
    Robles, C. M. A.
    Ponomaryov, V.
    Ramirez T, M. P.
    WORLD CONGRESS ON ENGINEERING AND COMPUTER SCIENCE, WCECS 2015, VOL I, 2015, : 86 - 91
  • [47] Multi-way backpropagation networks for training compact deep neural networks
    Guo, Yong
    Chen, Jian
    Du, Qing
    Van Den Hengel, Anton
    Shi, Qinfeng
    Tan, Mingkui
    NEURAL NETWORKS, 2020, 126 : 250 - 261
  • [48] MoS2 as Nonlinear Optical Material for Optical Neural Networks
    Teng, Caihong
    Zou, Jihua
    Tang, Xingyu
    Huang, Yixuan
    He, Weijie
    Du, Wen
    Luo, Lingzhi
    Ren, Aobo
    Wu, Jiang
    Wang, Zhiming
    IEEE JOURNAL OF SELECTED TOPICS IN QUANTUM ELECTRONICS, 2023, 29 (02)
  • [49] Design of a sliding mode control scheme using neural networks
    Lee, DW
    Cho, JH
    Kim, YT
    APPLICATIONS AND SCIENCE OF ARTIFICIAL NEURAL NETWORKS III, 1997, 3077 : 642 - 651
  • [50] Maximizing Fairness in Deep Neural Networks via Mode Connectivity
    Andreeva, Olga
    Almeida, Matthew
    Ding, Wei
    Crouter, Scott E.
    Chen, Ping
    IEEE INTELLIGENT SYSTEMS, 2022, 37 (03) : 36 - 44