Mixed precision quantization of silicon optical neural network chip

Cited by: 1
Authors
Zhang, Ye [1]
Wang, Ruiting [2, 3]
Zhang, Yejin [2, 3]
Pan, Jiaoqing [2, 3]
Affiliations
[1] Beijing Informat Sci & Technol Univ, Beijing 100192, Peoples R China
[2] Chinese Acad Sci, Key Lab Semicond Mat Sci, Inst Semicond, Beijing 100083, Peoples R China
[3] Univ Chinese Acad Sci, Ctr Mat Sci & Optoelect Engn, Beijing 100049, Peoples R China
Funding
National Natural Science Foundation of China; Beijing Natural Science Foundation
Keywords
GENETIC ALGORITHM; END;
DOI
10.1016/j.optcom.2024.131231
Chinese Library Classification number
O43 [Optics]
Discipline classification code
070207; 0803
Abstract
In recent years, neural network research has advanced remarkably across many domains. One emerging approach is the integration of photonic computing, which leverages the unique properties of light for ultra-fast information processing. In this article, we establish a mixed-precision quantization model for silicon-based optical neural networks and evaluate its performance on the MNIST and Fashion-MNIST datasets. Through a genetic-algorithm-based optimization process, we achieve significant parameter compression while maintaining competitive accuracy. Our findings demonstrate that, with an average quantization bitwidth of 4.5 bits on the MNIST dataset, parameter size is reduced by 85.94% compared with traditional 32-bit networks, with an accuracy drop of only 0.65%. Similarly, on the Fashion-MNIST dataset, an average quantization bitwidth of 5.67 bits yields an 82.28% reduction in parameter size with an accuracy drop of 0.8%.
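The reported reductions can be checked with simple arithmetic. The sketch below is not taken from the paper: it assumes parameter storage scales linearly with the average bitwidth, so the reduction relative to a 32-bit baseline is 1 - (average bits / 32). The function name `parameter_reduction` is hypothetical, introduced only for illustration.

```python
def parameter_reduction(avg_bits: float, baseline_bits: float = 32.0) -> float:
    """Fractional storage saved versus a baseline bitwidth.

    Assumption (not stated in the abstract): every parameter costs
    `avg_bits` on average, so storage is proportional to the bitwidth.
    """
    if avg_bits <= 0 or baseline_bits <= 0:
        raise ValueError("bitwidths must be positive")
    return 1.0 - avg_bits / baseline_bits


# Average bitwidths quoted in the abstract.
print(f"MNIST:         {parameter_reduction(4.5):.2%}")
print(f"Fashion-MNIST: {parameter_reduction(5.67):.2%}")
```

Under that assumption, both results agree with the figures quoted in the abstract (85.94% and 82.28%).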
Pages: 9