Multichannel meta-imagers for accelerating machine vision

被引:32
作者
Zheng, Hanyu [1 ]
Liu, Quan [2 ]
Kravchenko, Ivan I. [3 ]
Zhang, Xiaomeng [4 ]
Huo, Yuankai [2 ]
Valentine, Jason G. [4 ]
机构
[1] Vanderbilt Univ, Dept Elect & Comp Engn, Nashville, TN USA
[2] Vanderbilt Univ, Dept Comp Sci, Nashville, TN USA
[3] Oak Ridge Natl Lab, Ctr Nanophase Mat Sci, Oak Ridge, TN USA
[4] Vanderbilt Univ, Dept Mech Engn, Nashville, TN 37235 USA
关键词
ARTIFICIAL-INTELLIGENCE;
D O I
10.1038/s41565-023-01557-2
中图分类号
TB3 [工程材料学];
学科分类号
0805 ; 080502 ;
摘要
Rapid developments in machine vision technology have impacted a variety of applications, such as medical devices and autonomous driving systems. These achievements, however, typically necessitate digital neural networks with the downside of heavy computational requirements and consequent high energy consumption. As a result, real-time decision-making is hindered when computational resources are not readily accessible. Here we report a meta-imager designed to work together with a digital back end to offload computationally expensive convolution operations into high-speed, low-power optics. In this architecture, metasurfaces enable both angle and polarization multiplexing to create multiple information channels that perform positively and negatively valued convolution operations in a single shot. We use our meta-imager for object classification, achieving 98.6% accuracy in handwritten digits and 88.8% accuracy in fashion images. Owing to its compactness, high speed and low power consumption, our approach could find a wide range of applications in artificial intelligence and machine vision applications. A metasurface-based approach is used to implement computationally expensive digital convolution operations in high-speed, low-power optics for improving the latency and power consumption of machine vision systems.
引用
收藏
页码:471 / 478
页数:9
相关论文
共 60 条
  • [1] An on-chip photonic deep neural network for image classification
    Ashtiani, Farshid
    Geers, Alexander J.
    Aflatouni, Firooz
    [J]. NATURE, 2022, 606 (7914) : 501 - +
  • [2] Computation at the speed of light: metamaterials for all-optical calculations and neural networks
    Badloe, Trevon
    Lee, Seokho
    Rho, Junsuk
    [J]. ADVANCED PHOTONICS, 2022, 4 (06):
  • [3] A General and Adaptive Robust Loss Function
    Barron, Jonathan T.
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 4326 - 4334
  • [4] Single-shot optical neural network
    Bernstein, Liane
    Sludds, Alexander
    Panuski, Christopher
    Trajtenberg-Mills, Sivan
    Hamerly, Ryan
    Englund, Dirk
    [J]. SCIENCE ADVANCES, 2023, 9 (25)
  • [5] Hybrid optical-electronic convolutional neural networks with optimized diffractive optics for image classification
    Chang, Julie
    Sitzmann, Vincent
    Dun, Xiong
    Heidrich, Wolfgang
    Wetzstein, Gordon
    [J]. SCIENTIFIC REPORTS, 2018, 8
  • [6] Chen YH, 2016, ISSCC DIG TECH PAP I, V59, P262, DOI 10.1109/ISSCC.2016.7418007
  • [7] Optical frontend for a convolutions neural network
    Colburn, Shane
    Chu, Yi
    Shilzerman, Eli
    Majumdar, Arka
    [J]. APPLIED OPTICS, 2019, 58 (12) : 3179 - 3186
  • [8] High-Index Dielectric Metasurfaces Performing Mathematical Operations
    Cordaro, Andrea
    Kwon, Hoyeong
    Sounas, Dimitrios
    Koenderink, A. Femius
    Alu, Andrea
    Polman, Albert
    [J]. NANO LETTERS, 2019, 19 (12) : 8148 - 8423
  • [9] Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs
    Ding, Xiaohan
    Zhang, Xiangyu
    Han, Jungong
    Ding, Guiguang
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11953 - 11965
  • [10] RepVGG: Making VGG-style ConvNets Great Again
    Ding, Xiaohan
    Zhang, Xiangyu
    Ma, Ningning
    Han, Jungong
    Ding, Guiguang
    Sun, Jian
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 13728 - 13737