Multichannel meta-imagers for accelerating machine vision

被引：32

作者：

Zheng, Hanyu ^{[1
]}

Liu, Quan ^{[2
]}

Kravchenko, Ivan I. ^{[3
]}

Zhang, Xiaomeng ^{[4
]}

Huo, Yuankai ^{[2
]}

Valentine, Jason G. ^{[4
]}

机构：

[1] Vanderbilt Univ, Dept Elect & Comp Engn, Nashville, TN USA

[2] Vanderbilt Univ, Dept Comp Sci, Nashville, TN USA

[3] Oak Ridge Natl Lab, Ctr Nanophase Mat Sci, Oak Ridge, TN USA

[4] Vanderbilt Univ, Dept Mech Engn, Nashville, TN 37235 USA

来源：

NATURE NANOTECHNOLOGY | 2024年 / 19卷 / 04期

关键词：

ARTIFICIAL-INTELLIGENCE;

D O I：

10.1038/s41565-023-01557-2

中图分类号：

TB3 [工程材料学];

学科分类号：

0805 ; 080502 ;

摘要：

Rapid developments in machine vision technology have impacted a variety of applications, such as medical devices and autonomous driving systems. These achievements, however, typically necessitate digital neural networks with the downside of heavy computational requirements and consequent high energy consumption. As a result, real-time decision-making is hindered when computational resources are not readily accessible. Here we report a meta-imager designed to work together with a digital back end to offload computationally expensive convolution operations into high-speed, low-power optics. In this architecture, metasurfaces enable both angle and polarization multiplexing to create multiple information channels that perform positively and negatively valued convolution operations in a single shot. We use our meta-imager for object classification, achieving 98.6% accuracy in handwritten digits and 88.8% accuracy in fashion images. Owing to its compactness, high speed and low power consumption, our approach could find a wide range of applications in artificial intelligence and machine vision applications. A metasurface-based approach is used to implement computationally expensive digital convolution operations in high-speed, low-power optics for improving the latency and power consumption of machine vision systems.

引用

页码：471 / 478

页数：9

共 60 条

[1] An on-chip photonic deep neural network for image classification
Ashtiani, Farshid
Geers, Alexander J.
Aflatouni, Firooz
[J]. NATURE, 2022, 606 (7914) : 501 - +
[2] Computation at the speed of light: metamaterials for all-optical calculations and neural networks
Badloe, Trevon
Lee, Seokho
Rho, Junsuk
[J]. ADVANCED PHOTONICS, 2022, 4 (06):
[3] A General and Adaptive Robust Loss Function
Barron, Jonathan T.
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 4326 - 4334
[4] Single-shot optical neural network
Bernstein, Liane
Sludds, Alexander
Panuski, Christopher
Trajtenberg-Mills, Sivan
Hamerly, Ryan
Englund, Dirk
[J]. SCIENCE ADVANCES, 2023, 9 (25)
[5] Hybrid optical-electronic convolutional neural networks with optimized diffractive optics for image classification
Chang, Julie
Sitzmann, Vincent
Dun, Xiong
Heidrich, Wolfgang
Wetzstein, Gordon
[J]. SCIENTIFIC REPORTS, 2018, 8
[6] Chen YH, 2016, ISSCC DIG TECH PAP I, V59, P262, DOI 10.1109/ISSCC.2016.7418007
[7] Optical frontend for a convolutions neural network
Colburn, Shane
Chu, Yi
Shilzerman, Eli
Majumdar, Arka
[J]. APPLIED OPTICS, 2019, 58 (12) : 3179 - 3186
[8] High-Index Dielectric Metasurfaces Performing Mathematical Operations
Cordaro, Andrea
Kwon, Hoyeong
Sounas, Dimitrios
Koenderink, A. Femius
Alu, Andrea
Polman, Albert
[J]. NANO LETTERS, 2019, 19 (12) : 8148 - 8423
[9] Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs
Ding, Xiaohan
Zhang, Xiangyu
Han, Jungong
Ding, Guiguang
[J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11953 - 11965
[10] RepVGG: Making VGG-style ConvNets Great Again
Ding, Xiaohan
Zhang, Xiangyu
Ma, Ningning
Han, Jungong
Ding, Guiguang
Sun, Jian
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 13728 - 13737

← 1 2 3 4 5 6 →