Real-Valued Optical Matrix Computing with Simplified MZI Mesh

被引:6
|
作者
Wu, Bo [1 ]
Liu, Shaojie [1 ]
Cheng, Junwei [1 ]
Dong, Wenchan [1 ]
Zhou, Hailong [1 ]
Dong, Jianji [1 ,2 ]
Li, Ming [3 ]
Zhang, Xinliang [1 ,2 ]
机构
[1] Huazhong Univ Sci & Technol, Wuhan Natl Lab Optoelect, Wuhan 430074, Peoples R China
[2] Opt Valley Lab, Wuhan 430074, Peoples R China
[3] Chinese Acad Sci, Inst Semicond, State Key Lab Integrated Optoelect, Beijing 100083, Peoples R China
来源
INTELLIGENT COMPUTING | 2023年 / 2卷
基金
中国国家自然科学基金;
关键词
NEURAL-NETWORKS; DESIGN;
D O I
10.34133/icomputing.0047
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Mach-Zehnder interferometer (MZI) mesh, a mainstream structure for optical matrix-vector multiplication (MVM), has been widely employed in recently developed optical neural networks (ONNs) and combination optimization problem solvers. The conventional MZI mesh was designed specifically for complex-valued optical MVM. The network includes 2N2 phase shifters, and coherent detection is indispensable for retrieving the output complex-valued vectors. Nonetheless, the majority of applications, including ONNs, merely require real-valued optical matrices with N2 degrees of freedom (DOFs). The DOF gap between the 2 types of matrices results in a severe redundancy in the number of phase shifters when the conventional MZI mesh is applied to implement real-valued optical MVM. In this study, we propose a simplified MZI mesh for performing real-valued incoherent optical MVM. It has N2 phase shifters and an optical depth of N + 1, and it outperforms the conventional MZI mesh. Furthermore, we constructed an ONN with the proposed MZI mesh and successfully performed the iris classification task via in situ training of particle swarm optimization. More importantly, we introduced a matched on-chip nonlinear activation function, so the proposed MZI mesh can be cascaded onto a single chip. Overall, the proposed real-valued MZI mesh and in situ training method are space efficient, energy efficient, scalable, and robust to fabrication errors. Therefore, they are suitable for large-scale ONNs.
引用
收藏
页数:8
相关论文
共 50 条