Probing shallower: perceptual loss trained Phase Extraction Neural Network (PLT-PhENN) for artifact-free reconstruction at low photon budget

被引：17

作者：

Deng, Mo ^{[1
]}

Goy, Alexandre ^{[2
,4
]}

Li, Shuai ^{[2
,5
]}

Arthur, Kwabena ^{[2
]}

Barbastathis, George ^{[2
,3
]}

机构：

[1] MIT, Dept Elect Engn & Comp Sci, 77 Massachusetts Ave, Cambridge, MA 02139 USA

[2] MIT, Dept Mech Engn, 77 Massachusetts Ave, Cambridge, MA 02139 USA

[3] Singapore MIT Alliance Res & Technol Smart Ctr, Singapore 117543, Singapore

[4] Omnisens SA, Riond Bosson 3, CH-1110 Morges, VD, Switzerland

[5] Sensebrain Technol Ltd LLC, 2550 N 1st St,Suite 300, San Jose, CA 95131 USA

来源：

OPTICS EXPRESS | 2020年 / 28卷 / 02期

关键词：

IMAGE; RETRIEVAL; RESOLUTION; ALGORITHM;

D O I：

10.1364/OE.381301

中图分类号：

O43 [光学];

学科分类号：

070207 ; 0803 ;

摘要：

Deep neural networks (DNNs) are efficient solvers for ill-posed problems and have been shown to outperform classical optimization techniques in several computational imaging problems. In supervised mode, DNNs are trained by minimizing a measure of the difference between their actual output and their desired output; the choice of measure, referred to as "loss function," severely impacts performance and generalization ability. In a recent paper [A. Goy et al., Phys. Rev. Lett. 121(24), 243902 (2018)], we showed that DNNs trained with the negative Pearson correlation coefficient (NPCC) as the loss function are particularly fit for photon-starved phase-retrieval problems, though the reconstructions are manifestly deficient at high spatial frequencies. In this paper, we show that reconstructions by DNNs trained with default feature loss (defined at VGG layer ReLU-22) contain more fine details; however, grid-like artifacts appear and are enhanced as photon counts become very low. Two additional key findings related to these artifacts are presented here. First, the frequency signature of the artifacts depends on the VGG's inner layer that perceptual loss is defined upon, halving with each MaxPooling2D layer deeper in the VGG. Second, VGG ReLU-12 outperforms all other layers as the defining layer for the perceptual loss. (C) 2020 Optical Society of America under the terms of the OSA Open Access Publishing Agreement

引用

页码：2511 / 2535

页数：25

共 60 条

[1]

[Anonymous], ARXIV170101687V1

[2]

[Anonymous], 2014, Comput. Sci.

[3]

[Anonymous], ABS180503593 CORR

[4]

[Anonymous], 2015, Tensor Flow: Largescale Machine Learning on Heterogeneous Systems Software

[5]

[Anonymous], 2019, P IEEE INT C COMP VI

[6]

[Anonymous], 2017, P IEEE C COMP VIS PA

[7]

[Anonymous], 2014, Neural Inform. Process. Syst

[8]

[Anonymous], ARXIV190711713

[9]

[Anonymous], 2006, 2006 IEEE COMP SOC C, DOI DOI 10.1109/CVPR.2006.142

[10]

Arjovsky M, 2017, PR MACH LEARN RES, V70

← 1 2 3 4 5 6 →