Adv-Depth: Self-Supervised Monocular Depth Estimation With an Adversarial Loss

被引：15

作者：

Li, Kunhong ^{[1
]}

Fu, Zhiheng ^{[2
]}

Wang, Hanyun ^{[3
]}

Chen, Zonghao ^{[4
]}

Guo, Yulan ^{[1
,5
]}

机构：

[1] Sun Yat Sen Univ SYSU, Sch Elect & Commun Engn, Guangzhou 510275, Peoples R China

[2] Univ Western Australia UWA, Dept Comp Sci & Software Engn, Perth, WA 6009, Australia

[3] Informat Engn Univ, Sch Surveying & Mapping, Zhengzhou 45000, Peoples R China

[4] Alibaba Grp, Hangzhou 310000, Peoples R China

[5] Natl Univ Def Technol, Coll Elect Sci & Technol, Changsha 410073, Peoples R China

来源：

IEEE SIGNAL PROCESSING LETTERS | 2021年 / 28卷

基金：

中国国家自然科学基金;

关键词：

Generative adversarial networks; Generators; Estimation; Gallium nitride; Feature extraction; Training; Task analysis; Monocular depth estimation; self-supervised learning; single-image depth prediction;

D O I：

10.1109/LSP.2021.3065203

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Loss function plays a key role in self-supervised monocular depth estimation methods. Current reprojection loss functions are hand-designed and mainly focus on local patch similarity but overlook the global distribution differences between a synthetic image and a target image. In this paper, we leverage global distribution differences by introducing an adversarial loss into the training stage of self-supervised depth estimation. Specifically, we formulate this task as a novel view synthesis problem. We use a depth estimation module and a pose estimation module to form a generator, and then design a discriminator to learn the global distribution differences between real and synthetic images. With the learned global distribution differences, the adversarial loss can be back-propagated to the depth estimation module to improve its performance. Experiments on the KITTI dataset have demonstrated the effectiveness of the adversarial loss. The adversarial loss is further combined with the reprojection loss to achieve the state-of-the-art performance on the KITTI dataset.

引用

页码：638 / 642

页数：5

共 34 条

[1]

Bian J., 2019, NeurIPS, V32, P35

[2]

Casser V, 2019, AAAI CONF ARTIF INTE, P8001

[3]

Chang Shu, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12364), P572, DOI 10.1007/978-3-030-58529-7_34

[4] Distortion-Aware Monocular Depth Estimation for Omnidirectional Images [J].

Chen, Hong-Xiang ;

Li, Kunhong ;

Fu, Zhiheng ;

Li, Mengyi ;

Chen, Zonghao ;

Guo, Yulan .

IEEE SIGNAL PROCESSING LETTERS, 2021, 28 (28) :334-338

[5]

Eigen D, 2014, ADV NEUR IN, V27

[6] Deep Ordinal Regression Network for Monocular Depth Estimation [J].

Fu, Huan ;

Gong, Mingming ;

Wang, Chaohui ;

Batmanghelich, Kayhan ;

Tao, Dacheng .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :2002-2011

[7] Vision meets robotics: The KITTI dataset [J].

Geiger, A. ;

Lenz, P. ;

Stiller, C. ;

Urtasun, R. .

INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2013, 32 (11) :1231-1237

[8] Digging Into Self-Supervised Monocular Depth Estimation [J].

Godard, Clement ;

Mac Aodha, Oisin ;

Firman, Michael ;

Brostow, Gabriel .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :3827-3837

[9] Unsupervised Monocular Depth Estimation with Left-Right Consistency [J].

Godard, Clement ;

Mac Aodha, Oisin ;

Brostow, Gabriel J. .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6602-6611

[10] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

← 1 2 3 4 →