Towards Real-Time Advancement of Underwater Visual Quality With GAN

被引:113
作者
Chen, Xingyu [1 ,2 ]
Yu, Junzhi [1 ,2 ,3 ]
Kong, Shihan [1 ,2 ]
Wu, Zhengxing [1 ,2 ]
Fang, Xi [4 ]
Wen, Li [4 ]
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Peking Univ, Beijing Innovat Ctr Engn Sci & Adv Technol, Beijing 100871, Peoples R China
[4] Beihang Univ, Sch Mech Engn & Automat, Beijing 100191, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Generative adversarial networks (GAN) image restoration; machine learning; underwater vision; IMAGE-ENHANCEMENT;
D O I
10.1109/TIE.2019.2893840
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Low visual quality has prevented underwater robotic vision from a wide range of applications. Although several algorithms have been developed, real time and adaptive methods are deficient for real-world tasks. In this paper, we address this difficulty based on generative adversarial networks (GAN), and propose a GAN-based restoration scheme (GAN-RS). In particular, we develop a multibranch discriminator including an adversarial branch and a critic branch for the purpose of simultaneously preserving image content and removing underwater noise. In addition to adversarial learning, a novel dark channel prior loss also promotes the generator to produce realistic vision. More specifically, an underwater index is investigated to describe underwater properties, and a loss function based on the underwater index is designed to train the critic branch for underwater noise suppression. Through extensive comparisons on visual quality and feature restoration, we confirm the superiority of the proposed approach. Consequently, the GAN-RS can adaptively improve underwater visual quality in real time and induce an overall superior restoration performance. Finally, a real-world experiment is conducted on the seabed for grasping marine products, and the results are quite promising. The source code is publicly available(1).
引用
收藏
页码:9350 / 9359
页数:10
相关论文
共 35 条
[1]  
Ancuti C, 2012, PROC CVPR IEEE, P81, DOI 10.1109/CVPR.2012.6247661
[2]  
[Anonymous], 2017, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2017.632
[3]  
[Anonymous], 2017, P ASM 36 INT C OC P ASM 36 INT C OC
[4]  
[Anonymous], 2015, P BRIT MACH VIS C
[5]  
[Anonymous], 2014, ARXIV PREPRINT ARXIV
[7]  
Che XY, 2017, IEEE ANN INT CONF CY, P271, DOI 10.1109/CYBER.2017.8446363
[8]   Temporally Identity-Aware SSD With Attentional LSTM [J].
Chen, Xingyu ;
Yu, Junzhi ;
Wu, Zhengxing .
IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (06) :2674-2686
[9]   Underwater Image Enhancement by Wavelength Compensation and Dehazing [J].
Chiang, John Y. ;
Chen, Ying-Ching .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2012, 21 (04) :1756-1769
[10]   Tracking Live Fish From Low-Contrast and Low-Frame-Rate Stereo Videos [J].
Chuang, Meng-Che ;
Hwang, Jenq-Neng ;
Williams, Kresimir ;
Towler, Richard .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2015, 25 (01) :167-179