Unsupervised underwater image enhancement via content-style representation disentanglement

被引:14
作者
Zhu, Pengli [1 ,2 ]
Liu, Yancheng [1 ]
Wen, Yuanquan [1 ]
Xu, Minyi [1 ]
Fu, Xianping [3 ]
Liu, Siyuan [1 ]
机构
[1] Dalian Maritime Univ, Coll Marine Engn, Dalian, Peoples R China
[2] Natl Univ Singapore, Coll Design & Engn, Singapore, Singapore
[3] Dalian Maritime Univ, Coll Informat Sci & Technol, Dalian, Peoples R China
关键词
Underwater image enhancement; Representation disentanglement; Unsupervised learning; Cycle-consistent adversarial translation; NEURAL-NETWORK; TRANSLATION;
D O I
10.1016/j.engappai.2023.106866
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The absorption and scattering properties of the water medium cause various types of distortion in underwater images, which seriously affects the accuracy and effectiveness of subsequent processing. The application of supervised learning algorithms in underwater image enhancement is limited by the difficulty of obtaining a large number of underwater paired images in practical applications. As a solution, we propose an unsupervised representation disentanglement based underwater image enhancement method (URD-UIE). URD-UIE disentangles content information (e.g., texture, semantics) and style information (e.g., chromatic aberration, blur, noise, and clarity) from underwater images and then employs the disentangled information to generate the target distortion-free image. Our proposed method URD-UIE adopts an unsupervised cycle-consistent adversarial translation architecture and combines multiple loss functions to impose specific constraints on the output results of each module to ensure the structural consistency of underwater images before and after enhancement. The experimental results demonstrate that the URD-UIE technique effectively enhances the quality of underwater images when training with unpaired data, resulting in a significant improvement in the performance of the standard model for underwater object detection and semantic segmentation.
引用
收藏
页数:13
相关论文
共 76 条
[51]   UMGAN: Underwater Image Enhancement Network for Unpaired Image-to-Image Translation [J].
Sun, Boyang ;
Mei, Yupeng ;
Yan, Ni ;
Chen, Yingyi .
JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (02)
[52]   Deep pixel-to-pixel network for underwater image enhancement and restoration [J].
Sun, Xin ;
Liu, Lipeng ;
Li, Qiong ;
Dong, Junyu ;
Lima, Estanislau ;
Yin, Ruiying .
IET IMAGE PROCESSING, 2019, 13 (03) :469-474
[53]   Improved Texture Networks: Maximizing Quality and Diversity in Feed-forward Stylization and Texture Synthesis [J].
Ulyanov, Dmitry ;
Vedaldi, Andrea ;
Lempitsky, Victor .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4105-4113
[54]   Underwater self-supervised monocular depth estimation and its application in image enhancement [J].
Wang, Junting ;
Ye, Xiufen ;
Liu, Yusong ;
Mei, Xinkui ;
Hou, Jun .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 120
[55]   Deep convolutional cross-connected kernel mapping support vector machine based on SelectDropout [J].
Wang, Qi ;
Liu, Zhaoying ;
Zhang, Ting ;
Alasmary, Hisham ;
Waqas, Muhammad ;
Halim, Zahid ;
Li, Yujian .
INFORMATION SCIENCES, 2023, 626 :694-709
[56]   Adaptive feature fusion for time series classification [J].
Wang, Tian ;
Liu, Zhaoying ;
Zhang, Ting ;
Hussain, Syed Fawad ;
Waqas, Muhammad ;
Li, Yujian .
KNOWLEDGE-BASED SYSTEMS, 2022, 243
[57]   High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs [J].
Wang, Ting-Chun ;
Liu, Ming-Yu ;
Zhu, Jun-Yan ;
Tao, Andrew ;
Kautz, Jan ;
Catanzaro, Bryan .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :8798-8807
[58]  
Yan SZ, 2023, Arxiv, DOI arXiv:2107.02660
[59]   An Underwater Color Image Quality Evaluation Metric [J].
Yang, Miao ;
Sowmya, Arcot .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (12) :6062-6071
[60]   Underwater Image Enhancement Using Stacked Generative Adversarial Networks [J].
Ye, Xinchen ;
Xu, Hongcan ;
Ji, Xiang ;
Xu, Rui .
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT III, 2018, 11166 :514-524