Self-Supervised Underwater Image Generation for Underwater Domain Pre-Training

被引:3
作者
Wu, Zhiheng [1 ,2 ]
Wu, Zhengxing [1 ,2 ]
Chen, Xingyu [3 ]
Lu, Yue [1 ,2 ]
Yu, Junzhi [3 ]
机构
[1] Chinese Acad Sci, Inst Automat, Lab Cognit & Decis Intelligence Complex Syst, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
[3] Peking Univ, Coll Engn, Dept Adv Mfg & Robot, State Key Lab Turbulence & Complex Syst,BIC ESAT, Beijing 100871, Peoples R China
基金
北京市自然科学基金;
关键词
Object detection; pre-training; self-supervised learning; semantic segmentation; underwater image generation;
D O I
10.1109/TIM.2024.3373105
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The rapid progress in computer vision has presented new opportunities for enhancing the visual capabilities of underwater robots. However, most deep learning-based visual perception algorithms often underperform due to the scarcity of underwater datasets. To address this issue, we propose an underwater image synthesis method for pre-training in the underwater domain. By leveraging self-supervised learning, we simulate the physical imaging process of underwater scenes, allowing for style transfer from in-air images to underwater images using a reduced amount of underwater data. Furthermore, we propose a pre-training strategy that utilizes synthetic underwater images to enhance underwater visual perception. Finally, abundant experiments are conducted, including quantitative and qualitative comparisons. The results validate the effectiveness and superiority of the proposed underwater image synthesis method, highlighting the substantial improvement in underwater environment perception achieved through the underwater domain pre-training (UDP) strategy.
引用
收藏
页码:1 / 14
页数:14
相关论文
共 47 条
[11]   RUIG: Realistic Underwater Image Generation Towards Restoration [J].
Desai, Chaitra ;
Tabib, Ramesh Ashok ;
Reddy, Sai Sudheer ;
Patil, Ujwala ;
Mudenagudi, Uma .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, :2181-2189
[12]   The Pascal Visual Object Classes (VOC) Challenge [J].
Everingham, Mark ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338
[13]   The PASCAL Visual Object Classes Challenge: A Retrospective [J].
Everingham, Mark ;
Eslami, S. M. Ali ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 111 (01) :98-136
[14]  
Ge Z, 2021, Arxiv, DOI arXiv:2107.08430
[15]   Digging Into Self-Supervised Monocular Depth Estimation [J].
Godard, Clement ;
Mac Aodha, Oisin ;
Firman, Michael ;
Brostow, Gabriel .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :3827-3837
[16]  
Hariharan B, 2011, IEEE I CONF COMP VIS, P991, DOI 10.1109/ICCV.2011.6126343
[17]  
Hendrycks D, 2019, PR MACH LEARN RES, V97
[18]   Cross-Domain Weakly-Supervised Object Detection through Progressive Domain Adaptation [J].
Inoue, Naoto ;
Furuta, Ryosuke ;
Yamasaki, Toshihiko ;
Aizawa, Kiyoharu .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :5001-5009
[19]   Semantic Segmentation of Underwater Imagery: Dataset and Benchmark [J].
Islam, Md Jahidul ;
Edge, Chelsey ;
Xiao, Yuyang ;
Luo, Peigen ;
Mehtaz, Muntaqim ;
Morse, Christopher ;
Enan, Sadman Sakib ;
Sattar, Junaed .
2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, :1769-1776
[20]   COMPUTER MODELING AND THE DESIGN OF OPTIMAL UNDERWATER IMAGING-SYSTEMS [J].
JAFFE, JS .
IEEE JOURNAL OF OCEANIC ENGINEERING, 1990, 15 (02) :101-111