UnitModule: A lightweight joint image enhancement module for underwater object detection

Cited by: 23

Authors:

Liu, Zhuoyan [1]

Wang, Bo [1]

Li, Ye [1]

He, Jiaxian [1]

Li, Yunfeng [1]

Affiliations:

[1] Harbin Engn Univ, Natl Key Lab Autonomous Marine Vehicle Technol, Harbin 150001, Peoples R China

Source:

PATTERN RECOGNITION | 2024 / Vol. 151

Funding:

National Natural Science Foundation of China;

Keywords:

Underwater object detection; Image enhancement; Unsupervised learning; Joint training;

DOI:

10.1016/j.patcog.2024.110435

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Underwater object detection faces the problem of underwater image degradation, which affects the performance of the detector. Underwater object detection methods based on noise reduction and image enhancement usually do not provide images preferred by the detector or require additional datasets. In this paper, we propose a plug-and-play Underwater joint image enhancement Module (UnitModule) that provides the input image preferred by the detector. We design an unsupervised learning loss for the joint training of UnitModule with the detector without additional datasets to improve the interaction between UnitModule and the detector. Furthermore, a color cast predictor with an assisting color cast loss and a data augmentation called Underwater Color Random Transfer (UCRT) are designed to improve the performance of UnitModule on underwater images with different color casts. Extensive experiments are conducted on DUO for different object detection models, where UnitModule achieves the highest performance improvement of 2.6 AP for YOLOv5-S and gains an improvement of 3.3 AP on the brand-new test set (URPCtest). UnitModule also significantly improves the performance of all object detection models we test, especially for models with a small number of parameters. In addition, UnitModule, with only 31K parameters, has little effect on the inference speed of the original object detection model. Our quantitative and visual analysis also demonstrates the effectiveness of UnitModule in enhancing the input image and improving the perception ability of the detector for object features. The code is available at https://github.com/LEFTeyex/UnitModule.


Pages: 11
