HTDet: A Hybrid Transformer-Based Approach for Underwater Small Object Detection

被引:23
作者
Chen, Gangqi [1 ]
Mao, Zhaoyong [2 ]
Wang, Kai [3 ]
Shen, Junge [2 ]
机构
[1] Northwestern Polytech Univ, Sch Marine Sci & Technol, Xian 710072, Peoples R China
[2] Northwestern Polytech Univ, Unmanned Syst Res Inst, Xian 710072, Peoples R China
[3] Henan Key Lab Underwater Intelligent Equipment, Zhengzhou 710072, Peoples R China
基金
中国国家自然科学基金;
关键词
deep learning; underwater object detection; transformer; lightweight; feeble and small object;
D O I
10.3390/rs15041076
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
As marine observation technology develops rapidly, underwater optical image object detection is beginning to occupy an important role in many tasks, such as naval coastal defense tasks, aquaculture, etc. However, in the complex marine environment, the images captured by an optical imaging system are usually severely degraded. Therefore, how to detect objects accurately and quickly under such conditions is a critical problem that needs to be solved. In this manuscript, a novel framework for underwater object detection based on a hybrid transformer network is proposed. First, a lightweight hybrid transformer-based network is presented that can extract global contextual information. Second, a fine-grained feature pyramid network is used to overcome the issues of feeble signal disappearance. Third, the test-time-augmentation method is applied for inference without introducing additional parameters. Extensive experiments have shown that the approach we have proposed is able to detect feeble and small objects in an efficient and effective way. Furthermore, our model significantly outperforms the latest advanced detectors with respect to both the number of parameters and the mAP by a considerable margin. Specifically, our detector outperforms the baseline model by 6.3 points, and the model parameters are reduced by 28.5 M.
引用
收藏
页数:22
相关论文
共 61 条
[1]   Generation and Processing of Simulated Underwater Images for Infrastructure Visual Inspection with UUVs [J].
Alvarez-Tunon, Olaya ;
Jardon, Alberto ;
Balaguer, Carlos .
SENSORS, 2019, 19 (24)
[2]   Diving deeper into underwater image enhancement: A survey [J].
Anwar, Saeed ;
Li, Chongyi .
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2020, 89
[3]  
Bochkovskiy A., 2020, YOLOv4: Optimal Speed and Accuracy of Object Detection
[4]   Cascade R-CNN: High Quality Object Detection and Instance Segmentation [J].
Cai, Zhaowei ;
Vasconcelos, Nuno .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (05) :1483-1498
[5]   End-to-End Object Detection with Transformers [J].
Carion, Nicolas ;
Massa, Francisco ;
Synnaeve, Gabriel ;
Usunier, Nicolas ;
Kirillov, Alexander ;
Zagoruyko, Sergey .
COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :213-229
[6]   Hybrid Task Cascade for Instance Segmentation [J].
Chen, Kai ;
Pang, Jiangmiao ;
Wang, Jiaqi ;
Xiong, Yu ;
Li, Xiaoxiao ;
Sun, Shuyang ;
Feng, Wansen ;
Liu, Ziwei ;
Shi, Jianping ;
Ouyang, Wanli ;
Loy, Chen Change ;
Lin, Dahua .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :4969-4978
[7]   You Only Look One-level Feature [J].
Chen, Qiang ;
Wang, Yingming ;
Yang, Tong ;
Zhang, Xiangyu ;
Cheng, Jian ;
Sun, Jian .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :13034-13043
[8]   Xception: Deep Learning with Depthwise Separable Convolutions [J].
Chollet, Francois .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1800-1807
[9]  
Cong Tan, 2021, 2021 IEEE 12th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON), P0127, DOI 10.1109/IEMCON53756.2021.9623066
[10]  
Dosovitskiy Alexey., 2021, PROC INT C LEARN REP, P2021