Deep learning based efficient ship detection from drone-captured images for maritime surveillance

被引:37
作者
Cheng, Shuxiao [1 ]
Zhu, Yishuang [1 ,2 ]
Wu, Shaohua [1 ,2 ]
机构
[1] Harbin Inst Technol Shenzhen, Guangdong Prov Key Lab Aerosp Commun & Networking, Shenzhen 518055, Peoples R China
[2] Peng Cheng Lab, Dept Broadband Commun, Shenzhen 518055, Peoples R China
基金
中国国家自然科学基金;
关键词
Ship detection; Drone-captured images; Maritime surveillance; Convolutional Neural Network (CNN); YOLOv5;
D O I
10.1016/j.oceaneng.2023.115440
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
The use of drones to observe ships is an effective means of maritime surveillance. However, the object scale from drone-captured images changes dramatically, presenting a significant challenge for ship detection. Additionally, the limited computing resources on drones make it difficult to achieve fast detection speed. To address these issues, we propose an efficient deep learning based network, namely the YOLOv5-ODConvNeXt, for ship detection from drone-captured images. YOLOv5-ODConvNeXt is a more accurate and faster network designed to improve the efficiency of maritime surveillance. Based on YOLOv5, we implement Omni-dimensional Convolution (ODConv) in the YOLOv5 backbone to boost the accuracy without increasing the network width and depth. We also replace the original C3 block with a ConvNeXt block in YOLOv5 backbone to accelerate detection speed with only a slight decline in accuracy. We test our model on a self-constructed ship detection dataset containing 3200 images captured by drones or with a drone view. The experimental results show that our model achieves 48.0% AP, exceeding the accuracy of YOLOv5s by 1.2% AP. The detection speed of our network is 8.3 ms per image on an NVIDIA RTX3090 GPU, exceeding the detection speed of YOLOv5s by 13.3%. Our code is available at https://github.com/chengshuxiao/YOLOv5-ODConvNeXt.
引用
收藏
页数:12
相关论文
共 58 条
[1]  
Arjovsky M, 2017, PR MACH LEARN RES, V70
[2]  
Ba JL., 2016, arXiv
[3]  
Bochkovskiy A, 2020, Arxiv, DOI arXiv:2004.10934
[4]  
Carion Nicolas, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P213, DOI 10.1007/978-3-030-58452-8_13
[5]   A Degraded Reconstruction Enhancement-Based Method for Tiny Ship Detection in Remote Sensing Images With a New Large-Scale Dataset [J].
Chen, Jianqi ;
Chen, Keyan ;
Chen, Hao ;
Zou, Zhengxia ;
Shi, Zhenwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[6]   You Only Look One-level Feature [J].
Chen, Qiang ;
Wang, Yingming ;
Yang, Tong ;
Zhang, Xiangyu ;
Cheng, Jian ;
Sun, Jian .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :13034-13043
[7]   Dynamic Convolution: Attention over Convolution Kernels [J].
Chen, Yinpeng ;
Dai, Xiyang ;
Liu, Mengchen ;
Chen, Dongdong ;
Yuan, Lu ;
Liu, Zicheng .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :11027-11036
[8]   Deep learning for autonomous ship-oriented small ship detection [J].
Chen, Zhijun ;
Chen, Depeng ;
Zhang, Yishi ;
Cheng, Xiaozhao ;
Zhang, Mingyang ;
Wu, Chaozhong .
SAFETY SCIENCE, 2020, 130
[9]  
Dai JF, 2016, ADV NEUR IN, V29
[10]   Deformable Convolutional Networks [J].
Dai, Jifeng ;
Qi, Haozhi ;
Xiong, Yuwen ;
Li, Yi ;
Zhang, Guodong ;
Hu, Han ;
Wei, Yichen .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :764-773