Modified Yolov3 for Ship Detection with Visible and Infrared Images

被引:22
作者
Chang, Lena [1 ]
Chen, Yi-Ting [2 ]
Wang, Jung-Hua [2 ,3 ]
Chang, Yang-Lang [4 ]
机构
[1] Natl Taiwan Ocean Univ, Dept Commun Nav & Control Engn, Keelung 202301, Taiwan
[2] Natl Taiwan Ocean Univ, Dept Elect Engn, Keelung 202301, Taiwan
[3] Natl Taiwan Ocean Univ, AI Res Ctr, Dept Elect Engn, Keelung 202301, Taiwan
[4] Natl Taipei Univ Technol, Dept Elect Engn, Taipei 106344, Taiwan
关键词
ship detection; Yolov3; spatial pyramid pooling; infrared images; visible images; SHAPE;
D O I
10.3390/electronics11050739
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As the demands for international marine transportation increase rapidly, effective port management has become an important issue. Automatic ship recognition can facilitate the realization of smart ports, and improve the efficiency of port operation and management. In order to take into account the processing efficiency and detection accuracy at the same time, the study presented an improved deep-learning network based on You only look once version 3 (Yolov3) for all-day ship detection with visible and infrared images. Yolov3 network can simultaneously improve the recognition ability of large and small objects through multiscale feature-extraction architecture. Considering reducing computational time and network complexity with relatively competitive detection accuracy, the study modified the architecture of Yolov3 by choosing an appropriate input image size, fewer convolution filters, and detection scales. In addition, the reduced Yolov3 was further modified with the spatial pyramid pooling (SPP) module to improve the network performance in feature extraction. Therefore, the proposed modified network can achieve the purpose of multi-scale, multi-type, and multi-resolution ship detection. In the study, a common self-built data set was introduced, aiming to conduct all-day and real-time ship detection. The data set included a total of 5557 infrared and visible light images from six common ship types in northern Taiwan ports. The experimental results on the data set showed that the proposed modified network architecture achieved acceptable performance in ship detection, with the mean average precision (mAP) of 93.2%, processing 104 frames per second (FPS), and 29.2 billion floating point operations (BFLOPs). Compared with the original Yolov3, the proposed method can increase mAP and FPS by about 5.8% and 8%, respectively, while reducing BFLOPs by about 47.5%. Furthermore, the computational efficiency and detection performance of the proposed approach have been verified in the comparative experiments with some existing convolutional neural networks (CNNs). In conclusion, the proposed method can achieve high detection accuracy with lower computational costs compared to other networks.
引用
收藏
页数:20
相关论文
共 57 条
[1]   Nonparametric Denoising Methods Based on Contour let Transform with Sharp Frequency Localization: Application to Low Exposure Time Electron Microscopy Images [J].
Ahmed, Soumia Sid ;
Messali, Zoubeida ;
Ouahabi, Abdeldjalil ;
Trepout, Sylvain ;
Messaoudi, Cedric ;
Marco, Sergio .
ENTROPY, 2015, 17 (05) :3461-3478
[2]  
[Anonymous], 2018, TZUTALIN TZUTALIN LA
[3]  
[Anonymous], 2010, International journal of computer vision, DOI DOI 10.1007/s11263-009-0275-4
[4]  
[Anonymous], 1990, Nonlinear Digital Filters: Principles and Applications
[5]  
[Anonymous], 2020, ALEXEYAB ALEXEYABDAR
[6]  
Bochkovskiy A., 2020, PREPRINT
[7]  
Chang L., 2021, P 2021 IEEE INT GEOS
[8]   Ship Detection Based on YOLOv2 for SAR Imagery [J].
Chang, Yang-Lang ;
Anagaw, Amare ;
Chang, Lena ;
Wang, Yi Chun ;
Hsiao, Chih-Yu ;
Lee, Wei-Hong .
REMOTE SENSING, 2019, 11 (07)
[9]   Robust Visual Ship Tracking with an Ensemble Framework via Multi-View Learning and Wavelet Filter [J].
Chen, Xinqiang ;
Chen, Huixing ;
Wu, Huafeng ;
Huang, Yanguo ;
Yang, Yongsheng ;
Zhang, Wenhui ;
Xiong, Pengwen .
SENSORS, 2020, 20 (03)
[10]   SUPPORT-VECTOR NETWORKS [J].
CORTES, C ;
VAPNIK, V .
MACHINE LEARNING, 1995, 20 (03) :273-297