Modified Yolov3 for Ship Detection with Visible and Infrared Images

被引:21
作者
Chang, Lena [1 ]
Chen, Yi-Ting [2 ]
Wang, Jung-Hua [2 ,3 ]
Chang, Yang-Lang [4 ]
机构
[1] Natl Taiwan Ocean Univ, Dept Commun Nav & Control Engn, Keelung 202301, Taiwan
[2] Natl Taiwan Ocean Univ, Dept Elect Engn, Keelung 202301, Taiwan
[3] Natl Taiwan Ocean Univ, AI Res Ctr, Dept Elect Engn, Keelung 202301, Taiwan
[4] Natl Taipei Univ Technol, Dept Elect Engn, Taipei 106344, Taiwan
关键词
ship detection; Yolov3; spatial pyramid pooling; infrared images; visible images; SHAPE;
D O I
10.3390/electronics11050739
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As the demands for international marine transportation increase rapidly, effective port management has become an important issue. Automatic ship recognition can facilitate the realization of smart ports, and improve the efficiency of port operation and management. In order to take into account the processing efficiency and detection accuracy at the same time, the study presented an improved deep-learning network based on You only look once version 3 (Yolov3) for all-day ship detection with visible and infrared images. Yolov3 network can simultaneously improve the recognition ability of large and small objects through multiscale feature-extraction architecture. Considering reducing computational time and network complexity with relatively competitive detection accuracy, the study modified the architecture of Yolov3 by choosing an appropriate input image size, fewer convolution filters, and detection scales. In addition, the reduced Yolov3 was further modified with the spatial pyramid pooling (SPP) module to improve the network performance in feature extraction. Therefore, the proposed modified network can achieve the purpose of multi-scale, multi-type, and multi-resolution ship detection. In the study, a common self-built data set was introduced, aiming to conduct all-day and real-time ship detection. The data set included a total of 5557 infrared and visible light images from six common ship types in northern Taiwan ports. The experimental results on the data set showed that the proposed modified network architecture achieved acceptable performance in ship detection, with the mean average precision (mAP) of 93.2%, processing 104 frames per second (FPS), and 29.2 billion floating point operations (BFLOPs). Compared with the original Yolov3, the proposed method can increase mAP and FPS by about 5.8% and 8%, respectively, while reducing BFLOPs by about 47.5%. Furthermore, the computational efficiency and detection performance of the proposed approach have been verified in the comparative experiments with some existing convolutional neural networks (CNNs). In conclusion, the proposed method can achieve high detection accuracy with lower computational costs compared to other networks.
引用
收藏
页数:20
相关论文
共 50 条
[31]   A yolov8-based lightweight detection model for different perspectives infrared images [J].
Cao, Lei ;
Wang, Qing ;
Luo, Yunhui ;
Hou, Yongjie ;
Zheng, Wanglin ;
Qu, Haiming .
OPTICS COMMUNICATIONS, 2025, 582
[32]   Inshore Ship Detection in Multispectral Satellite Images [J].
Besbinar, Beril ;
Gurbuz, Yeti Ziya ;
Alatan, A. Aydin .
2015 23RD SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2015, :2029-2032
[33]   Multi-resolution networks for ship detection in infrared remote sensing images [J].
Zhou, Min ;
Jing, Minhao ;
Liu, Dunge ;
Xia, Zhenghuan ;
Zou, Zhengxia ;
Shi, Zhenwei .
INFRARED PHYSICS & TECHNOLOGY, 2018, 92 :183-189
[34]   Fusion of visible and infrared images in HSV color space [J].
Manchanda, Meenu ;
Sharma, Rajiv .
2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE & COMMUNICATION TECHNOLOGY (CICT), 2017,
[35]   Fusion of infrared and visible images using multiscale morphology [J].
Araceli Saravia, Cecilia ;
Mereles Peralta, Magali E. ;
Mello Roman, Julio Cesar ;
Vazquez Noguera, Jose Luis ;
Legal Ayala, Horacio .
2019 XLV LATIN AMERICAN COMPUTING CONFERENCE (CLEI 2019), 2019,
[36]   A Multiscale Morphological Method for Visible and Infrared Images Fusion [J].
Mello Roman, Julio Cesar ;
Vazquez Noguera, Jose Luis ;
Legal-Ayala, Horacio .
2020 XLVI LATIN AMERICAN COMPUTING CONFERENCE (CLEI 2020), 2021, :488-495
[37]   ADFuse: An Adaptive Fusion Method for Infrared and Visible Images [J].
Xu, Wanying ;
Yang, Dongxu ;
Zheng, Yongbin ;
Sun, Peng ;
Bai, Shengjian .
ADVANCES IN GUIDANCE, NAVIGATION AND CONTROL, VOL 3, 2025, 1339 :135-146
[38]   YOLOv7oSAR: A Lightweight High-Precision Ship Detection Model for SAR Images Based on the YOLOv7 Algorithm [J].
Liu, Yilin ;
Ma, Yong ;
Chen, Fu ;
Shang, Erping ;
Yao, Wutao ;
Zhang, Shuyan ;
Yang, Jin .
REMOTE SENSING, 2024, 16 (05)
[39]   Street scenes object detection based on infrared images and improved YOLOv5 network [J].
Tan, Ailing ;
Li, Xiaohang ;
Zhao, Yong ;
Gao, Meijing .
JOURNAL OF ELECTRONIC IMAGING, 2025, 34 (03)
[40]   Robust Ship Detection in Infrared Images through Multiscale Feature Extraction and Lightweight CNN [J].
Miao, Rui ;
Jiang, Hongxu ;
Tian, Fangzheng .
SENSORS, 2022, 22 (03)