Target detection in sonar images based on variable scale prior frame

被引:0
作者
Huang S. [1 ,2 ]
Song C. [1 ,3 ]
Li X. [1 ,2 ]
机构
[1] Key Laboratory of Underwater Vehicle Information Technology, Institute of Acoustics, Chinese Academy of Sciences, Beijing
[2] College of Electronics, Electrical and Telecommunications Engineering, University of Chinese Academy of Sciences, Beijing
[3] Intelligent Sensing and Computing Research Center, Institute of Automation, Chinese Academy of Sciences, Beijing
来源
Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics | 2024年 / 46卷 / 03期
关键词
data enhancement; lightweight model; scale clustering; sonar image; target detection;
D O I
10.12305/j.issn.1001-506X.2024.03.01
中图分类号
学科分类号
摘要
In recent years, target detection in sonar images using deep learning has become a hot research topic. However, sonar images have problems such as the concentration of target scale distribution and the difficulty of data acquisition, which makes the detection effect difficult to meet the requirements. A target detection method based on variable scale prior frame is proposed to address this issue. Firstly, considering the particularity of target scale distribution in sonar images, variable scale prior frames arc generated based on prior statistics. Secondly, in order to solve the problem of sonar image scarcity, data augmentation methods arc used to expand the training set. Finally, the lightweighting of the model is explored by deleting the large object detection layer of the model, simplifying the model structure without reducing model accuracy. In order to evaluate the effectiveness of the algorithm, comprehensive experiments arc conducted on forward-looking sonar images as an example to determine the mean average precision (mAP)@0. 75 and mAP@ 0. 5:0. 95 reached 0. 585 and 0. 559 respectively, which increased by 5. 8% and 3. 1% compared to the original Yolov5 network, while giga floating-point operations (GFLOPs) decreased to 14. 9. The results show that the proposed algorithm has higher accuracy and a light weight model structure. © 2024 Chinese Institute of Electronics. All rights reserved.
引用
收藏
页码:771 / 778
页数:7
相关论文
共 32 条
[1]  
SUN Y Y., A review of deep learning and its application in image classification and recognition, Information Technology and Informatization, 1, pp. 138-140, (2018)
[2]  
YU C C, LI H B., A review of face recognition methods based on deep learning, Chinese Journal of Engineering Mathematics, 38, 4, pp. 451-469, (2021)
[3]  
WANG Y M, LI Y, ZOU H., Masked face recognition system based on attention mechanism, Information, 14, 2, (2023)
[4]  
ZHU Z, HUANG G, DENG J K, Et al., WebFace260M: a benchmark for million-scale deep face recognition, IEEE Trans. on Pattern Analysis and Machine Intelligence, 45, 2, pp. 2627-2644, (2023)
[5]  
TONG C, HAN Y, FENG W, Et al., Research progress in deep learning processing methods for medical images, Beijing Biomedical Engineering, 40, 2, pp. 198-202, (2021)
[6]  
JIANG H Y, DIAO Z S, SHI T Y, Et al., A review of deep learning-based multiple-lesion recognition from medical images: classification, detection and segmentation, Computers in Bi- ology and Medicine, 157, (2023)
[7]  
SMITHSON C J R, EICHBAUM Q G, GAUTHIER I., Object recognition ability predicts category learning with medical images, Cognitive Research: Principles and Implications, 8, 1, (2023)
[8]  
WANG X, ZHANG X C, SU C., Land use classification of remote sensing images based on multi-scale learning and depth Convolutional neural network, Journal of Zhejiang University (Science Edition), 47, 6, pp. 715-723, (2020)
[9]  
GADIRAJU K K, VATSAVAI R R., Remote sensing based crop type classification via deep transfer learning, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 16, pp. 4699-4712, (2023)
[10]  
HAO X J, LIU L, YANG R J, Et al., A review of data augmentation methods of remote sensing image target recognition, Remote Sensing, 15, 3, (2023)