Detection and segmentation of underwater objects from forward-looking sonar based on a modified Mask RCNN

被引：53

作者：

Fan, Zhimiao ^{[1
]}

Xia, Weijie ^{[1
]}

Liu, Xue ^{[1
]}

Li, Hailin ^{[1
]}

机构：

[1] Nanjing Univ Aeronaut & Astronaut, Nanjing, Peoples R China

来源：

SIGNAL IMAGE AND VIDEO PROCESSING | 2021年 / 15卷 / 06期

基金：

中国国家自然科学基金;

关键词：

Object detection and segmentation; Forward-looking sonar; Mask RCNN; Adagrad optimizer; Resnet;

D O I：

10.1007/s11760-020-01841-x

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Nowadays, high-frequency forward-looking sonar is an effective device to obtain the main information of underwater objects. Detection and segmentation of underwater objects are also one of the key topics of current research. Deep learning has shown excellent performance in image features extracting and has been extensively used in image object detection and instance segmentation. With the network depth increasing, training accuracy gets saturated and training parameters also increase rapidly. In this paper, a series of residual blocks are used to build a 32-layer feature extraction network and take place of the Resnet50/101 in Mask RCNN, which reduces the training parameters of the network while guaranteeing the detection performance. The parameters of the proposed network are 29% less than Resnet50 and 50.2% less than Resnet101, which is of great significance for future hardware implementation. In addition, Adagrad optimizer is introduced into this research to improve the detection performance of sonar images. Finally, the object detection results of 500 test sonar images show that the mAP is 96.97% that is only 0.18% less than Resnet50 (97.15%) but more than Resnet101 (95.15%).

引用

页码：1135 / 1143

页数：9

共 28 条

[1] A Statistically-Based Method for the Detection of Underwater Objects in Sonar Imagery
Ahu, Avi
Diamant, Roee
[J]. IEEE SENSORS JOURNAL, 2019, 19 (16) : 6858 - 6871
[2] Azimi-Sadjadi MR., 2019, IEEE J OCEANIC ENG, V2019, P1
[3] Cho H, 2015, IEEE MTTS INT MICROW, P14
[4] Duchi J, 2011, J MACH LEARN RES, V12, P2121
[5] Girshick R., 2014, P IEEE C COMPUTER VI, DOI [10.1109/CVPR.2014.81, DOI 10.1109/CVPR.2014.81]
[6] He K, P IEEE C COMP VIS PA, P770, DOI [DOI 10.1109/CVPR.2016.90, 10.1109/CVPR.2016.90]
[7] He KM, 2020, IEEE T PATTERN ANAL, V42, P386, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
[8] YOLOv3-DPFIN: A Dual-Path Feature Fusion Neural Network for Robust Real-Time Sonar Target Detection
Kong, Wanzeng
Hong, Jichen
Jia, Mingyang
Yao, Jinliang
Gong, Weihua
Hu, Hua
Zhang, Haigang
[J]. IEEE SENSORS JOURNAL, 2020, 20 (07) : 3745 - 3756
[9] ImageNet Classification with Deep Convolutional Neural Networks
Krizhevsky, Alex
Sutskever, Ilya
Hinton, Geoffrey E.
[J]. COMMUNICATIONS OF THE ACM, 2017, 60 (06) : 84 - 90
[10] Microsoft COCO: Common Objects in Context
Lin, Tsung-Yi
Maire, Michael
Belongie, Serge
Hays, James
Perona, Pietro
Ramanan, Deva
Dollar, Piotr
Zitnick, C. Lawrence
[J]. COMPUTER VISION - ECCV 2014, PT V, 2014, 8693 : 740 - 755

← 1 2 3 →