Detection and segmentation of underwater objects from forward-looking sonar based on a modified Mask RCNN

被引:53
作者
Fan, Zhimiao [1 ]
Xia, Weijie [1 ]
Liu, Xue [1 ]
Li, Hailin [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Nanjing, Peoples R China
基金
中国国家自然科学基金;
关键词
Object detection and segmentation; Forward-looking sonar; Mask RCNN; Adagrad optimizer; Resnet;
D O I
10.1007/s11760-020-01841-x
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Nowadays, high-frequency forward-looking sonar is an effective device to obtain the main information of underwater objects. Detection and segmentation of underwater objects are also one of the key topics of current research. Deep learning has shown excellent performance in image features extracting and has been extensively used in image object detection and instance segmentation. With the network depth increasing, training accuracy gets saturated and training parameters also increase rapidly. In this paper, a series of residual blocks are used to build a 32-layer feature extraction network and take place of the Resnet50/101 in Mask RCNN, which reduces the training parameters of the network while guaranteeing the detection performance. The parameters of the proposed network are 29% less than Resnet50 and 50.2% less than Resnet101, which is of great significance for future hardware implementation. In addition, Adagrad optimizer is introduced into this research to improve the detection performance of sonar images. Finally, the object detection results of 500 test sonar images show that the mAP is 96.97% that is only 0.18% less than Resnet50 (97.15%) but more than Resnet101 (95.15%).
引用
收藏
页码:1135 / 1143
页数:9
相关论文
共 28 条
  • [1] A Statistically-Based Method for the Detection of Underwater Objects in Sonar Imagery
    Ahu, Avi
    Diamant, Roee
    [J]. IEEE SENSORS JOURNAL, 2019, 19 (16) : 6858 - 6871
  • [2] Azimi-Sadjadi MR., 2019, IEEE J OCEANIC ENG, V2019, P1
  • [3] Cho H, 2015, IEEE MTTS INT MICROW, P14
  • [4] Duchi J, 2011, J MACH LEARN RES, V12, P2121
  • [5] Girshick R., 2014, P IEEE C COMPUTER VI, DOI [10.1109/CVPR.2014.81, DOI 10.1109/CVPR.2014.81]
  • [6] He K, P IEEE C COMP VIS PA, P770, DOI [DOI 10.1109/CVPR.2016.90, 10.1109/CVPR.2016.90]
  • [7] He KM, 2020, IEEE T PATTERN ANAL, V42, P386, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
  • [8] YOLOv3-DPFIN: A Dual-Path Feature Fusion Neural Network for Robust Real-Time Sonar Target Detection
    Kong, Wanzeng
    Hong, Jichen
    Jia, Mingyang
    Yao, Jinliang
    Gong, Weihua
    Hu, Hua
    Zhang, Haigang
    [J]. IEEE SENSORS JOURNAL, 2020, 20 (07) : 3745 - 3756
  • [9] ImageNet Classification with Deep Convolutional Neural Networks
    Krizhevsky, Alex
    Sutskever, Ilya
    Hinton, Geoffrey E.
    [J]. COMMUNICATIONS OF THE ACM, 2017, 60 (06) : 84 - 90
  • [10] Microsoft COCO: Common Objects in Context
    Lin, Tsung-Yi
    Maire, Michael
    Belongie, Serge
    Hays, James
    Perona, Pietro
    Ramanan, Deva
    Dollar, Piotr
    Zitnick, C. Lawrence
    [J]. COMPUTER VISION - ECCV 2014, PT V, 2014, 8693 : 740 - 755