ReBiDet: An Enhanced Ship Detection Model Utilizing ReDet and Bi-Directional Feature Fusion

被引:4
作者
Yan, Zexin [1 ]
Li, Zhongbo [1 ]
Xie, Yongqiang [1 ]
Li, Chengyang [1 ,2 ]
Li, Shaonan [1 ]
Sun, Fangwei [1 ]
机构
[1] Acad Mil Sci, Inst Syst Engn, Beijing 100000, Peoples R China
[2] Peking Univ, Sch Comp Sci, Beijing 100000, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 12期
关键词
artificial intelligence; deep learning; remote sensing images; ship detection; bi-directional feature fusion; feature pyramid network; anchor size; K-means; sampler;
D O I
10.3390/app13127080
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
To enhance ship detection accuracy in the presence of complex scenes and significant variations in object scales, this study introduces three enhancements to ReDet, resulting in a more powerful ship detection model called rotation-equivariant bidirectional feature fusion detector (ReBiDet). Firstly, the feature pyramid network (FPN) structure in ReDet is substituted with a rotation-equivariant bidirectional feature fusion feature pyramid network (ReBiFPN) to effectively capture and enrich multiscale feature information. Secondly, K-means clustering is utilized to group the aspect ratios of ground truth boxes in the dataset and adjust the anchor size settings accordingly. Lastly, the difficult positive reinforcement learning (DPRL) sampler is employed instead of the random sampler to address the scale imbalance issue between objects and backgrounds in the dataset, enabling the model to prioritize challenging positive examples. Through numerous experiments conducted on the HRSC2016 and DOTA remote sensing image datasets, the effectiveness of the proposed improvements in handling complex environments and small object detection tasks is validated. The ReBiDet model demonstrates state-of-the-art performance in remote sensing object detection tasks. Compared to the ReDet model and other advanced models, our ReBiDet achieves mAP improvements of 3.20, 0.42, and 1.16 on HRSC2016, DOTA-v1.0, and DOTA-v1.5, respectively, with only a slight increase of 0.82 million computational parameters.
引用
收藏
页数:25
相关论文
共 58 条
[21]   Rotation-sensitive Regression for Oriented Scene Text Detection [J].
Liao, Minghui ;
Zhu, Zhen ;
Shi, Baoguang ;
Xia, Gui-song ;
Bai, Xiang .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :5909-5918
[22]   Focal Loss for Dense Object Detection [J].
Lin, Tsung-Yi ;
Goyal, Priya ;
Girshick, Ross ;
He, Kaiming ;
Dollar, Piotr .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :2999-3007
[23]   Microsoft COCO: Common Objects in Context [J].
Lin, Tsung-Yi ;
Maire, Michael ;
Belongie, Serge ;
Hays, James ;
Perona, Pietro ;
Ramanan, Deva ;
Dollar, Piotr ;
Zitnick, C. Lawrence .
COMPUTER VISION - ECCV 2014, PT V, 2014, 8693 :740-755
[24]   Feature Pyramid Networks for Object Detection [J].
Lin, Tsung-Yi ;
Dollar, Piotr ;
Girshick, Ross ;
He, Kaiming ;
Hariharan, Bharath ;
Belongie, Serge .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :936-944
[25]   Path Aggregation Network for Instance Segmentation [J].
Liu, Shu ;
Qi, Lu ;
Qin, Haifang ;
Shi, Jianping ;
Jia, Jiaya .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :8759-8768
[26]   Arbitrary-Oriented Ship Detection Framework in Optical Remote-Sensing Images [J].
Liu, Wenchao ;
Ma, Long ;
Chen, He .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2018, 15 (06) :937-941
[27]  
Liu ZK, 2017, IEEE IMAGE PROC, P900, DOI 10.1109/ICIP.2017.8296411
[28]   A High Resolution Optical Satellite Image Dataset for Ship Recognition and Some New Baselines [J].
Liu, Zikun ;
Yuan, Liu ;
Weng, Lubin ;
Yang, Yiping .
ICPRAM: PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2017, :324-331
[29]  
Lyu C, 2022, Arxiv, DOI [arXiv:2212.07784, 10.48550/arXiv.2212.07784]
[30]   EfficientDet: Scalable and Efficient Object Detection [J].
Tan, Mingxing ;
Pang, Ruoming ;
Le, Quoc, V .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :10778-10787