Boosting R-CNN: Reweighting R-CNN samples by RPN?s error for underwater object detection

被引:142
作者
Song, Pinhao [1 ]
Li, Pengteng [2 ]
Dai, Linhui [1 ]
Wang, Tao [1 ]
Chen, Zhan [1 ]
机构
[1] Peking Univ, Shenzhen Grad Sch, Key Lab Machine Percept, Shenzhen 518055, Guangdong, Peoples R China
[2] Shenzhen Univ, Shenzhen 518060, Guangdong, Peoples R China
关键词
Underwater object detection; Hard example mining; Uncertainty modeling; RECOGNITION; FASTER;
D O I
10.1016/j.neucom.2023.01.088
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Complicated underwater environments bring new challenges to object detection, such as unbalanced light conditions, low contrast, occlusion, and mimicry of aquatic organisms. Under these circumstances, the objects captured by the underwater camera will become vague, and the generic detectors often fail on these vague objects. This work aims to solve the problem from two perspectives: uncertainty modeling and hard example mining. We propose a two-stage underwater detector named boosting R-CNN, which comprises three key components. First, a new region proposal network named RetinaRPN is proposed, which provides high-quality proposals and considers objectness and IoU prediction for uncertainty to model the object prior probability. Second, the probabilistic inference pipeline is introduced to combine the first-stage prior uncertainty and the second-stage classification score to model the final detection score. Finally, we propose a new hard example mining method named boosting reweighting. Specifically, when the region proposal network miscalculates the object prior probability for a sample, boosting reweighting will increase the classification loss of the sample in the R-CNN head during training, while reducing the loss of easy samples with accurately estimated priors. Thus, a robust detection head in the second stage can be obtained. During the inference stage, the R-CNN has the capability to rectify the error of the first stage to improve the performance. Comprehensive experiments on two underwater datasets and two generic object detection datasets demonstrate the effectiveness and robustness of our method. The link of code: https://github.com/mousecpn/Boosting-R-CNN. (c) 2023 Elsevier B.V. All rights reserved.
引用
收藏
页码:150 / 164
页数:15
相关论文
共 71 条
[1]  
[Anonymous], 2010, International journal of computer vision, DOI DOI 10.1007/s11263-009-0275-4
[2]  
Bochkovskiy A, 2020, Arxiv, DOI [arXiv:2004.10934, DOI 10.48550/ARXIV.2004.10934]
[3]  
Cai Z., 2018, P IEEE CVF C COMP VI
[4]  
Cao Y., 2020, P IEEECVF C COMPUTER
[5]   End-to-End Object Detection with Transformers [J].
Carion, Nicolas ;
Massa, Francisco ;
Synnaeve, Gabriel ;
Usunier, Nicolas ;
Kirillov, Alexander ;
Zagoruyko, Sergey .
COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :213-229
[6]  
Chen K, 2020, Arxiv, DOI arXiv:2004.03580
[7]  
Chen K, 2019, Arxiv, DOI arXiv:1906.07155
[8]  
Chen L, 2022, Arxiv, DOI arXiv:2010.10006
[9]  
Chen XY, 2020, Arxiv, DOI arXiv:2003.01913
[10]  
Chen Yu, 2020, ADV NEURAL INFORM PR, V33