Optimization for Arbitrary-Oriented Object Detection via Representation Invariance Loss

被引:91
作者
Ming, Qi [1 ]
Miao, Lingjuan [1 ]
Zhou, Zhiqiang [1 ]
Yang, Xue [1 ,2 ]
Dong, Yunpeng [1 ]
机构
[1] Beijing Inst Technol, Sch Automat, Beijing 100081, Peoples R China
[2] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200240, Peoples R China
关键词
Optimization; Object detection; Detectors; Training; Remote sensing; Transforms; Task analysis; Bounding box regression; convolutional neural networks; oriented object detection; representation ambiguity;
D O I
10.1109/LGRS.2021.3115110
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Arbitrary-oriented objects exist widely in remote sensing images. The mainstream rotation detectors use oriented bounding boxes (OBBs) or quadrilateral bounding boxes (QBBs) to represent the rotating objects. However, these methods suffer from the representation ambiguity for oriented object definition, which leads to suboptimal regression optimization and the inconsistency between the loss metric and the localization accuracy of the predictions. In this letter, we propose a representation invariance loss (RIL) to optimize the bounding box regression for the rotating objects in the remote sensing images. RIL treats multiple representations of an oriented object as multiple equivalent local minima and hence transforms bounding box regression into an adaptive matching process with these local minima. Next, the Hungarian matching algorithm is adopted to obtain the optimal regression strategy. Besides, we propose a normalized rotation loss to alleviate the weak correlation between different variables and their unbalanced loss contribution in OBB representation. Extensive experiments on remote sensing datasets show that our method achieves consistent and substantial improvement. The code and models are available at https://github.com/ming71/RIDet to facilitate future research.
引用
收藏
页数:5
相关论文
共 17 条
[1]   Learning RoI Transformer for Oriented Object Detection in Aerial Images [J].
Ding, Jian ;
Xue, Nan ;
Long, Yang ;
Xia, Gui-Song ;
Lu, Qikai .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :2844-2853
[2]   3D Stackable Broadband Photoresponsive InGaAs Biristor Neuron for a Neuromorphic Visual System with Near 1 V Operation [J].
Han, Joon-Kyu ;
Sim, Jaeho ;
Geum, Dae-Myeong ;
Kim, Seong Kwang ;
Yu, Ji-Man ;
Kim, Jongmin ;
Kim, Sanghyeon ;
Choi, Yang-Kyu .
2021 IEEE INTERNATIONAL ELECTRON DEVICES MEETING (IEDM), 2021,
[3]   The Hungarian Method for the assignment problem [J].
Kuhn, HW .
NAVAL RESEARCH LOGISTICS, 2005, 52 (01) :7-21
[4]  
Liu Y., 2019, P IJCAI, P1
[5]   A High Resolution Optical Satellite Image Dataset for Ship Recognition and Some New Baselines [J].
Liu, Zikun ;
Yuan, Liu ;
Weng, Lubin ;
Yang, Yiping .
ICPRAM: PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2017, :324-331
[6]   CFC-Net: A Critical Feature Capturing Network for Arbitrary-Oriented Object Detection in Remote-Sensing Images [J].
Ming, Qi ;
Miao, Lingjuan ;
Zhou, Zhiqiang ;
Dong, Yunpeng .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[7]  
Ming Q, 2021, AAAI CONF ARTIF INTE, V35, P2355
[8]   Sparse Label Assignment for Oriented Object Detection in Aerial Images [J].
Ming, Qi ;
Miao, Lingjuan ;
Zhou, Zhiqiang ;
Song, Junjie ;
Yang, Xue .
REMOTE SENSING, 2021, 13 (14)
[9]  
Qian W, 2021, AAAI CONF ARTIF INTE, V35, P2458
[10]   Focal Loss for Dense Object Detection [J].
Lin, Tsung-Yi ;
Goyal, Priya ;
Girshick, Ross ;
He, Kaiming ;
Dollar, Piotr .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :2999-3007