MKIoU loss: toward accurate oriented object detection in aerial images

被引:2
作者
Yu, Xinyi [1 ]
Lu, Jiangping [1 ]
Lin, Mi [1 ]
Zhou, Libo [1 ]
Ou, Linlin [1 ]
机构
[1] Zhejiang Univ Technol, Hangzhou, Peoples R China
关键词
oriented object detection; MKIoU Loss; Gaussian angle loss; aerial images;
D O I
10.1117/1.JEI.32.3.033030
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Oriented bounding box regression is crucial for oriented object detection. However, regression-based methods often suffer from boundary problems and the inconsistency between loss and evaluation metrics. A modulated Kalman intersection over union (IoU) loss of approximate SkewIoU is proposed, named MKIoU. To avoid boundary problems, we convert the oriented bounding box to Gaussian distribution then use the Kalman filter to approximate the intersection area. However, there exists significant difference between the calculated and actual intersection areas. Thus, we propose a modulation factor to adjust the sensitivity of angle deviation and width-height offset to loss variation, making the loss more consistent with the evaluation metric. Furthermore, the Gaussian modeling method avoids the boundary problem but causes the angle confusion of square objects simultaneously. Thus, the Gaussian angle loss (GA loss) is presented to solve this problem by adding a corrected loss for square targets. The proposed GA loss can be easily extended to other Gaussian-based methods. Experiments on three publicly available aerial image datasets, DOTA, UCAS-AOD, and HRSC2016, show the effectiveness of the proposed method. (C) 2023 SPIE and IS&T
引用
收藏
页数:15
相关论文
共 56 条
  • [1] Bhattacharyya A, 1946, SANKHYA, V7, P401
  • [2] Chen Z., 2020, P EUR C COMP VIS ECC, P195
  • [3] Cheng G, 2022, Arxiv, DOI [arXiv:2110.01931, DOI 10.1109/TGRS.2022.3183022]
  • [4] Learning RoI Transformer for Oriented Object Detection in Aerial Images
    Ding, Jian
    Xue, Nan
    Long, Yang
    Xia, Gui-Song
    Lu, Qikai
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2844 - 2853
  • [5] The Pascal Visual Object Classes (VOC) Challenge
    Everingham, Mark
    Van Gool, Luc
    Williams, Christopher K. I.
    Winn, John
    Zisserman, Andrew
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) : 303 - 338
  • [6] Feng PM, 2020, INT CONF ACOUST SPEE, P4057, DOI 10.1109/ICASSP40776.2020.9053562
  • [7] Fast R-CNN
    Girshick, Ross
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1440 - 1448
  • [8] Beyond Bounding-Box: Convex-hull Feature Adaptation for Oriented and Densely Packed Object Detection
    Guo, Zonghao
    Liu, Chang
    Zhang, Xiaosong
    Jiao, Jianbin
    Ji, Xiangyang
    Ye, Qixiang
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 8788 - 8797
  • [9] Align Deep Features for Oriented Object Detection
    Han, Jiaming
    Ding, Jian
    Li, Jie
    Xia, Gui-Song
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [10] ReDet: A Rotation-equivariant Detector for Aerial Object Detection
    Han, Jiaming
    Ding, Jian
    Xue, Nan
    Xia, Gui-Song
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 2785 - 2794