MKIoU loss: toward accurate oriented object detection in aerial images

被引：2

作者：

Yu, Xinyi ^{[1
]}

Lu, Jiangping ^{[1
]}

Lin, Mi ^{[1
]}

Zhou, Libo ^{[1
]}

Ou, Linlin ^{[1
]}

机构：

[1] Zhejiang Univ Technol, Hangzhou, Peoples R China

来源：

JOURNAL OF ELECTRONIC IMAGING | 2023年 / 32卷 / 03期

关键词：

oriented object detection; MKIoU Loss; Gaussian angle loss; aerial images;

D O I：

10.1117/1.JEI.32.3.033030

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Oriented bounding box regression is crucial for oriented object detection. However, regression-based methods often suffer from boundary problems and the inconsistency between loss and evaluation metrics. A modulated Kalman intersection over union (IoU) loss of approximate SkewIoU is proposed, named MKIoU. To avoid boundary problems, we convert the oriented bounding box to Gaussian distribution then use the Kalman filter to approximate the intersection area. However, there exists significant difference between the calculated and actual intersection areas. Thus, we propose a modulation factor to adjust the sensitivity of angle deviation and width-height offset to loss variation, making the loss more consistent with the evaluation metric. Furthermore, the Gaussian modeling method avoids the boundary problem but causes the angle confusion of square objects simultaneously. Thus, the Gaussian angle loss (GA loss) is presented to solve this problem by adding a corrected loss for square targets. The proposed GA loss can be easily extended to other Gaussian-based methods. Experiments on three publicly available aerial image datasets, DOTA, UCAS-AOD, and HRSC2016, show the effectiveness of the proposed method. (C) 2023 SPIE and IS&T

引用

页数：15

共 56 条

[1] Bhattacharyya A, 1946, SANKHYA, V7, P401
[2] Chen Z., 2020, P EUR C COMP VIS ECC, P195
[3] Cheng G, 2022, Arxiv, DOI [arXiv:2110.01931, DOI 10.1109/TGRS.2022.3183022]
[4] Learning RoI Transformer for Oriented Object Detection in Aerial Images
Ding, Jian
Xue, Nan
Long, Yang
Xia, Gui-Song
Lu, Qikai
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2844 - 2853
[5] The Pascal Visual Object Classes (VOC) Challenge
Everingham, Mark
Van Gool, Luc
Williams, Christopher K. I.
Winn, John
Zisserman, Andrew
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) : 303 - 338
[6] Feng PM, 2020, INT CONF ACOUST SPEE, P4057, DOI 10.1109/ICASSP40776.2020.9053562
[7] Fast R-CNN
Girshick, Ross
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1440 - 1448
[8] Beyond Bounding-Box: Convex-hull Feature Adaptation for Oriented and Densely Packed Object Detection
Guo, Zonghao
Liu, Chang
Zhang, Xiaosong
Jiao, Jianbin
Ji, Xiangyang
Ye, Qixiang
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 8788 - 8797
[9] Align Deep Features for Oriented Object Detection
Han, Jiaming
Ding, Jian
Li, Jie
Xia, Gui-Song
[J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[10] ReDet: A Rotation-equivariant Detector for Aerial Object Detection
Han, Jiaming
Ding, Jian
Xue, Nan
Xia, Gui-Song
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 2785 - 2794

← 1 2 3 4 5 6 →