Hierarchical Mask Prompting and Robust Integrated Regression for Oriented Object Detection

Cited by: 9
Authors
Yao, Yanqing [1]
Cheng, Gong [1]
Lang, Chunbo [1]
Yuan, Xiang [1]
Xie, Xingxing [1]
Han, Junwei [1]
Affiliations
[1] Northwestern Polytech Univ, Sch Automat, Xian 710129, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Feature extraction; Semantics; Accuracy; Object detection; Loss measurement; Detectors; Remote sensing; Oriented object detector; remote sensing image; efficient oriented IoU loss; hierarchical mask prompting; robust integrated regression; semantic mask; SCENE TEXT DETECTION;
DOI
10.1109/TCSVT.2024.3444795
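Chinese Library Classification
TM [Electrical Engineering]; TN [Electronics and Communication Technology]
Discipline classification code
0808; 0809
Abstract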
Object detection in remote sensing images has garnered significant attention due to its wide applications in real-world scenarios. However, most existing oriented object detectors still suffer from complex backgrounds and varying angles, which limits further performance gains. In this paper, we propose a novel oriented detector with Hierarchical mask prompting and Robust integrated regression, termed HRDet. Specifically, to cope with the first issue, we construct a hierarchical mask prompting module consisting of a semantic mask prediction branch and a hierarchical Softmax technique. The former aims to isolate object instances from cluttered interference, guided by coarse box-wise masks, while the latter propagates differentiated features to adjacent layers using hierarchical attentive weights. To deal with the second issue, we strive for robust integrated regression and formulate an efficient oriented IoU loss that explicitly measures the discrepancies of three geometric factors in oriented regression, i.e., the central point distance, side length, and angle. This loss is intended to overcome the problem that existing IoU-based losses are invariant during the regression of varying angles. We apply these two strategies to a simple one-stage detection pipeline, achieving a new trade-off between speed and accuracy. Extensive experiments on four large aerial imagery datasets, DOTA-v1.0, DOTA-v2.0, DIOR-R, and HRSC2016, demonstrate that our HRDet significantly improves the accuracy of the one-stage detector over refine-stage counterparts while maintaining the efficiency advantage. The source code will be available at https://github.com/yanqingyao1994/HRDet.
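To make the three geometric factors named in the abstract concrete, the following is a minimal Python sketch of how penalty terms for center distance, side length, and angle could be computed for a pair of oriented boxes. It is not the loss defined in the paper: the function name `geometric_penalties`, the `(cx, cy, w, h, theta)` box layout, the normalisation by the target diagonal, and the `sin^2` angle term are all assumptions chosen for illustration.

```python
import math


def geometric_penalties(pred, target):
    """Return (center, side, angle) penalty terms for two oriented boxes.

    Each box is (cx, cy, w, h, theta) with theta in radians.
    Hypothetical illustration only; not the formulation used by HRDet.
    """
    px, py, pw, ph, pt = pred
    tx, ty, tw, th, tt = target

    # Center term: squared center distance, normalised by the squared
    # diagonal of the target box (an assumed normalisation).
    center = ((px - tx) ** 2 + (py - ty) ** 2) / (tw * tw + th * th)

    # Side-length term: squared relative error of width and height.
    side = ((pw - tw) / tw) ** 2 + ((ph - th) / th) ** 2

    # Angle term: sin^2 has period pi, matching the fact that a rectangle
    # rotated by pi coincides with itself.
    angle = math.sin(pt - tt) ** 2

    return center, side, angle
```

For example, two boxes that share the same center and side lengths but differ in angle by pi/2 yield zero center and side terms and an angle term of 1, so the angular discrepancy is penalised explicitly instead of being left to the overlap measure alone.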
Pages: 13071-13084
Page count: 14