Hierarchical Mask Prompting and Robust Integrated Regression for Oriented Object Detection

被引:9
作者
Yao, Yanqing [1 ]
Cheng, Gong [1 ]
Lang, Chunbo [1 ]
Yuan, Xiang [1 ]
Xie, Xingxing [1 ]
Han, Junwei [1 ]
机构
[1] Northwestern Polytech Univ, Sch Automat, Xian 710129, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Semantics; Accuracy; Object detection; Loss measurement; Detectors; Remote sensing; Oriented object detector; remote sensing image; efficient oriented IoU loss; hierarchical mask prompting; robust integrated regression; semantic mask; SCENE TEXT DETECTION;
D O I
10.1109/TCSVT.2024.3444795
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Object detection in remote sensing images has garnered significant attention due to its wide applications in real-world scenarios. However, most existing oriented object detectors still suffer from complex backgrounds and varying angles, limiting their performance to further improvement. In this paper, we propose a novel oriented detector with Hierarchical mask prompting and Robust integrated regression, termed HRDet. Specifically, to cope with the first issue, we construct a hierarchical mask prompting module consisting of a semantic mask prediction branch and hierarchical Softmax technique. The former aims to isolate object instances from cluttered interferences guided by coarse box-wise masks, while the latter propagates differentiated features for adjacent layers using hierarchical attentive weights. To deal with the second issue, we strive for robust integrated regression and formulate an efficient oriented IoU loss, explicitly measuring the discrepancies of three geometric factors in oriented regression, i.e., the central point distance, side length, and angle. This innovative loss intends to overcome the problem that existing IoU-based losses are invariant during the regression of varying angles. We applied these two strategies to a simple one-stage detection pipeline, achieving a new level of trade-off between speed and accuracy. Extensive experiments on four large aerial imagery datasets, DOTA-v1.0, DOTA-v2.0, DIOR-R, and HRSC2016, demonstrate that our HRDet significantly improves the accuracy of the one-stage detector over refine-stage counterparts while maintaining the efficiency advantage. The source code will be available at https://github.com/yanqingyao1994/HRDet.
引用
收藏
页码:13071 / 13084
页数:14
相关论文
共 50 条
  • [31] A New Spatial-Oriented Object Detection Framework for Remote Sensing Images
    Yu, Dawen
    Ji, Shunping
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [32] LO-Det: Lightweight Oriented Object Detection in Remote Sensing Images
    Huang, Zhanchao
    Li, Wei
    Xia, Xiang-Gen
    Wang, Hao
    Jie, Feiran
    Tao, Ran
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [33] Hierarchical Feature Fusion Network for Salient Object Detection
    Li, Xuelong
    Song, Dawei
    Dong, Yongsheng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 (29) : 9165 - 9175
  • [34] AIoU: Adaptive bounding box regression for accurate oriented object detection
    Wen, Nu
    Guo, Renzhong
    Ma, Ding
    Ye, Xiang
    He, Biao
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (01) : 748 - 769
  • [35] Effective Rotate: Learning Rotation-Robust Prototype for Aerial Object Detection
    Wang, Chaowei
    Guo, Guangqian
    Liu, Chang
    Shao, Dian
    Gao, Shan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 14
  • [36] Oriented Object Detection for Remote Sensing Images via Object-Wise Rotation-Invariant Semantic Representation
    Zheng, Shangdong
    Wu, Zebin
    Du, Qian
    Xu, Yang
    Wei, Zhihui
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 15
  • [37] A Page Object Detection Method Based on Mask R-CNN
    Xu, Canhui
    Shi, Cao
    Bi, Hengyue
    Liu, Chuanqi
    Yuan, Yongfeng
    Guo, Haoyan
    Chen, Yinong
    IEEE ACCESS, 2021, 9 : 143448 - 143457
  • [38] Refined One-Stage Oriented Object Detection Method for Remote Sensing Images
    Hou, Liping
    Lu, Ke
    Xue, Jian
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 1545 - 1558
  • [39] Foreground Capture Feature Pyramid Network-Oriented Object Detection in Complex Backgrounds
    Han, Honggui
    Zhang, Qiyu
    Li, Fangyu
    Du, Yongping
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 15
  • [40] Adaptively Attentional Feature Fusion Oriented to Multiscale Object Detection in Remote Sensing Images
    Zhao, Wenqing
    Kang, Yijin
    Chen, Hao
    Zhao, Zhenhuan
    Zhao, Zhenbing
    Zhai, Yongjie
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72