Hierarchical Mask Prompting and Robust Integrated Regression for Oriented Object Detection

被引:9
|
作者
Yao, Yanqing [1 ]
Cheng, Gong [1 ]
Lang, Chunbo [1 ]
Yuan, Xiang [1 ]
Xie, Xingxing [1 ]
Han, Junwei [1 ]
机构
[1] Northwestern Polytech Univ, Sch Automat, Xian 710129, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Semantics; Accuracy; Object detection; Loss measurement; Detectors; Remote sensing; Oriented object detector; remote sensing image; efficient oriented IoU loss; hierarchical mask prompting; robust integrated regression; semantic mask; SCENE TEXT DETECTION;
D O I
10.1109/TCSVT.2024.3444795
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Object detection in remote sensing images has garnered significant attention due to its wide applications in real-world scenarios. However, most existing oriented object detectors still suffer from complex backgrounds and varying angles, limiting their performance to further improvement. In this paper, we propose a novel oriented detector with Hierarchical mask prompting and Robust integrated regression, termed HRDet. Specifically, to cope with the first issue, we construct a hierarchical mask prompting module consisting of a semantic mask prediction branch and hierarchical Softmax technique. The former aims to isolate object instances from cluttered interferences guided by coarse box-wise masks, while the latter propagates differentiated features for adjacent layers using hierarchical attentive weights. To deal with the second issue, we strive for robust integrated regression and formulate an efficient oriented IoU loss, explicitly measuring the discrepancies of three geometric factors in oriented regression, i.e., the central point distance, side length, and angle. This innovative loss intends to overcome the problem that existing IoU-based losses are invariant during the regression of varying angles. We applied these two strategies to a simple one-stage detection pipeline, achieving a new level of trade-off between speed and accuracy. Extensive experiments on four large aerial imagery datasets, DOTA-v1.0, DOTA-v2.0, DIOR-R, and HRSC2016, demonstrate that our HRDet significantly improves the accuracy of the one-stage detector over refine-stage counterparts while maintaining the efficiency advantage. The source code will be available at https://github.com/yanqingyao1994/HRDet.
引用
收藏
页码:13071 / 13084
页数:14
相关论文
共 50 条
  • [21] Negative-Core Sample Knowledge Distillation for Oriented Object Detection in Remote Sensing Image
    Zhang, Wenhui
    Zhang, Yidan
    Huang, Feilong
    Qi, Xiyu
    Wang, Lei
    Liu, Xiaoxuan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [22] Small Object Detection in Remote Sensing Images Based on Redundant Feature Removal and Progressive Regression
    Yang, Yang
    Zang, Bingjie
    Song, Chunying
    Li, Beichen
    Lang, Yue
    Zhang, Wenyuan
    Huo, Peng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [23] A Robust and Efficient Multiscenario Object Detection Network for Edge Devices
    Chen, Zhihuan
    Luo, Aiwen
    Ding, Lin
    Zheng, Jialu
    Huang, Zunkai
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2025, 22
  • [24] Detection in Optical Remote Sensing Images of Transmission Tower Based on Oriented Object Detection
    Tan, Yuanpeng
    Jiao, Fei
    Mo, Wenhao
    Liu, Haiying
    Bai, Xiaojing
    Ma, Jiaxiu
    CSEE JOURNAL OF POWER AND ENERGY SYSTEMS, 2025, 11 (01): : 217 - 226
  • [25] Spatial Bi-Exploration for Robust Camouflaged Object Detection
    Zhang, Jialin
    Wang, Xiao
    Yuan, Xin
    Mu, Nan
    Wang, Zheng
    IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 1251 - 1255
  • [26] A Hierarchical Context Embedding Network for Object Detection in Remote Sensing Images
    Zhang, Ke
    Wu, Yulin
    Wang, Jingyu
    Wang, Qi
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [27] Hierarchical Context Features Embedding for Object Detection
    Qiu, Heqian
    Li, Hongliang
    Wu, Qingbo
    Meng, Fanman
    Xu, Linfeng
    Ngan, King Ngi
    Shi, Hengcan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (12) : 3039 - 3050
  • [28] FFN: Fountain Fusion Net for Arbitrary-Oriented Object Detection
    Zhang, Tianwei
    Sun, Xu
    Zhuang, Lina
    Dong, Xiaoyu
    Gao, Lianru
    Zhang, Bing
    Zheng, Ke
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [29] Feature Alignment FPN for Oriented Object Detection in Remote Sensing Images
    Li, Zhiqing
    Li, Erzhu
    Xu, Tianyu
    Samat, Alim
    Liu, Wei
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [30] Align Deep Features for Oriented Object Detection
    Han, Jiaming
    Ding, Jian
    Li, Jie
    Xia, Gui-Song
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60