Direction Prediction Redefinition: Transfer Angle to Scale in Oriented Object Detection

被引:0
|
作者
Song, Beihang [1 ]
Li, Jing [1 ]
Wu, Jia [2 ]
Chang, Jun [1 ]
Wan, Jun [3 ,4 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Peoples R China
[2] Macquarie Univ, Fac Sci & Engn, Sch Comp, Sydney, NSW 2109, Australia
[3] Zhongnan Univ Econ & Law, Sch Informat & Safety Engn, Wuhan 430073, Peoples R China
[4] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
基金
中国国家自然科学基金;
关键词
Object detection; Detectors; Semantics; Circuits and systems; Accuracy; Feature extraction; Encoding; Arbitrary-oriented; object detection; aerial image; boundary problem;
D O I
10.1109/TCSVT.2024.3438431
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Oriented object detection has garnered significant attention. However, rotational symmetry and discontinuity at boundaries can confuse networks, leading to discontinuous loss and regression inconsistency. In this paper, we propose an efficient multi-directional object detection framework named Direction Prediction Redefinition (DPR). We describe the angle variation of rotated bounding boxes (B-r) as changes in the dimensions of horizontal bounding boxes (B-h). Specifically, we generate two sets of horizontal bounding boxes by predicting the center points of the corresponding boundaries within the rotated bounding box, thereby avoiding boundary issues caused by angle prediction. To further achieve robust rotated boundary representation, we propose the Joint Scale Representation method and the State Feature Encoding module, which are used to eliminate outliers in rotated boundaries and guide the correct selection of horizontal bounding box vertices, respectively. Moreover, we further abstract DPR as Multiple Trigonometric functions based DPR (DPR-MT). This method maps a single angle into four sets of trigonometric functions and considers them as the four sides of the horizontal bounding box. This approach predicts angles in the form of horizontal bounding boxes without complex operations, making it plug-and-play. Experimental results and visual analysis on challenging datasets further verify the effectiveness and competitiveness of our proposed method.
引用
收藏
页码:12894 / 12906
页数:13
相关论文
共 50 条
  • [21] Dual Appearance-Aware Enhancement for Oriented Object Detection
    Gong, Maoguo
    Zhao, Hongyu
    Wu, Yue
    Tang, Zedong
    Feng, Kai-Yuan
    Sheng, Kai
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 14
  • [22] WSODet: A Weakly Supervised Oriented Detector for Aerial Object Detection
    Tan, Zhiwen
    Jiang, Zhiguo
    Guo, Chen
    Zhang, Haopeng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [23] Learning Orientation-Aware Distances for Oriented Object Detection
    Rao, Chaofan
    Wang, Jiabao
    Cheng, Gong
    Xie, Xingxing
    Han, Junwei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [24] Bounding Box Projection for Regression Uncertainty in Oriented Object Detection
    Wu, Qian
    Xiang, Wangtao
    Tang, Rui
    Zhu, Jun
    IEEE ACCESS, 2021, 9 : 58768 - 58779
  • [25] Anchor-Free Oriented Proposal Generator for Object Detection
    Cheng, Gong
    Wang, Jiabao
    Li, Ke
    Xie, Xingxing
    Lang, Chunbo
    Yao, Yanqing
    Han, Junwei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [26] Bridging Multi-Scale Context-Aware Representation for Object Detection
    Wang, Boying
    Ji, Ruyi
    Zhang, Libo
    Wu, Yanjun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (05) : 2317 - 2329
  • [27] Negative-Core Sample Knowledge Distillation for Oriented Object Detection in Remote Sensing Image
    Zhang, Wenhui
    Zhang, Yidan
    Huang, Feilong
    Qi, Xiyu
    Wang, Lei
    Liu, Xiaoxuan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [28] HFSI-TF: Hierarchical Full-Scale Interactive Transformer Model for Object Detection in Remote Sensing Image
    Li, Daxiang
    Li, Bingying
    Liu, Ying
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
  • [29] PRA-Det: Anchor-Free Oriented Object Detection With Polar Radius Representation
    Dang, Min
    Liu, Gang
    Li, Hao
    Wang, Di
    Pan, Rong
    Wang, Quan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 145 - 157
  • [30] Progressive Critical Region Transfer for Cross-Domain Visual Object Detection
    Wang, Xiaowei
    Jiang, Peiwen
    Li, Yang
    Hu, Manjiang
    Gao, Ming
    Cao, Dongpu
    Ding, Rongjun
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (08) : 9427 - 9441