Mask OBB: A Semantic Attention-Based Mask Oriented Bounding Box Representation for Multi-Category Object Detection in Aerial Images

Cited by: 144
Authors
Wang, Jinwang [1]
Ding, Jian [2]
Guo, Haowen [1]
Cheng, Wensheng [1]
Pan, Ting [1]
Yang, Wen [1, 2]
Affiliations
[1] Wuhan Univ, Sch Elect Informat, Wuhan 430072, Peoples R China
[2] Wuhan Univ, State Key Lab LIESMARS, Wuhan 430072, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
oriented object detection; aerial images; convolutional neural networks; vehicle detection;
DOI
10.3390/rs11242930
Chinese Library Classification
X [Environmental Science, Safety Science];
Discipline classification code
08 ; 0830 ;
Abstract
Object detection in aerial images is a fundamental yet challenging task in the remote sensing field. Because most objects in aerial images appear in arbitrary orientations, oriented bounding boxes (OBBs) have a clear advantage over traditional horizontal bounding boxes (HBBs). However, regression-based OBB detection methods suffer from ambiguity in the definition of their learning targets, which decreases detection accuracy. In this paper, we provide a comprehensive analysis of OBB representations and cast OBB regression as a pixel-level classification problem, which largely eliminates the ambiguity. The predicted masks are subsequently used to generate OBBs. To handle the large scale variations of objects in aerial images, an Inception Lateral Connection Network (ILCN) is used to enhance the Feature Pyramid Network (FPN). Furthermore, a Semantic Attention Network (SAN) is adopted to provide semantic features, which help distinguish objects of interest from cluttered backgrounds. Empirical studies show that the entire method is simple yet efficient. Experimental results on two widely used datasets, DOTA and HRSC2016, demonstrate that the proposed method outperforms state-of-the-art methods.
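The abstract states that predicted masks are used to generate OBBs but does not say how. One common way to do this, assumed here purely for illustration, is to take the minimum-area rectangle enclosing the mask's foreground pixels. The sketch below does that in plain Python with a convex hull followed by a search over hull edges; the function names `convex_hull`, `mask_to_obb`, and `polygon_area` are hypothetical and are not taken from the paper, and pixels are treated as points at their integer coordinates.

```python
import math


def convex_hull(points):
    """Andrew's monotone chain: hull vertices with collinear points dropped."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts

    def cross(o, a, b):
        return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

    lower, upper = [], []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    return lower[:-1] + upper[:-1]


def mask_to_obb(mask):
    """Return 4 corners (x, y) of a minimum-area rectangle around the mask.

    `mask` is a 2D list of 0/1 values indexed as mask[y][x]. Returns None
    for an empty mask. Assumption: this is an illustrative stand-in, not
    the paper's own post-processing.
    """
    points = [(x, y) for y, row in enumerate(mask) for x, v in enumerate(row) if v]
    if not points:
        return None
    hull = convex_hull(points)
    if len(hull) == 1:
        x, y = hull[0]
        return [(float(x), float(y))] * 4

    best = None
    n = len(hull)
    # A minimum-area enclosing rectangle has a side collinear with a hull edge,
    # so it is enough to try each edge direction.
    for i in range(n):
        (x0, y0), (x1, y1) = hull[i], hull[(i + 1) % n]
        length = math.hypot(x1 - x0, y1 - y0)
        ux, uy = (x1 - x0) / length, (y1 - y0) / length  # unit edge direction
        vx, vy = -uy, ux                                  # unit normal
        us = [px * ux + py * uy for px, py in hull]
        vs = [px * vx + py * vy for px, py in hull]
        area = (max(us) - min(us)) * (max(vs) - min(vs))
        if best is None or area < best[0]:
            best = (area, ux, uy, vx, vy, min(us), max(us), min(vs), max(vs))

    _, ux, uy, vx, vy, u0, u1, v0, v1 = best
    corners_uv = ((u0, v0), (u1, v0), (u1, v1), (u0, v1))
    return [(u * ux + v * vx, u * uy + v * vy) for u, v in corners_uv]


def polygon_area(corners):
    """Shoelace formula for a simple polygon given as a list of (x, y)."""
    n = len(corners)
    s = sum(corners[i][0] * corners[(i + 1) % n][1]
            - corners[(i + 1) % n][0] * corners[i][1] for i in range(n))
    return abs(s) / 2.0


# A diamond-shaped mask: the oriented box is tighter than the horizontal one.
diamond = [
    [0, 1, 0],
    [1, 1, 1],
    [0, 1, 0],
]
obb = mask_to_obb(diamond)
```

For the diamond mask, the horizontal bounding box over pixel coordinates spans 2 by 2 (area 4), while the oriented rectangle found here has area 2, which illustrates why OBBs fit rotated objects more tightly than HBBs.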
Pages: 21
Related papers
2 items in total
  • [1] Multi-Scale Feature Integrated Attention-Based Rotation Network for Object Detection in VHR Aerial Images
    Yang, Feng
    Li, Wentong
    Hu, Haiwei
    Li, Wanyi
    Wang, Peng
    SENSORS, 2020, 20 (06)
  • [2] Attention-Based Mean-Max Balance Assignment for Oriented Object Detection in Optical Remote Sensing Images
    Lin, Qifeng
    Chen, Nuo
    Huang, Haibin
    Zhu, Daoye
    Fu, Gang
    Chen, Chuanxi
    Yu, Yuanlong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63