MS-IAF: Multi-Scale Information Augmentation Framework for Aircraft Detection

被引:8
|
作者
Zhao, Yuliang [1 ,2 ]
Li, Jian [1 ,2 ]
Li, Weishi [1 ,2 ]
Shan, Peng [1 ,2 ]
Wang, Xiaoai [1 ,2 ]
Li, Lianjiang [1 ,2 ]
Fu, Qiang [3 ]
机构
[1] Northeastern Univ, Sensor & Big Data Lab, Qinhuangdao 066000, Hebei, Peoples R China
[2] Hebei Key Lab Micronano Precis Opt Sensing & Meas, Qinhuangdao 066000, Hebei, Peoples R China
[3] Army Engn Univ PLA, Shijiazhuang Sch, Shijiazhuang 050003, Hebei, Peoples R China
基金
中国国家自然科学基金;
关键词
multi-scale; MS-IAF; ResNeSt-D; BFPCAR; object detection; remote sensing;
D O I
10.3390/rs14153696
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Aircrafts have been an important object of study in the field of multi-scale image object detection due to their important strategic role. However, the multi-scale detection of aircrafts and their key parts from remote sensing images can be a challenge, as images often present complex backgrounds and obscured conditions. Most of today's multi-scale datasets consist of independent objects and lack mixed annotations of aircrafts and their key parts. In this paper, we contribute a multi-scale aircraft dataset (AP-DATA) consisting of 7000 aircraft images that were taken in complex environments and obscured conditions. Our dataset includes mixed annotations of aircrafts and their key parts. We also present a multi-scale information augmentation framework (MS-IAF) to recognize multi-scale aircrafts and their key parts accurately. First, we propose a new deep convolutional module ResNeSt-D as the backbone, which stacks scattered attention in a multi-path manner and makes the receptive field more suitable for the object. Then, based on the combination of Faster R-CNN with ResNeSt-D, we propose a multi-scale feature fusion module called BFPCAR. BFPCAR overcomes the attention imbalance problem of the non-adjacent layers of the FPN module by reducing the loss of information between different layers and including more semantic features during information fusion. Based on AP-DATA, a dataset with three types of features, the average precision (AP) of MS-IAF reached 0.884, i.e., 2.67% higher than that of the original Faster R-CNN. The APs of these two modules were improved by 2.32% and 1.39%, respectively. The robustness of our proposed model was validated using the open sourced RSOD remote sensing image dataset, and the best accuracy was achieved.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Detection of influential nodes with multi-scale information
    王静恩
    刘三阳
    Ahmed Aljmiai
    白艺光
    Chinese Physics B, 2021, 30 (08) : 669 - 676
  • [2] Multi-scale Information Aggregation for Spoofing Detection
    Li, Changtao
    Wan, Yi
    Yang, Feiran
    Yang, Jun
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2024, 2024 (01):
  • [3] Detection of influential nodes with multi-scale information*
    Wang, Jing-En
    Liu, San-Yang
    Aljmiai, Ahmed
    Bai, Yi-Guang
    CHINESE PHYSICS B, 2021, 30 (08)
  • [4] Multi-scale information quantification framework in network information system environment
    Wang Yunhui
    Yan Xinyang
    Wang Kexiang
    SEVENTH SYMPOSIUM ON NOVEL PHOTOELECTRONIC DETECTION TECHNOLOGY AND APPLICATIONS, 2021, 11763
  • [5] Multi-scale Semantic Information Fusion for Object Detection
    Chen Hongkun
    Luo Huilan
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (07) : 2087 - 2095
  • [6] Grape clusters detection based on multi-scale feature fusion and augmentation
    Ma, Jinlin
    Xu, Silong
    Ma, Ziping
    Fu, Hong
    Lin, Baobao
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [7] Airfield Multi-Scale Object Detection for Visual Navigation in Civil Aircraft
    Zhang, Tao
    Zhang, Xuerui
    Chen, Yong
    Zhong, Kelin
    Luo, Qijun
    Shanghai Jiaotong Daxue Xuebao/Journal of Shanghai Jiaotong University, 2024, 58 (11): : 1816 - 1825
  • [8] A framework for multi-scale modelling
    Chopard, B.
    Borgdorff, Joris
    Hoekstra, A. G.
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2014, 372 (2021):
  • [9] Detection of cervical cell based on multi-scale spatial information
    Li, Gang
    Fan, Xinyu
    Xu, Chuanyun
    Lv, Pengfei
    Wang, Ru
    Ruan, Zihan
    Zhou, Zheng
    Zhang, Yang
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [10] Enhancing video pedestrian detection with tracking-information-aided framework and multi-scale feature optimization
    Sang, Haifeng
    Suo, Pengkai
    SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (03)