Multi-scale entropy neural architecture search for object detection in remote sensing images

被引:0
|
作者
Yang, Jun [1 ,2 ]
Xie, Hengjing [1 ]
Fan, Hongchao [3 ]
Yan, Haowen [1 ]
机构
[1] Faculty of Geomatics, Lanzhou Jiaotong University, Lanzhou
[2] School of Electronic and Information Engineering, Lanzhou Jiaotong University, Lanzhou
[3] Department of Civil and Environmental Engineering, Norwegian University of Science and Technology, Trondheim
来源
Cehui Xuebao/Acta Geodaetica et Cartographica Sinica | 2024年 / 53卷 / 07期
基金
中国国家自然科学基金;
关键词
feature separation convolution; maximum entropy; multi-scale entropy; neural architecture search; object detection; progressive evolution; remote sensing;
D O I
10.11947/j.AGCS.2024.20230455
中图分类号
学科分类号
摘要
Aiming at the traditional neural architecture search requires an enormous amount of time for supernet training, search efficiency is suboptimal, and the searched model can not efficiently solve the problem of multi-scale object detection difficulty and high background complexity in remote sensing images. This paper proposes a multi-scale entropy neural architecture search method for object detection in remote sensing images. At first, the feature separation convolution is added to the base block of the search space instead of the regular convolution in the residual block, which reduces the information redundancy in remote sensing images due to the high background complexity, and improves the detection performance of the network modeluner the complex background. Next, the maximum entropy principle is introduced to calculate the multi-scale entropy of each candidate network in the search space, and the multi-scale entropy is combined with the feature pyramid network to balance the detection of large, medium and small objects in remote sensing images. Finally, the network model with maximum multi-scale entropy is obtained by searching without parameter training using progressive evolutionary algorithm for the object detection task.The model ensures detection accuracy while improving the search efficiency. The proposed algorithm achieves a mean average precision of 93.1%, 75.5% and 73.6% on the RSOD, DIOR and DOTA datasets, respectively, with a network search time of 8.1 hours. The experimental results demonstrate that the proposed algorithm can significantly improve the search efficiency of the network, combine the features at different scales better and solve the problem of high image background complexity in the object detection task compared with the current benchmark methods. © 2024 SinoMaps Press. All rights reserved.
引用
收藏
页码:1384 / 1400
页数:16
相关论文
共 34 条
  • [1] ZHOU Peng, YANG Jun, Semantic segmentation of remote sensing images based on neural architecture search, Journal of Xidian University, 48, 5, pp. 47-57, (2021)
  • [2] CHEN Ding, WAN Gang, LI Ke, Object detection in optical remote sensing images based on combination of multi-layer feature and context information, Acta Geodaetica et Cartographica Sinica, 48, 10, pp. 1275-1284, (2019)
  • [3] CHEN Zhanlong, LI Shuangjiang, XU Yongyang, Et al., Correg-YOLOv3: a method for dense buildings detection in high-resolution remote sensing images, Journal of Geodesy and Geoinformation Science, 6, 2, pp. 51-61, (2023)
  • [4] YANG Jun, HAN Pengfei, Object detection of high-resolution remote sensing images by neural architecture search [J/OL], Journal of Jilin University (Engineering Edition), pp. 1-12
  • [5] GIRSHICK R, DONAHUE J, DARRELL T, Et al., Rich feature hierarchies for accurate object detection and semantic segmentation, Proceedings of 2014 IEEE Conference on Computer Vision and Pattern Recognition, pp. 580-587, (2014)
  • [6] ZOPH B, LE Q V., Neural architecture search with reinforcement learning
  • [7] PHAM H, GUAN M, ZOPH B, Et al., Efficient neural architecture search via parameters sharing, Proceedings of 2018 International Conference on Machine Learning, pp. 4095-4104, (2018)
  • [8] REAL E, AGGARWAL A, HUANG Y, Et al., Regularized evolution for image classifier architecture search, Proceedings of 2019 AAAI Conference on Artificial Intelligence, pp. 4780-4789, (2019)
  • [9] LIU H, SIMONYAN K, YANG Y., DARTS: differentiable architecture search
  • [10] GHIASI G, LIN T Y, LE Q V., NAS-FPN: learning scalable feature pyramid architecture for object detection, Proceedings of 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7036-7045, (2019)