A lightweight object detection method based on fine-grained information extraction and exchange in UAV aerial images

被引:0
作者
Zhou, Liming
Zhao, Shuai
Li, Shilong
Wang, Yadi [1 ]
Liu, Yang
Zuo, Xianyu
机构
[1] Henan Univ, Henan Key Lab Big Data Anal & Proc, Kaifeng 475000, Peoples R China
关键词
UAV aerial image object detection; Feature extraction; Feature aggregation; Feature recombination; NETWORKS;
D O I
10.1016/j.knosys.2025.113253
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Objects in unmanned aerial vehicle (UAV) images are easily disturbed by complex backgrounds, and objects different positions in these images often have notable differences in size because of different shooting angles. To effectively detect objects, current mainstream methods improve the detection accuracy through complex iterative convolution operations and attention mechanisms. However, these methods not only improve the accuracy but also result in a high memory overhead and feature redundancy, which brings unbearable load pressure to the UAV platform. Therefore, to refine the multi-granularity object information of UAV aerial images via a lightweight manner and improve the precision of multi-scale object detection, we design an portable lightweight multi-scale UAV image object detection network (UAVDNet) based on MConvBottleNet. First, to overcome the challenge that the conventional convolution receptive field is unitary and easily loses the fine-grained information described above, we design a multifunctional convolution (MConv) module achieve multi-receptive field information aggregation and feature weighting through hierarchical mechanism. Second, we propose MConvBottleNet to simultaneously aggregate local and global information using residual connections and channel shuffling operations on the basis of the diverse information provided by MConv. Third, to effectively exploit the context information in high-level semantic feature maps and preserve the original fine-grained details to the maximum possible extent, we design an inter-layer cascaded information aggregation pooling (ICIAP) module, which, together with MConvBottleNet, constitutes the feature extraction network. Finally, we propose a fusion network based on the feature recombination and enhancement (FRE) module, denoted as FRENet, which can take advantage of the information-complementary characteristic different channel layers to obtain overall channel enhancement results and effectively improve the ability to detect multi-scale objects. Experiments on the VisDrone dataset show that UAVDNet achieves an average detection accuracy of 48.1% with only 4.4M parameters.
引用
收藏
页数:14
相关论文
共 61 条
  • [1] [Anonymous], 2024, 2024 INT C IOT BAS C, P1013, DOI [10.1109/ICICNIS64247.2024.10823150, DOI 10.1109/ICICNIS64247.2024.10823150]
  • [2] Cascade R-CNN: High Quality Object Detection and Instance Segmentation
    Cai, Zhaowei
    Vasconcelos, Nuno
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (05) : 1483 - 1498
  • [3] Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks
    Chen, Jierun
    Kao, Shiu-Hong
    He, Hao
    Zhuo, Weipeng
    Wen, Song
    Lee, Chul-Ho
    Chan, S. -H. Gary
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 12021 - 12031
  • [4] Hybrid Task Cascade for Instance Segmentation
    Chen, Kai
    Pang, Jiangmiao
    Wang, Jiaqi
    Xiong, Yu
    Li, Xiaoxiao
    Sun, Shuyang
    Feng, Wansen
    Liu, Ziwei
    Shi, Jianping
    Ouyang, Wanli
    Loy, Chen Change
    Lin, Dahua
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 4969 - 4978
  • [5] Adaptive meta-knowledge transfer network for few-shot object detection very high resolution remote sensing images
    Chen, Xi
    Jiang, Wanyue
    Qi, Honggang
    Liu, Min
    Ma, Heping
    Yu, Philip L. H.
    Wen, Ying
    Han, Zhen
    Zhang, Shuqi
    Cao, Guitao
    [J]. INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2024, 127
  • [6] KD loss: Enhancing discriminability of features with kernel trick for object detection in VHR remote sensing images
    Chen, Xi
    Li, Liyue
    Li, Zhihong
    Liu, Min
    Li, Qingli
    Qi, Honggang
    Ma, Dongliang
    Wen, Ying
    Cao, Guitao
    Yu, Philip L. H.
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 129
  • [7] Coupled Global-Local object detection for large VHR aerial images
    Chen, Xi
    Wang, Chaojie
    Li, Zhihong
    Liu, Min
    Li, Qingli
    Qi, Honggang
    Ma, Dongliang
    Li, Zhiqiang
    Wang, Yong
    [J]. KNOWLEDGE-BASED SYSTEMS, 2023, 260
  • [8] An online continual object detector on VHR remote sensing images with class imbalance
    Chen, Xi
    Jiang, Jie
    Li, Zhiqiang
    Qi, Honggang
    Li, Qingli
    Liu, Jiapeng
    Zheng, Laiwen
    Liu, Min
    Deng, Yongqiang
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 117
  • [9] VisDrone-DET2019: The Vision Meets Drone Object Detection in Image Challenge Results
    Du, Dawei
    Zhu, Pengfei
    Wen, Longyin
    Bian, Xiao
    Ling, Haibin
    Hu, Qinghua
    Peng, Tao
    Zheng, Jiayu
    Wang, Xinyao
    Zhang, Yue
    Bo, Liefeng
    Shi, Hailin
    Zhu, Rui
    Kumar, Aashish
    Li, Aijin
    Zinollayev, Almaz
    Askergaliyev, Anuar
    Schumann, Arne
    Mao, Binjie
    Lee, Byeongwon
    Liu, Chang
    Chen, Changrui
    Pan, Chunhong
    Huo, Chunlei
    Yu, Da
    Cong, Dechun
    Zeng, Dening
    Pailla, Dheeraj Reddy
    Li, Di
    Wang, Dong
    Cho, Donghyeon
    Zhang, Dongyu
    Bai, Furui
    Jose, George
    Gao, Guangyu
    Liu, Guizhong
    Xiong, Haitao
    Qi, Hao
    Wang, Haoran
    Qiu, Heqian
    Li, Hongliang
    Lu, Huchuan
    Kim, Ildoo
    Kim, Jaekyum
    Shen, Jane
    Lee, Jihoon
    Ge, Jing
    Xu, Jingjing
    Zhou, Jingkai
    Meier, Jonas
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 213 - 226
  • [10] The Unmanned Aerial Vehicle Benchmark: Object Detection and Tracking
    Du, Dawei
    Qi, Yuankai
    Yu, Hongyang
    Yang, Yifan
    Duan, Kaiwen
    Li, Guorong
    Zhang, Weigang
    Huang, Qingming
    Tian, Qi
    [J]. COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 375 - 391