A lightweight object detection method based on fine-grained information extraction and exchange in UAV aerial images

被引：0

作者：

Zhou, Liming

Zhao, Shuai

Li, Shilong

Wang, Yadi ^{[1
]}

Liu, Yang

Zuo, Xianyu

机构：

[1] Henan Univ, Henan Key Lab Big Data Anal & Proc, Kaifeng 475000, Peoples R China

来源：

KNOWLEDGE-BASED SYSTEMS | 2025年 / 315卷

关键词：

UAV aerial image object detection; Feature extraction; Feature aggregation; Feature recombination; NETWORKS;

D O I：

10.1016/j.knosys.2025.113253

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Objects in unmanned aerial vehicle (UAV) images are easily disturbed by complex backgrounds, and objects different positions in these images often have notable differences in size because of different shooting angles. To effectively detect objects, current mainstream methods improve the detection accuracy through complex iterative convolution operations and attention mechanisms. However, these methods not only improve the accuracy but also result in a high memory overhead and feature redundancy, which brings unbearable load pressure to the UAV platform. Therefore, to refine the multi-granularity object information of UAV aerial images via a lightweight manner and improve the precision of multi-scale object detection, we design an portable lightweight multi-scale UAV image object detection network (UAVDNet) based on MConvBottleNet. First, to overcome the challenge that the conventional convolution receptive field is unitary and easily loses the fine-grained information described above, we design a multifunctional convolution (MConv) module achieve multi-receptive field information aggregation and feature weighting through hierarchical mechanism. Second, we propose MConvBottleNet to simultaneously aggregate local and global information using residual connections and channel shuffling operations on the basis of the diverse information provided by MConv. Third, to effectively exploit the context information in high-level semantic feature maps and preserve the original fine-grained details to the maximum possible extent, we design an inter-layer cascaded information aggregation pooling (ICIAP) module, which, together with MConvBottleNet, constitutes the feature extraction network. Finally, we propose a fusion network based on the feature recombination and enhancement (FRE) module, denoted as FRENet, which can take advantage of the information-complementary characteristic different channel layers to obtain overall channel enhancement results and effectively improve the ability to detect multi-scale objects. Experiments on the VisDrone dataset show that UAVDNet achieves an average detection accuracy of 48.1% with only 4.4M parameters.

引用

页数：14

共 61 条

[1] [Anonymous], 2024, 2024 INT C IOT BAS C, P1013, DOI [10.1109/ICICNIS64247.2024.10823150, DOI 10.1109/ICICNIS64247.2024.10823150]
[2] Cascade R-CNN: High Quality Object Detection and Instance Segmentation
Cai, Zhaowei
Vasconcelos, Nuno
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (05) : 1483 - 1498
[3] Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks
Chen, Jierun
Kao, Shiu-Hong
He, Hao
Zhuo, Weipeng
Wen, Song
Lee, Chul-Ho
Chan, S. -H. Gary
[J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 12021 - 12031
[4] Hybrid Task Cascade for Instance Segmentation
Chen, Kai
Pang, Jiangmiao
Wang, Jiaqi
Xiong, Yu
Li, Xiaoxiao
Sun, Shuyang
Feng, Wansen
Liu, Ziwei
Shi, Jianping
Ouyang, Wanli
Loy, Chen Change
Lin, Dahua
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 4969 - 4978
[5] Adaptive meta-knowledge transfer network for few-shot object detection very high resolution remote sensing images
Chen, Xi
Jiang, Wanyue
Qi, Honggang
Liu, Min
Ma, Heping
Yu, Philip L. H.
Wen, Ying
Han, Zhen
Zhang, Shuqi
Cao, Guitao
[J]. INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2024, 127
[6] KD loss: Enhancing discriminability of features with kernel trick for object detection in VHR remote sensing images
Chen, Xi
Li, Liyue
Li, Zhihong
Liu, Min
Li, Qingli
Qi, Honggang
Ma, Dongliang
Wen, Ying
Cao, Guitao
Yu, Philip L. H.
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 129
[7] Coupled Global-Local object detection for large VHR aerial images
Chen, Xi
Wang, Chaojie
Li, Zhihong
Liu, Min
Li, Qingli
Qi, Honggang
Ma, Dongliang
Li, Zhiqiang
Wang, Yong
[J]. KNOWLEDGE-BASED SYSTEMS, 2023, 260
[8] An online continual object detector on VHR remote sensing images with class imbalance
Chen, Xi
Jiang, Jie
Li, Zhiqiang
Qi, Honggang
Li, Qingli
Liu, Jiapeng
Zheng, Laiwen
Liu, Min
Deng, Yongqiang
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 117
[9] VisDrone-DET2019: The Vision Meets Drone Object Detection in Image Challenge Results
Du, Dawei
Zhu, Pengfei
Wen, Longyin
Bian, Xiao
Ling, Haibin
Hu, Qinghua
Peng, Tao
Zheng, Jiayu
Wang, Xinyao
Zhang, Yue
Bo, Liefeng
Shi, Hailin
Zhu, Rui
Kumar, Aashish
Li, Aijin
Zinollayev, Almaz
Askergaliyev, Anuar
Schumann, Arne
Mao, Binjie
Lee, Byeongwon
Liu, Chang
Chen, Changrui
Pan, Chunhong
Huo, Chunlei
Yu, Da
Cong, Dechun
Zeng, Dening
Pailla, Dheeraj Reddy
Li, Di
Wang, Dong
Cho, Donghyeon
Zhang, Dongyu
Bai, Furui
Jose, George
Gao, Guangyu
Liu, Guizhong
Xiong, Haitao
Qi, Hao
Wang, Haoran
Qiu, Heqian
Li, Hongliang
Lu, Huchuan
Kim, Ildoo
Kim, Jaekyum
Shen, Jane
Lee, Jihoon
Ge, Jing
Xu, Jingjing
Zhou, Jingkai
Meier, Jonas
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 213 - 226
[10] The Unmanned Aerial Vehicle Benchmark: Object Detection and Tracking
Du, Dawei
Qi, Yuankai
Yu, Hongyang
Yang, Yifan
Duan, Kaiwen
Li, Guorong
Zhang, Weigang
Huang, Qingming
Tian, Qi
[J]. COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 375 - 391

← 1 2 3 4 5 6 7 →