Multi-Task Learning for UAV Aerial Object Detection in Foggy Weather Condition

被引:20
作者
Fang, Wenxuan [1 ]
Zhang, Guoqing [1 ,2 ]
Zheng, Yuhui [1 ]
Chen, Yuwen [3 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Coll Comp Sci, Nanjing 210044, Peoples R China
[2] Massey Univ, Coll Math & Computat Sci, Auckland 0632, New Zealand
[3] Chinese Acad Sci, Chongqing Inst Green & Intelligent Technol, Chongqing 400714, Peoples R China
基金
中国国家自然科学基金;
关键词
UAV images; object detection; YOLO; foggy weather condition; NETWORK;
D O I
10.3390/rs15184617
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Adverse weather conditions such as haze and snowfall can degrade the quality of captured images and affect performance of drone detection. Therefore, it is challenging to locate and identify targets in adverse weather scenarios. In this paper, a novel model called Object Detection in a Foggy Condition with YOLO (ODFC-YOLO) is proposed, which performs image dehazing and object detection jointly by multi-task learning approach. Our model consists of a detection subnet and a dehazing subnet, which can be trained end-to-end to optimize both tasks. Specifically, we propose a Cross-Stage Partial Fusion Decoder (CSP-Decoder) in the dehazing subnet to recover clean features of encoder from complex weather conditions, thereby reducing the feature discrepancy between hazy and clean images, thus enhancing the feature consistency between different tasks. Additionally, to increase the feature modeling and representation capabilities of our network, we also propose an efficient Global Context Enhanced Extraction (GCEE) module to extract beneficial information from blurred images by constructing global feature context long-range dependencies. Furthermore, we propose a Correlation-Aware Aggregated Loss (CAALoss) to average noise patterns and tune gradient magnitudes across different tasks, accordingly implicitly enhancing data diversity and alleviating representation bias. Finally, we verify the advantages of our proposed model on both synthetic and real-world foggy datasets, and our ODFC-YOLO achieves the highest mAP on all datasets while achieving 36 FPS real-time detection speed.
引用
收藏
页数:18
相关论文
共 57 条
[1]   Machine Learning Inspired Sound-Based Amateur Drone Detection for Public Safety Applications [J].
Anwar, Muhammad Zohaib ;
Kaleem, Zeeshan ;
Jamalipour, Abbas .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (03) :2526-2534
[2]   Self-Guided Image Dehazing Using Progressive Feature Fusion [J].
Bai, Haoran ;
Pan, Jinshan ;
Xiang, Xinguang ;
Tang, Jinhui .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 :1217-1229
[3]   Combined RF-Based Drone Detection and Classification [J].
Basak, Sanjoy ;
Rajendran, Sreeraj ;
Pollin, Sofie ;
Scheers, Bart .
IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2022, 8 (01) :111-120
[4]  
Bochkovskiy A, 2020, Arxiv, DOI [arXiv:2004.10934, 10.48550/arXiv.2004.10934, DOI 10.48550/ARXIV.2004.10934]
[5]   High-Level Semantic Networks for Multi-Scale Object Detection [J].
Cao, Jiale ;
Pang, Yanwei ;
Zhao, Shengjie ;
Li, Xuelong .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (10) :3372-3386
[6]   End-to-End Object Detection with Transformers [J].
Carion, Nicolas ;
Massa, Francisco ;
Synnaeve, Gabriel ;
Usunier, Nicolas ;
Kirillov, Alexander ;
Zagoruyko, Sergey .
COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :213-229
[7]  
Chen C, 2019, AAAI CONF ARTIF INTE, P865
[8]   Gated Context Aggregation Network for Image Dehazing and Deraining [J].
Chen, Dongdong ;
He, Mingming ;
Fan, Qingnan ;
Liao, Jing ;
Zhang, Liheng ;
Hou, Dongdong ;
Yuan, Lu ;
Hua, Gang .
2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, :1375-1383
[9]   Domain Adaptive Faster R-CNN for Object Detection in the Wild [J].
Chen, Yuhua ;
Li, Wen ;
Sakaridis, Christos ;
Dai, Dengxin ;
Van Gool, Luc .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :3339-3348
[10]  
Ge Z, 2021, Arxiv, DOI [arXiv:2107.08430, 10.48550/arXiv.2107.08430]