A Multi-Task Framework for Car Detection From High-Resolution UAV Imagery Focusing on Road Regions

被引:13
作者
Hoanh, Nguyen [1 ,2 ]
Pham, Tran Vu [1 ]
机构
[1] Ho Chi Minh City Univ Technol HCMUT, Fac Comp Sci & Engn, VNU HCM, Ho Chi Minh City 72506, Vietnam
[2] Ind Univ Ho Chi Minh City, Fac Elect Engn Technol, Ho Chi Minh City 700000, Vietnam
关键词
Automobiles; Roads; Object detection; Autonomous aerial vehicles; Image segmentation; Accuracy; Training; Convolutional neural networks; deep learning; object detection; unmanned aerial vehicles; SEGMENTATION;
D O I
10.1109/TITS.2024.3432761
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Unmanned Aerial Vehicles (UAVs) have recently emerged as a promising platform for acquiring high-resolution imagery in urban environments. Efficiently detecting cars from these images is vital for various applications, including traffic management, urban planning, and security. However, the abundance of features and the large variability in the appearance of cars within high-resolution UAV images pose significant challenges. This paper introduces a novel multi-task framework designed to enhance car detection by focusing specifically on road regions. The model utilizes a shared encoder and a fully convolutional network decoder, augmented by an attentive binary fusion for road and car segmentation. For car detection, we combine a deep layer aggregation with a CenterNet detection head. During training, UAV images are downsampled, passed through the encoder and both decoders, generating road/car confidence maps and car detection results. In the inference phase, a region extraction module is designed to extract high-resolution road segments according to the road segmentation mask. To enhance detection accuracy, the region extraction module concatenates the input image with the car confidence map. We also introduce a scale-weighted focal loss in response to challenges associated with detecting smaller cars in high-resolution UAV images. Experimental results on the UAVid2020 and VisDrone2020 datasets demonstrate the superiority of our model in both inference time and accuracy, meeting the real-time requirements of intelligent traffic systems.
引用
收藏
页码:17160 / 17173
页数:14
相关论文
共 50 条
[31]   Road Damage Detection From Post-Disaster High-Resolution Remote Sensing Images Based on TLD Framework [J].
Zhao, Kang ;
Liu, Jingjing ;
Wang, Qingnan ;
Wu, Xianjun ;
Tu, Jihui .
IEEE ACCESS, 2022, 10 :43552-43561
[32]   MF-SRCDNet: Multi-feature fusion super-resolution building change detection framework for multi-sensor high-resolution remote sensing imagery [J].
Li, Shaochun ;
Wang, Yanjun ;
Cai, Hengfan ;
Lin, Yunhao ;
Wang, Mengjie ;
Teng, Fei .
INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2023, 119
[33]   Dual-Task Network for Road Extraction From High-Resolution Remote Sensing Images [J].
Lin, Yuzhun ;
Jin, Fei ;
Wang, Dandi ;
Wang, Shuxiang ;
Liu, Xiao .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 :66-78
[34]   Road Extraction in Mountainous Regions from High-Resolution Images Based on DSDNet and Terrain Optimization [J].
Xu, Zeyu ;
Shen, Zhanfeng ;
Li, Yang ;
Xia, Liegang ;
Wang, Haoyu ;
Li, Shuo ;
Jiao, Shuhui ;
Lei, Yating .
REMOTE SENSING, 2021, 13 (01) :1-19
[35]   Detection, Characterization, and Modeling Vegetation in Urban Areas From High-Resolution Aerial Imagery [J].
Iovan, Corina ;
Boldo, Didier ;
Cord, Matthieu .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2008, 1 (03) :206-213
[36]   Enhanced Task-Aware Spatial Disentanglement Head for Oil Tanks Detection in High-Resolution Optical Imagery [J].
Wang, Tong ;
Li, Ying .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[37]   Extracting road maps from high-resolution satellite imagery using refined DSE-LinkNet [J].
Das, Prativa ;
Chand, Satish .
CONNECTION SCIENCE, 2021, 33 (02) :278-295
[38]   Automated Building Detection from Airborne LiDAR and Very High-Resolution Aerial Imagery with Deep Neural Network [J].
Ojogbane, Sani Success ;
Mansor, Shattri ;
Kalantar, Bahareh ;
Bin Khuzaimah, Zailani ;
Shafri, Helmi Zulhaidi Mohd ;
Ueda, Naonori .
REMOTE SENSING, 2021, 13 (23)
[39]   AM-UNet: Road Network Extraction from high-resolution Aerial Imagery Using Attention-Based Convolutional Neural Network [J].
Soni, Yashwant ;
Meena, Uma ;
Mishra, Vikash Kumar ;
Soni, Pramod Kumar .
JOURNAL OF THE INDIAN SOCIETY OF REMOTE SENSING, 2025, 53 (01) :135-147
[40]   A Multifeature Fusion Framework Based on D-S Theory for Automatic Building Extraction From High-Resolution Remote Sensing Imagery [J].
Zhang, Xuedong ;
Li, Xing ;
Huang, Jian ;
Li, Erzhu ;
Liu, Wei ;
Zhang, Lianpeng .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 :11839-11856