Unmanned Aerial Vehicles (UAVs) have recently emerged as a promising platform for acquiring high-resolution imagery in urban environments. Efficiently detecting cars in these images is vital for applications such as traffic management, urban planning, and security. However, the abundance of background features and the large variability in car appearance within high-resolution UAV images pose significant challenges. This paper introduces a multi-task framework that enhances car detection by focusing specifically on road regions. The model uses a shared encoder and a fully convolutional decoder, augmented by an attentive binary fusion module for road and car segmentation. For car detection, we combine deep layer aggregation with a CenterNet detection head. During training, UAV images are downsampled and passed through the encoder and both decoders to generate road/car confidence maps and car detection results. At inference, a region extraction module crops high-resolution road segments according to the road segmentation mask; to improve detection accuracy, it concatenates the input image with the car confidence map. We also introduce a scale-weighted focal loss to address the difficulty of detecting small cars in high-resolution UAV images. Experimental results on the UAVid2020 and VisDrone2020 datasets demonstrate that our model outperforms existing approaches in both inference time and accuracy, meeting the real-time requirements of intelligent traffic systems.
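The abstract does not give the form of the scale-weighted focal loss; the sketch below is one plausible interpretation, assuming a CenterNet-style heatmap focal loss in which each object's contribution is reweighted inversely with its box area so that small cars are emphasized. All parameter names and values (`alpha`, `beta`, `ref_area`, the clipping range) are illustrative assumptions, not the paper's actual formulation.

```python
import numpy as np

def scale_weighted_focal_loss(pred, target, box_areas,
                              alpha=2.0, beta=4.0, ref_area=1024.0):
    """Hypothetical scale-weighted focal loss sketch.

    pred, target : per-object center confidences (CenterNet-style heatmap
                   values, target == 1 at exact centers, soft elsewhere).
    box_areas    : pixel area of each object's bounding box.
    The weight grows as areas fall below ref_area, so small cars
    contribute more to the loss (weighting scheme is an assumption).
    """
    pred = np.clip(pred, 1e-6, 1 - 1e-6)
    pos = (target == 1).astype(float)
    # Standard CenterNet focal terms: positives at centers, penalty-reduced
    # negatives everywhere else.
    pos_loss = -((1 - pred) ** alpha) * np.log(pred) * pos
    neg_loss = -((1 - target) ** beta) * (pred ** alpha) \
               * np.log(1 - pred) * (1 - pos)
    # Assumed scale weight: sqrt(ref_area / area), clipped to [1, 4].
    w = np.clip(np.sqrt(ref_area / np.maximum(box_areas, 1.0)), 1.0, 4.0)
    n_pos = max(pos.sum(), 1.0)
    return float((w * (pos_loss + neg_loss)).sum() / n_pos)
```

Under this weighting, a small car (e.g. 10x10 px) with the same prediction error incurs a larger loss than a large one, which is the stated goal of the scale weighting.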