A Multi-Task Framework for Car Detection From High-Resolution UAV Imagery Focusing on Road Regions

Cited by: 7
Authors
Hoanh, Nguyen [1, 2]
Pham, Tran Vu [1]
Affiliations
[1] Ho Chi Minh City Univ Technol HCMUT, Fac Comp Sci & Engn, VNU HCM, Ho Chi Minh City 72506, Vietnam
[2] Ind Univ Ho Chi Minh City, Fac Elect Engn Technol, Ho Chi Minh City 700000, Vietnam
Keywords
Automobiles; Roads; Object detection; Autonomous aerial vehicles; Image segmentation; Accuracy; Training; Convolutional neural networks; deep learning; unmanned aerial vehicles; SEGMENTATION
DOI
10.1109/TITS.2024.3432761
Chinese Library Classification (CLC) number
TU [Building Science]
Discipline classification code
0813
Abstract
Unmanned Aerial Vehicles (UAVs) have recently emerged as a promising platform for acquiring high-resolution imagery in urban environments. Efficiently detecting cars from these images is vital for various applications, including traffic management, urban planning, and security. However, the abundance of features and the large variability in the appearance of cars within high-resolution UAV images pose significant challenges. This paper introduces a novel multi-task framework designed to enhance car detection by focusing specifically on road regions. The model utilizes a shared encoder and a fully convolutional network decoder, augmented by an attentive binary fusion for road and car segmentation. For car detection, we combine a deep layer aggregation with a CenterNet detection head. During training, UAV images are downsampled, passed through the encoder and both decoders, generating road/car confidence maps and car detection results. In the inference phase, a region extraction module is designed to extract high-resolution road segments according to the road segmentation mask. To enhance detection accuracy, the region extraction module concatenates the input image with the car confidence map. We also introduce a scale-weighted focal loss in response to challenges associated with detecting smaller cars in high-resolution UAV images. Experimental results on the UAVid2020 and VisDrone2020 datasets demonstrate the superiority of our model in both inference time and accuracy, meeting the real-time requirements of intelligent traffic systems.
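The inference-time region extraction described in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not the authors' implementation: the function names (`upsample_nearest`, `road_bounding_box`, `extract_road_region`), the nearest-neighbour upsampling, the 0.5 threshold, and the use of a single bounding box around all road pixels are hypothetical choices. The abstract states only that high-resolution road segments are extracted according to the road segmentation mask and that the input image is concatenated with the car confidence map.

```python
from typing import List, Optional, Tuple

Grid = List[List[float]]
Box = Tuple[int, int, int, int]  # (top, left, bottom, right), bottom/right exclusive


def upsample_nearest(grid: Grid, factor: int) -> Grid:
    """Nearest-neighbour upsampling of a 2D grid by an integer factor (assumed scheme)."""
    out: Grid = []
    for row in grid:
        wide = [v for v in row for _ in range(factor)]
        out.extend(list(wide) for _ in range(factor))
    return out


def road_bounding_box(mask: Grid, threshold: float = 0.5) -> Optional[Box]:
    """Bounding box of all pixels whose road confidence exceeds the threshold."""
    rows = [r for r, row in enumerate(mask) if any(v > threshold for v in row)]
    if not rows:
        return None  # no road found in this image
    cols = [c for c in range(len(mask[0])) if any(row[c] > threshold for row in mask)]
    return rows[0], cols[0], rows[-1] + 1, cols[-1] + 1


def extract_road_region(image, road_mask_lowres: Grid, car_conf_lowres: Grid, factor: int):
    """Crop the full-resolution image to the road region and append the car
    confidence as an extra channel on every pixel.

    image: H x W nested list of [r, g, b] pixels.
    road_mask_lowres, car_conf_lowres: (H / factor) x (W / factor) maps
    predicted on the downsampled input.
    """
    road = upsample_nearest(road_mask_lowres, factor)
    conf = upsample_nearest(car_conf_lowres, factor)
    box = road_bounding_box(road)
    if box is None:
        return None
    top, left, bottom, right = box
    crop = [
        [list(image[y][x]) + [conf[y][x]] for x in range(left, right)]
        for y in range(top, bottom)
    ]
    return crop, box


if __name__ == "__main__":
    # Toy 4 x 4 image whose pixels encode their own coordinates.
    image = [[[y, x, 0] for x in range(4)] for y in range(4)]
    result = extract_road_region(image, [[0, 1], [0, 0]], [[0.1, 0.9], [0.0, 0.0]], factor=2)
    print(result)
```

In this sketch the detector would then run only on the returned four-channel crop rather than on the full frame, which is one plausible reading of how focusing on road regions could reduce inference cost.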
Pages: 17160 - 17173
Page count: 14
Related papers
50 in total
  • [1] YOLO-U: multi-task model for vehicle detection and road segmentation in UAV aerial imagery
    Zhao, Zhihong
    He, Peng
    EARTH SCIENCE INFORMATICS, 2024, 17 (04) : 3253 - 3269
  • [2] STONE PINE (Pinus Pinea L.) DETECTION FROM HIGH-RESOLUTION UAV IMAGERY USING DEEP LEARNING MODEL
    Yildirim, Esra
    Nazar, Mertcan
    Sefercik, Umut Gunes
    Kavzoglu, Taskin
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 441 - 444
  • [3] A MULTI-TASK DEEP LEARNING FRAMEWORK COUPLING SEMANTIC SEGMENTATION AND IMAGE RECONSTRUCTION FOR VERY HIGH RESOLUTION IMAGERY
    Papadomanolaki, Maria
    Karantzalos, Konstantinos
    Vakalopoulou, Maria
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 1069 - 1072
  • [4] Building footprint extraction and counting on very high-resolution satellite imagery using object detection deep learning framework
    Nurkarim, Wahidya
    Wijayanto, Arie Wahyu
    EARTH SCIENCE INFORMATICS, 2023, 16 (01) : 515 - 532
  • [5] Road Extraction From High Spatial Resolution Remote Sensing Image Based on Multi-Task Key Point Constraints
    Li, Xungen
    Zhang, Zhan
    Lv, Shuaishuai
    Pan, Mian
    Ma, Qi
    Yu, Haibin
    IEEE ACCESS, 2021, 9 : 95896 - 95910
  • [6] Analysis on Saliency Estimation Methods in High-Resolution Optical Remote Sensing Imagery for Multi-Scale Ship Detection
    Li, Zezhong
    You, Yanan
    Liu, Fang
    IEEE ACCESS, 2020, 8 : 194485 - 194496
  • [7] Delineation of agricultural fields using multi-task BsiNet from high-resolution satellite images
    Long, Jiang
    Li, Mengmeng
    Wang, Xiaoqin
    Stein, Alfred
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2022, 112
  • [8] A Deep Learning Approach to an Enhanced Building Footprint and Road Detection in High-Resolution Satellite Imagery
    Ayala, Christian
    Sesma, Ruben
    Aranda, Carlos
    Galar, Mikel
    REMOTE SENSING, 2021, 13 (16)
  • [9] Road Extraction from High-Resolution Remote Sensing Imagery Using Deep Learning
    Xu, Yongyang
    Xie, Zhong
    Feng, Yaxing
    Chen, Zhanlong
    REMOTE SENSING, 2018, 10 (09)
  • [10] Multiscale road centerlines extraction from high-resolution aerial imagery
    Liu, Ruyi
    Miao, Qiguang
    Song, Jianfeng
    Quan, Yining
    Li, Yunan
    Xu, Pengfei
    Dai, Jing
    NEUROCOMPUTING, 2019, 329 : 384 - 396