A panoramic driving perception fusion algorithm based on multi-task learning

被引:0
|
作者
Wu, Weilin [1 ,2 ]
Liu, Chunquan [1 ]
Zheng, Haoran [3 ]
机构
[1] Guangxi Minzu Univ, Coll Elect Informat, Guangxi Appl Math Ctr, Nanning, Peoples R China
[2] Wuzhou Univ, Guangxi Postdoctoral Innovat Practice Base, Wuzhou, Peoples R China
[3] Univ Auckland, Fac Engn, Chem & Mat Engn, Auckland, New Zealand
来源
PLOS ONE | 2024年 / 19卷 / 06期
基金
中国国家自然科学基金;
关键词
NETWORKS;
D O I
10.1371/journal.pone.0304691
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
With the rapid development of intelligent connected vehicles, there is an increasing demand for hardware facilities and onboard systems of driver assistance systems. Currently, most vehicles are constrained by the hardware resources of onboard systems, which mainly process single-task and single-sensor data. This poses a significant challenge in achieving complex panoramic driving perception technology. While the panoramic driving perception algorithm YOLOP has achieved outstanding performance in multi-task processing, it suffers from poor adaptability of feature map pooling operations and loss of details during downsampling. To address these issues, this paper proposes a panoramic driving perception fusion algorithm based on multi-task learning. The model training involves the introduction of different loss functions and a series of processing steps for lidar point cloud data. Subsequently, the perception information from lidar and vision sensors is fused to achieve synchronized processing of multi-task and multi-sensor data, thereby effectively improving the performance and reliability of the panoramic driving perception system. To evaluate the performance of the proposed algorithm in multi-task processing, the BDD100K dataset is used. The results demonstrate that, compared to the YOLOP model, the multi-task learning network performs better in lane detection, drivable area detection, and vehicle detection tasks. Specifically, the lane detection accuracy improves by 11.6%, the mean Intersection over Union (mIoU) for drivable area detection increases by 2.1%, and the mean Average Precision at 50% IoU (mAP50) for vehicle detection improves by 3.7%.
引用
收藏
页数:27
相关论文
共 50 条
  • [1] Multi-task perception algorithm of autonomous driving based on temporal fusion
    Liu Z.-W.
    Fan S.-H.
    Qi M.-Y.
    Dong M.
    Wang P.
    Zhao X.-M.
    Jiaotong Yunshu Gongcheng Xuebao/Journal of Traffic and Transportation Engineering, 2021, 21 (04): : 223 - 234
  • [2] Illegal Parking Detection Based on Multi-Task Driving Perception
    Kuo, Li-Chia
    Lin, Huei-Yung
    2024 35TH IEEE INTELLIGENT VEHICLES SYMPOSIUM, IEEE IV 2024, 2024, : 1865 - 1870
  • [3] Autonomous Driving Multi-Task Perception Algorithm Based on Receptive-Field Attention Convolution
    Liu, Yunxiang
    Ma, Haili
    Zhu, Jianlin
    Zhang, Qing
    Jin, Qi
    Computer Engineering and Applications, 2024, 60 (20) : 133 - 141
  • [4] Multi-Task Environmental Perception Methods for Autonomous Driving
    Liu, Ri
    Yang, Shubin
    Tang, Wansha
    Yuan, Jie
    Chan, Qiqing
    Yang, Yunchuan
    SENSORS, 2024, 24 (17)
  • [5] Algorithm for Stereo Matching Based on Multi-Task Learning
    Wang Yufeng
    Wang Hongwei
    Liu Yu
    Yang Mingquan
    Quan Jicheng
    LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (04)
  • [6] A Dialogues Summarization Algorithm Based on Multi-task Learning
    Chen, Haowei
    Li, Chen
    Liang, Jiajing
    Tian, Lihua
    NEURAL PROCESSING LETTERS, 2024, 56 (03)
  • [7] Multi-Focus Image Fusion Algorithm Based on Multi-Task Learning and PS-ViT
    Wu, Qinghua
    Li, Weitong
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2024, E107D (11) : 1422 - 1432
  • [8] YOLOPX: Anchor-free multi-task learning network for panoptic driving perception
    Zhan, Jiao
    Luo, Yarong
    Guo, Chi
    Wu, Yejun
    Meng, Jiawei
    Liu, Jingnan
    PATTERN RECOGNITION, 2024, 148
  • [9] Adversarial Attacks on Multi-task Visual Perception for Autonomous Driving
    Sobh, Ibrahim
    Hamed, Ahmed
    Kumar, Varun Ravi
    Yogamani, Senthil
    JOURNAL OF IMAGING SCIENCE AND TECHNOLOGY, 2021, 65 (06)
  • [10] GDMNet: A Unified Multi-Task Network for Panoptic Driving Perception
    Liu, Yunxiang
    Ma, Haili
    Zhu, Jianlin
    Zhang, Qiangbo
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 80 (02): : 2963 - 2978