CoFormerNet: A Transformer-Based Fusion Approach for Enhanced Vehicle-Infrastructure Cooperative Perception

被引:0
|
作者
Li, Bin [1 ]
Zhao, Yanan [2 ]
Tan, Huachun [1 ,3 ,4 ]
机构
[1] Southeast Univ, Sch Transportat, Nanjing 211189, Peoples R China
[2] Beijing Inst Technol, Sch Mech Engn, Beijing 100081, Peoples R China
[3] Beijing Inst Technol, Dept Transportat Engn, Zhuhai 519088, Peoples R China
[4] Univ Elect Sci & Technol China, Shenzhen Inst Adv Study, ShenSi Lab, Shenzhen 518110, Peoples R China
关键词
V2X; cooperative perception; 3D LiDAR object detection; LIDAR;
D O I
10.3390/s24134101
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Vehicle-infrastructure cooperative perception is becoming increasingly crucial for autonomous driving systems and involves leveraging infrastructure's broader spatial perspective and computational resources. This paper introduces CoFormerNet, which is a novel framework for improving cooperative perception. CoFormerNet employs a consistent structure for both vehicle and infrastructure branches, integrating the temporal aggregation module and spatial-modulated cross-attention to fuse intermediate features at two distinct stages. This design effectively handles communication delays and spatial misalignment. Experimental results using the DAIR-V2X and V2XSet datasets demonstrated that CoFormerNet significantly outperformed the existing methods, achieving state-of-the-art performance in 3D object detection.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] ViT-FuseNet: Multimodal Fusion of Vision Transformer for Vehicle-Infrastructure Cooperative Perception
    Zhou, Yang
    Yang, Cai
    Wang, Ping
    Wang, Chao
    Wang, Xinhong
    Van, Nguyen Ngoc
    IEEE ACCESS, 2024, 12 : 31640 - 31651
  • [2] A method of vehicle-infrastructure cooperative perception based vehicle state information fusion using improved kalman filter
    Mo, Yanghui
    Zhang, Peilin
    Chen, Zhijun
    Ran, Bin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (04) : 4603 - 4620
  • [3] A method of vehicle-infrastructure cooperative perception based vehicle state information fusion using improved kalman filter
    Yanghui Mo
    Peilin Zhang
    Zhijun Chen
    Bin Ran
    Multimedia Tools and Applications, 2022, 81 : 4603 - 4620
  • [4] Object Perceptibility Prediction for Transmission Load Reduction in Vehicle-Infrastructure Cooperative Perception
    Lv, Pin
    Han, Jinlei
    He, Yuebin
    Xu, Jia
    Li, Taoshen
    SENSORS, 2022, 22 (11)
  • [5] A software framework for vehicle-infrastructure cooperative applications
    Bengochea, S
    Talamona, A
    Parent, M
    2005 IEEE Intelligent Transportation Systems Conference (ITSC), 2005, : 797 - 800
  • [6] FETR: Feature Transformer for vehicle-infrastructure cooperative 3D object detection
    Yan, Wenchao
    Cao, Hua
    Chen, Jiazhong
    Wu, Tao
    NEUROCOMPUTING, 2024, 600
  • [7] Traffic wave model based on vehicle-infrastructure cooperative and vehicle communication data
    Yuan, Huazhi
    Zhang, Hongjia
    Liu, Xuelian
    Jiao, Xinlong
    COMPUTATIONAL INTELLIGENCE, 2020, 36 (04) : 1755 - 1772
  • [8] Flow-Based Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection
    Yu, Haibao
    Tang, Yingjuan
    Xie, Enze
    Mao, Jilei
    Luo, Ping
    Nie, Zaiqing
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [9] Re-identification Algorithm for Cooperative Vehicle-infrastructure System Based on Multi-information Fusion
    Qian, Min
    Geng, Keke
    Yin, Guodong
    Li, Shangjie
    Wang, Ziwei
    Sun, Yuxiao
    Jixie Gongcheng Xuebao/Journal of Mechanical Engineering, 2024, 60 (16): : 231 - 240
  • [10] Cooperative vehicle-infrastructure localization based on the symmetric measurement equation filter
    Feihu Zhang
    Gereon Hinz
    Dhiraj Gulati
    Daniel Clarke
    Alois Knoll
    GeoInformatica, 2016, 20 : 159 - 178