Flow-Based Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection

Authors
Yu, Haibao [1, 2]
Tang, Yingjuan [2, 3]
Xie, Enze [1]
Mao, Jilei [2]
Luo, Ping [1, 4]
Nie, Zaiqing [2, 5]
Affiliations
[1] Univ Hong Kong, Hong Kong, Peoples R China
[2] Tsinghua Univ, Inst AI Ind Res AIR, Beijing, Peoples R China
[3] Beijing Inst Technol, Beijing, Peoples R China
[4] Shanghai AI Lab, Shanghai, Peoples R China
[5] AIR, Beijing, Peoples R China
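Funding
National Key Research and Development Program of China
Keywords
(none listed)
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract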
Cooperatively using both ego-vehicle and infrastructure sensor data can significantly enhance autonomous driving perception. However, uncertain temporal asynchrony and limited communication bandwidth can lead to fusion misalignment and constrain the exploitation of infrastructure data. To address these issues in vehicle-infrastructure cooperative 3D (VIC3D) object detection, we propose the Feature Flow Net (FFNet), a novel cooperative detection framework. FFNet is a flow-based feature fusion framework that uses a feature flow prediction module to predict future features and compensate for asynchrony. Instead of transmitting feature maps extracted from still images, FFNet transmits feature flow, leveraging the temporal coherence of sequential infrastructure frames. Furthermore, we introduce a self-supervised training approach that enables FFNet to generate feature flow with feature prediction ability from raw infrastructure sequences. Experimental results on the DAIR-V2X dataset demonstrate that the proposed method outperforms existing cooperative detection methods while requiring only about 1/100 of the transmission cost of raw data and handling all latencies with a single model. The code is available at https://github.com/haibao-yu/FFNet-VIC3D.
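
The idea of transmitting feature flow and predicting a future feature from it can be illustrated with a minimal sketch. It is not the FFNet implementation: it assumes that the flow is a per-element rate of change and that the receiver extrapolates linearly over the latency. FFNet generates its flow with a learned module, whereas this sketch substitutes a plain finite difference between two frames. The names `estimate_feature_flow` and `predict_feature`, the flattened-list feature representation, and all numeric values are hypothetical.

```python
from typing import List

# A flattened feature map; a stand-in for a real feature tensor (assumption).
Feature = List[float]


def estimate_feature_flow(prev: Feature, curr: Feature, dt: float) -> Feature:
    """Finite-difference rate of change between two consecutive features.

    Assumption: this replaces the learned flow generator described in the abstract.
    """
    if dt <= 0:
        raise ValueError("dt must be positive")
    if len(prev) != len(curr):
        raise ValueError("features must have the same length")
    return [(c - p) / dt for p, c in zip(prev, curr)]


def predict_feature(curr: Feature, flow: Feature, latency: float) -> Feature:
    """First-order extrapolation of the feature to the time `latency` later."""
    return [c + latency * f for c, f in zip(curr, flow)]


# Infrastructure side: two consecutive frames, 0.1 s apart (toy values).
f_prev = [0.0, 1.0, 2.0]
f_curr = [0.5, 1.0, 1.0]
flow = estimate_feature_flow(f_prev, f_curr, dt=0.1)

# Vehicle side: receive (f_curr, flow) and compensate for 0.2 s of latency.
predicted = predict_feature(f_curr, flow, latency=0.2)
```

Because the latency enters only as a multiplier at prediction time, the same transmitted pair can be extrapolated to any delay, which is one way to read the abstract's claim of handling all latencies with a single model.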
Pages: 11