Monocular 3D Object Detection from Roadside Infrastructure

被引:0
|
作者
Huang, Delu [1 ]
Wen, Feng [1 ]
机构
[1] Continental Holding China Co Ltd, Innovat & Technol Div, Shanghai 200082, Peoples R China
关键词
D O I
10.1109/IV55156.2024.10588725
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cooperative vehicle infrastructure system (CVIS) plays a crucial role in achieving fully autonomous driving. However, Conducting research on infrastructure-side monocular 3D object detection is challenging due to the significant discrepancy in calibration parameters of cameras mounted on different infrastructures. This discrepancy can create ambiguity for detection algorithms. To address this issue, our approach focuses on directly regress 8 vertices of 3D bounding box at image level to mitigate the impact of calibration parameters. During the training and inference process, our method do not need any calibration parameter. The 3D pose and position parameters are obtained after post-processing. We proposed a simple post-processing algorithm to calculate 3D parameters from 8 image-level vertices. And since background from the view of infrastructure remains unchanged, we propose using Gaussian Mixture Model (GMM) branch to generate moving-objects-sensitive (MOS) features. This approach enhances the recognition of objects, leading to our method being termed GMMNet. GMMNet achieves a high mean average precision (mAP) on the DAIR-V2X-I dataset, surpassing other start-of-the-art methods by a significant margin. Furthermore, GMMNet exhibits a greater generalization ability.
引用
收藏
页码:1672 / 1677
页数:6
相关论文
共 50 条
  • [1] Competition for roadside camera monocular 3D object detection
    Jia, Jinrang
    Shi, Yifeng
    Qu, Yuli
    Wang, Rui
    Xu, Xing
    Zhang, Hai
    NATIONAL SCIENCE REVIEW, 2023, 10 (06)
  • [2] Competition for roadside camera monocular 3D object detection
    Jinrang Jia
    Yifeng Shi
    Yuli Qu
    Rui Wang
    Xing Xu
    Hai Zhang
    NationalScienceReview, 2023, 10 (06) : 34 - 37
  • [3] RoadSense3D: A Framework for Roadside Monocular 3D Object Detection
    Carta, Salvatore
    Castrillon-Santana, Modesto
    Marras, Mirko
    Mohamed, Sondos
    Podda, Alessandro Sebastian
    Saia, Roberto
    Sau, Marco
    Zimmer, Walter
    ADJUNCT PROCEEDINGS OF THE 32ND ACM CONFERENCE ON USER MODELING, ADAPTATION AND PERSONALIZATION, UMAP 2024, 2024, : 452 - 459
  • [4] Investigating the Effectiveness of 3D Monocular Object Detection Methods for Roadside Scenarios
    Barra, Silvio
    Marras, Mirko
    Mohamed, Sondos
    Podda, Alessandro Sebastian
    Saia, Roberto
    39TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2024, 2024, : 221 - 223
  • [5] YOLOv7-3D: A Monocular 3D Traffic Object Detection Method from a Roadside Perspective
    Ye, Zixun
    Zhang, Hongying
    Gu, Jingliang
    Li, Xue
    APPLIED SCIENCES-BASEL, 2023, 13 (20):
  • [6] MonoGAE: Roadside Monocular 3D Object Detection With Ground-Aware Embeddings
    Yang, Lei
    Zhang, Xinyu
    Yu, Jiaxin
    Li, Jun
    Zhao, Tong
    Wang, Li
    Huang, Yi
    Zhang, Chuang
    Wang, Hong
    Li, Yiming
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (11) : 17587 - 17601
  • [7] 3D Visual Object Detection from Monocular Images
    Wang, Qiaosong
    Rasmussen, Christopher
    ADVANCES IN VISUAL COMPUTING, ISVC 2019, PT I, 2020, 11844 : 168 - 180
  • [8] Monocular 3D Object Detection with Depth from Motion
    Wang, Tai
    Pang, Jiangmiao
    Lin, Dahua
    COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 : 386 - 403
  • [9] Aerial Monocular 3D Object Detection
    Hu, Yue
    Fang, Shaoheng
    Xie, Weidi
    Chen, Siheng
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (04) : 1959 - 1966
  • [10] Disentangling Monocular 3D Object Detection
    Simonelli, Andrea
    Bulo, Samuel Rota
    Porzi, Lorenzo
    Lopez-Antequera, Manuel
    Kontschieder, Peter
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1991 - 1999