Monocular 3D Object Detection from Roadside Infrastructure

Cited by: 0
Authors
Huang, Delu [1]
Wen, Feng [1]
Affiliations
[1] Continental Holding China Co Ltd, Innovat & Technol Div, Shanghai 200082, Peoples R China
DOI
10.1109/IV55156.2024.10588725
Chinese Library Classification
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
The cooperative vehicle infrastructure system (CVIS) plays a crucial role in achieving fully autonomous driving. However, research on infrastructure-side monocular 3D object detection is challenging because the calibration parameters of cameras mounted on different infrastructures differ significantly. This discrepancy can create ambiguity for detection algorithms. To address this issue, our approach directly regresses the 8 vertices of the 3D bounding box at the image level to mitigate the impact of calibration parameters. During training and inference, our method does not need any calibration parameters. The 3D pose and position parameters are obtained in post-processing: we propose a simple post-processing algorithm that calculates the 3D parameters from the 8 image-level vertices. In addition, since the background seen from the infrastructure view remains unchanged, we propose using a Gaussian Mixture Model (GMM) branch to generate moving-objects-sensitive (MOS) features. This branch enhances the recognition of objects, which is why we term our method GMMNet. GMMNet achieves a high mean average precision (mAP) on the DAIR-V2X-I dataset, surpassing other state-of-the-art methods by a significant margin. Furthermore, GMMNet exhibits stronger generalization ability.
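To make the background-modelling idea in the abstract concrete, the following is a minimal sketch of per-pixel background subtraction for a fixed camera. It is not the authors' GMM branch: the record gives no implementation details, so this example assumes a single running Gaussian per pixel (a one-component simplification of a Gaussian mixture). The class name `RunningGaussianBackground` and the parameters `alpha`, `k`, and `init_var` are hypothetical and chosen only for illustration.

```python
import numpy as np


class RunningGaussianBackground:
    """Per-pixel single-Gaussian background model.

    Assumption: a one-component simplification of a GMM, used only to
    illustrate how a static background yields a moving-object mask.
    """

    def __init__(self, first_frame, alpha=0.05, k=2.5, init_var=15.0 ** 2):
        # Initialise the per-pixel mean from the first frame.
        self.mean = first_frame.astype(np.float64)
        # Start every pixel with the same (hypothetical) variance.
        self.var = np.full(first_frame.shape, init_var, dtype=np.float64)
        self.alpha = alpha  # learning rate for background pixels
        self.k = k          # threshold in standard deviations

    def apply(self, frame):
        """Return a boolean foreground mask and update the model."""
        frame = frame.astype(np.float64)
        diff = frame - self.mean
        # A pixel is foreground if it deviates by more than k sigma.
        foreground = diff ** 2 > (self.k ** 2) * self.var
        # Update mean and variance only where the pixel matched the background.
        rate = np.where(foreground, 0.0, self.alpha)
        self.mean += rate * diff
        self.var += rate * (diff ** 2 - self.var)
        return foreground


# Usage: a flat grey-scale background with one bright 2x2 "object".
background = np.full((6, 6), 100, dtype=np.uint8)
model = RunningGaussianBackground(background)
frame = background.copy()
frame[2:4, 2:4] = 200
mask = model.apply(frame)
print(int(mask.sum()))  # number of pixels flagged as moving
```

A full mixture model keeps several weighted Gaussians per pixel so that multi-modal backgrounds (for example swaying trees) are not flagged as moving. How GMMNet turns such a mask into MOS features is not described in this record.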
Pages: 1672-1677
Number of pages: 6
Related papers
50 in total
  • [21] Triangulation Learning Network: from Monocular to Stereo 3D Object Detection
    Qin, Zengyi
    Wang, Jinglu
    Lu, Yan
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7607 - 7615
  • [22] Monocular 3D Object Detection with Bounding Box Denoising in 3D by Perceiver
    Liu, Xianpeng
    Zheng, Ce
    Cheng, Kelvin
    Xue, Nan
    Qi, Guo-Jun
    Wu, Tianfu
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 6413 - 6423
  • [23] Progressive Coordinate Transforms for Monocular 3D Object Detection
    Wang, Li
    Zhang, Li
    Zhu, Yi
    Zhang, Zhi
    He, Tong
    Li, Mu
    Xue, Xiangyang
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [24] Exploring Geometric Consistency for Monocular 3D Object Detection
    Lian, Qing
    Ye, Botao
    Xu, Ruijia
    Yao, Weilong
    Zhang, Tong
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 1675 - 1684
  • [25] MonoSG: Monocular 3D Object Detection With Stereo Guidance
    Fan, Zhiwei
    Xu, Chao
    Chu, Minghang
    Huang, Yuling
    Ma, Yaoyao
    Wang, Jing
    Xu, Yishen
    Wu, Di
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (04): : 3604 - 3611
  • [26] Monocular 3D Object Detection With Motion Feature Distillation
    Hu, Henan
    Li, Muyu
    Zhu, Ming
    Gao, Wen
    Liu, Peiyu
    Chan, Kwok-Leung
    IEEE ACCESS, 2023, 11 : 82933 - 82945
  • [27] Monocular Object Detection Using 3D Geometric Primitives
    Carr, Peter
    Sheikh, Yaser
    Matthews, Iain
    COMPUTER VISION - ECCV 2012, PT I, 2012, 7572 : 864 - 878
  • [28] Dense-JANet for Monocular 3D Object Detection
    Shang, Xiaoqing
    Cheng, Zhiwei
    Shi, Su
    Cheng, Zhuanghao
    Huang, Hongcheng
    2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020,
  • [29] MonoCD: Monocular 3D Object Detection with Complementary Depths
    Yan, Longfei
    Yan, Pei
    Xiong, Shengzhou
    Xiang, Xuanyu
    Tan, Yihua
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 10248 - 10257
  • [30] Monocular 3D object detection for an indoor robot environment
    Kim, Jiwon
    Lee, GiJae
    Kim, Jun-Sik
    Kim, Hyunwoo J.
    Kim, KangGeon
    2020 29TH IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (RO-MAN), 2020, : 438 - 445