Monocular 3D Object Detection from Roadside Infrastructure

被引：0

作者：

Huang, Delu ^{[1
]}

Wen, Feng ^{[1
]}

机构：

[1] Continental Holding China Co Ltd, Innovat & Technol Div, Shanghai 200082, Peoples R China

来源：

2024 35TH IEEE INTELLIGENT VEHICLES SYMPOSIUM, IEEE IV 2024 | 2024年

关键词：

D O I：

10.1109/IV55156.2024.10588725

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Cooperative vehicle infrastructure system (CVIS) plays a crucial role in achieving fully autonomous driving. However, Conducting research on infrastructure-side monocular 3D object detection is challenging due to the significant discrepancy in calibration parameters of cameras mounted on different infrastructures. This discrepancy can create ambiguity for detection algorithms. To address this issue, our approach focuses on directly regress 8 vertices of 3D bounding box at image level to mitigate the impact of calibration parameters. During the training and inference process, our method do not need any calibration parameter. The 3D pose and position parameters are obtained after post-processing. We proposed a simple post-processing algorithm to calculate 3D parameters from 8 image-level vertices. And since background from the view of infrastructure remains unchanged, we propose using Gaussian Mixture Model (GMM) branch to generate moving-objects-sensitive (MOS) features. This approach enhances the recognition of objects, leading to our method being termed GMMNet. GMMNet achieves a high mean average precision (mAP) on the DAIR-V2X-I dataset, surpassing other start-of-the-art methods by a significant margin. Furthermore, GMMNet exhibits a greater generalization ability.

引用

页码：1672 / 1677

页数：6

共 50 条

[1] Competition for roadside camera monocular 3D object detection
Jia, Jinrang
Shi, Yifeng
Qu, Yuli
Wang, Rui
Xu, Xing
Zhang, Hai
NATIONAL SCIENCE REVIEW, 2023, 10 (06)
[2] Competition for roadside camera monocular 3D object detection
Jinrang Jia
Yifeng Shi
Yuli Qu
Rui Wang
Xing Xu
Hai Zhang
NationalScienceReview, 2023, 10 (06) : 34 - 37
[3] RoadSense3D: A Framework for Roadside Monocular 3D Object Detection
Carta, Salvatore
Castrillon-Santana, Modesto
Marras, Mirko
Mohamed, Sondos
Podda, Alessandro Sebastian
Saia, Roberto
Sau, Marco
Zimmer, Walter
ADJUNCT PROCEEDINGS OF THE 32ND ACM CONFERENCE ON USER MODELING, ADAPTATION AND PERSONALIZATION, UMAP 2024, 2024, : 452 - 459
[4] Investigating the Effectiveness of 3D Monocular Object Detection Methods for Roadside Scenarios
Barra, Silvio
Marras, Mirko
Mohamed, Sondos
Podda, Alessandro Sebastian
Saia, Roberto
39TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2024, 2024, : 221 - 223
[5] YOLOv7-3D: A Monocular 3D Traffic Object Detection Method from a Roadside Perspective
Ye, Zixun
Zhang, Hongying
Gu, Jingliang
Li, Xue
APPLIED SCIENCES-BASEL, 2023, 13 (20):
[6] MonoGAE: Roadside Monocular 3D Object Detection With Ground-Aware Embeddings
Yang, Lei
Zhang, Xinyu
Yu, Jiaxin
Li, Jun
Zhao, Tong
Wang, Li
Huang, Yi
Zhang, Chuang
Wang, Hong
Li, Yiming
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (11) : 17587 - 17601
[7] 3D Visual Object Detection from Monocular Images
Wang, Qiaosong
Rasmussen, Christopher
ADVANCES IN VISUAL COMPUTING, ISVC 2019, PT I, 2020, 11844 : 168 - 180
[8] Monocular 3D Object Detection with Depth from Motion
Wang, Tai
Pang, Jiangmiao
Lin, Dahua
COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 : 386 - 403
[9] Aerial Monocular 3D Object Detection
Hu, Yue
Fang, Shaoheng
Xie, Weidi
Chen, Siheng
IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (04) : 1959 - 1966
[10] Disentangling Monocular 3D Object Detection
Simonelli, Andrea
Bulo, Samuel Rota
Porzi, Lorenzo
Lopez-Antequera, Manuel
Kontschieder, Peter
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1991 - 1999

← 1 2 3 4 5 →