Data Fusion of Semantic and Depth Information in the Context of Object Detection

被引:0
作者
Abu Yusuf, Md [1 ]
Khan, Md Rezaul Karim [2 ]
Saha, Partha Pratim [2 ]
Rahaman, Mohammed Mahbubur [2 ]
机构
[1] Tech Univ Chemnitz, Fac Comp Sci, Chemnitz, Germany
[2] Maharishi Int Univ, Dept Comp Sci, Fairfield, IA USA
来源
2024 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT CYBER PHYSICAL SYSTEMS AND INTERNET OF THINGS, ICOICI 2024 | 2024年
关键词
Autonomous driving; Computer vision; Faster RCNN; Object detection; Stereo vision;
D O I
10.1109/ICOICI62503.2024.10696627
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Considerable study has already been conducted regarding autonomous driving in modern era. An autonomous driving system must be extremely good at detecting objects surrounding the car to ensure safety. In this paper, classification, and estimation of an object's (pedestrian) position (concerning an ego 3D coordinate system) are studied and the distance between the ego vehicle and the object in the context of autonomous driving is measured. To classify the object, faster Region-based Convolution Neural Network (R-CNN) with inception v2 is utilized. First, a network is trained with customized dataset to estimate the reference position of objects as well as the distance from the vehicle. From camera calibration to computing the distance, cutting-edge technologies of computer vision algorithms in a series of processes are applied to generate a 3D reference point of the region of interest. The foremost step in this process is generating a disparity map using the concept of stereo vision.
引用
收藏
页码:1124 / 1129
页数:6
相关论文
共 14 条
[1]   3D Objects Detection in an Autonomous Car Driving Problem [J].
Agafonov, Anton ;
Yumaganov, Alexander .
2020 VI INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND NANOTECHNOLOGY (IEEE ITNT-2020), 2020,
[2]  
Buelthoff H H., 1991, Sensor Fusion III: 3D Perception and Recognition
[3]   Deep Multi-modal Object Detection for Autonomous Driving [J].
Ennajar, Amal ;
Khouja, Nadia ;
Boutteau, Remi ;
Tlili, Fethi .
2021 18TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS & DEVICES (SSD), 2021, :7-11
[4]  
Harada K., 2019, IEEE INT C FIELD PRO
[5]  
Jin Y., 2023, IEEE INT C INT COMP
[6]   Improving Deep Multi-modal 3D Object Detection for Autonomous Driving [J].
Khamsehashari, Razieh ;
Schill, Kerstin .
2021 7TH INTERNATIONAL CONFERENCE ON AUTOMATION, ROBOTICS AND APPLICATIONS (ICARA 2021), 2021, :263-267
[7]  
McCrae S, 2020, IEEE IMAGE PROC, P2661, DOI [10.1109/icip40778.2020.9191134, 10.1109/ICIP40778.2020.9191134]
[8]  
Nishat Mirza Muntasir, 2022, Proceedings of the International Conference on Big Data, IoT, and Machine Learning: BIM 2021. Lecture Notes on Data Engineering and Communications Technologies (95), P473, DOI 10.1007/978-981-16-6636-0_36
[9]  
O'Shea K, 2015, Arxiv, DOI [arXiv:1511.08458, DOI 10.48550/ARXIV.1511.08458]
[10]  
Priya M.V., 2021, IEEE INT IND GEOSC R