A novel three-dimensional object detection with the modified You Only Look Once

被引：12

作者：

Zhao, Xia ^{[1
]}

Jia, Haihang ^{[1
]}

Ni, Yingting ^{[1
]}

机构：

[1] Tongji Univ, Coll Elect & Informat Engn, Shanghai, Peoples R China

来源：

INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS | 2018年 / 15卷 / 02期

关键词：

Convolutional neural network; object detection; cluster box; coordinate transformation; 3D object bounding box;

D O I：

10.1177/1729881418765507

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

Three-dimensional object detection aims to produce a three-dimensional bounding box of an object at its full extent. Nowadays, three-dimensional object detection is mainly based on red green blue-depth (RGB-D) images. However, it remains an open problem because of the difficulty in labeling for three-dimensional training data. In this article, we present a novel three-dimensional object detection method based on two-dimensional object detection, which only takes a set of RGB images as input. First, aiming at the requirement of three-dimensional object detection and the low location accuracy of You Only Look Once, a modified two-dimensional object detection method based on You Only Look Once is proposed. Then, using a set of images from different visual angles, three-dimensional geometric data are reconstructed. In addition, making use of the modified You Only Look Once method, the two-dimensional object bounding boxes of the forward and side views are obtained. Finally, according to the transformation between the two-dimensional pixel coordinate and the three-dimensional space coordinate, the two-dimensional object bounding box is mapped onto the reconstructed three-dimensional scene to form the three-dimensional object box. Because this method only needs the collection of two-dimensional images to train the modified You Only Look Once model, it has a wide range of applications. The experimental results show that the modified You Only Look Once model can improve the location accuracy, and our algorithm can effectively realize the three-dimensional object detection without depth images.

引用

页数：13

共 50 条

[41] Principal component analysis-based object detection/recognition chip for wireless interconnected three-dimensional integration
Ando, Hiroshi
Kameda, Seiji
Arizono, Daisuke
Fuchigami, Norimitsu
Kaya, Kouta
Sasaki, Mamoru
Iwata, Atushi
JAPANESE JOURNAL OF APPLIED PHYSICS, 2008, 47 (04) : 2746 - 2748
[42] Facial Expression Recognition-You Only Look Once-Neighborhood Coordinate Attention Mamba: Facial Expression Detection and Classification Based on Neighbor and Coordinates Attention Mechanism
Peng, Cheng
Sun, Mingqi
Zou, Kun
Zhang, Bowen
Dai, Genan
Tsoi, Ah Chung
SENSORS, 2024, 24 (21)
[43] Pseudo-LiDAR With Two-Dimensional Instance for Monocular Three-Dimensional Object Tracking
Gao, Rui
Kim, Junoh
Chu, Phuong Minh
Cho, Kyungeun
IEEE ACCESS, 2025, 13 : 45771 - 45783
[44] SSA3D: Semantic Segmentation Assisted One-Stage Three-Dimensional Vehicle Object Detection
Huang, Shangfeng
Cai, Guorong
Wang, Zongyue
Xia, Qiming
Wang, Ruisheng
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (09) : 14764 - 14778
[45] SCA-YOLOv4: you only look once with squeeze-and-excitation, coordinate attention and adaptively spatial feature fusion
Liu, Pengfei
Wang, Qing
SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (10) : 7093 - 7106
[46] NeXtFusion: Attention-Based Camera-Radar Fusion Network for Improved Three-Dimensional Object Detection and Tracking
Kalgaonkar, Priyank
El-Sharkawy, Mohamed
FUTURE INTERNET, 2024, 16 (04)
[47] Maneuvering Target Detection Based on Three-Dimensional Coherent Integration
Zhao, Langxu
Tao, Haihong
Chen, Weijia
IEEE ACCESS, 2020, 8 : 188321 - 188334
[48] Distance Detection Method in Virtual Three-Dimensional Hoisting Environment
An Jianqi
Wang Wei
Wu Min
He Yong
PROCEEDINGS OF THE 31ST CHINESE CONTROL CONFERENCE, 2012, : 7233 - 7238
[49] Three-Dimensional Point Cloud Object Detection Using Scene Appearance Consistency Among Multi-View Projection Directions
Sugimura, Daisuke
Yamazaki, Tomoaki
Hamamoto, Takayuki
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (10) : 3345 - 3357
[50] Three-Dimensional Object Co-Localization From Mobile LiDAR Point Clouds
Guo, Wenzhong
Chen, Jiawei
Wang, Weipeng
Luo, Huan
Wang, Shiping
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (04) : 1996 - 2007

← 1 2 3 4 5 →