A novel three-dimensional object detection with the modified You Only Look Once

被引：12

作者：

Zhao, Xia ^{[1
]}

Jia, Haihang ^{[1
]}

Ni, Yingting ^{[1
]}

机构：

[1] Tongji Univ, Coll Elect & Informat Engn, Shanghai, Peoples R China

来源：

INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS | 2018年 / 15卷 / 02期

关键词：

Convolutional neural network; object detection; cluster box; coordinate transformation; 3D object bounding box;

D O I：

10.1177/1729881418765507

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

Three-dimensional object detection aims to produce a three-dimensional bounding box of an object at its full extent. Nowadays, three-dimensional object detection is mainly based on red green blue-depth (RGB-D) images. However, it remains an open problem because of the difficulty in labeling for three-dimensional training data. In this article, we present a novel three-dimensional object detection method based on two-dimensional object detection, which only takes a set of RGB images as input. First, aiming at the requirement of three-dimensional object detection and the low location accuracy of You Only Look Once, a modified two-dimensional object detection method based on You Only Look Once is proposed. Then, using a set of images from different visual angles, three-dimensional geometric data are reconstructed. In addition, making use of the modified You Only Look Once method, the two-dimensional object bounding boxes of the forward and side views are obtained. Finally, according to the transformation between the two-dimensional pixel coordinate and the three-dimensional space coordinate, the two-dimensional object bounding box is mapped onto the reconstructed three-dimensional scene to form the three-dimensional object box. Because this method only needs the collection of two-dimensional images to train the modified You Only Look Once model, it has a wide range of applications. The experimental results show that the modified You Only Look Once model can improve the location accuracy, and our algorithm can effectively realize the three-dimensional object detection without depth images.

引用

页数：13

共 50 条

[21] Video you only look once: Overall temporal convolutions for action recognition
Jing, Longlong
Yang, Xiaodong
Tian, Yingli
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2018, 52 : 58 - 65
[22] Tunnel Lining Multi-Defect Detection Based on an Improved You Only Look Once Version 7 Algorithm
Song, Juan
He, Long-Xi
Long, Hui-Ping
IEEE ACCESS, 2023, 11 : 125171 - 125184
[23] RDD-YOLO: Road Damage Detection Algorithm Based on Improved You Only Look Once Version 8
Li, Yue
Yin, Chang
Lei, Yutian
Zhang, Jiale
Yan, Yiting
APPLIED SCIENCES-BASEL, 2024, 14 (08):
[24] You Only Look at Once for Real-Time and Generic Multi-Task
Wang, Jiayuan
Wu, Q. M. Jonathan
Zhang, Ning
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (09) : 12625 - 12637
[25] YOLOMM - You Only Look Once for Multi-modal Multi-tasking
Campos, Filipe
Cerqueira, Francisco Goncalves
Cruz, Ricardo P. M.
Cardoso, Jaime S.
PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2023, PT I, 2024, 14469 : 564 - 574
[26] Heavy Equipment Detection on Construction Sites Using You Only Look Once (YOLO-Version 10) with Transformer Architectures
Eum, Ikchul
Kim, Jaejun
Wang, Seunghyeon
Kim, Juhyung
APPLIED SCIENCES-BASEL, 2025, 15 (05):
[27] Three-Dimensional Mapping System for Environment and Object Detection on an Unmanned Aerial Vehicle
Ambata, Leonard
Francis Chavez, Adrian
Devnani, Naveen
Ingeniero, Joshua
Kyle Paggabao, Aidan
2019 IEEE 11TH INTERNATIONAL CONFERENCE ON HUMANOID, NANOTECHNOLOGY, INFORMATION TECHNOLOGY, COMMUNICATION AND CONTROL, ENVIRONMENT, AND MANAGEMENT (HNICEM), 2019,
[28] You Only Look at Interested Cells: Real-Time Object Detection Based on Cell-Wise Segmentation
Su, Kai
Wang, Huitao
Chowdhury, Intisar Md
Zhao, Qiangfu
Tomioka, Yoichi
2020 11TH INTERNATIONAL CONFERENCE ON AWARENESS SCIENCE AND TECHNOLOGY (ICAST), 2020,
[29] Edge-Triggered Three-Dimensional Object Detection Using a LiDAR Ring
Song, Eunji
Jeong, Seyoung
Hwang, Sung-Ho
SENSORS, 2024, 24 (06)
[30] Rapid identification of moldy peanuts based on three-dimensional hyperspectral object detection
Yang, Weiqiang
Liu, Chaoxian
Zeng, Shan
Duan, Xiangjun
Zhang, Chengyu
Tao, Wei
JOURNAL OF FOOD COMPOSITION AND ANALYSIS, 2024, 133

← 1 2 3 4 5 →