A novel three-dimensional object detection with the modified You Only Look Once

Cited by: 12
Authors
Zhao, Xia [1]
Jia, Haihang [1]
Ni, Yingting [1]
Affiliations
[1] Tongji Univ, Coll Elect & Informat Engn, Shanghai, Peoples R China
Source
INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS | 2018, Vol. 15, No. 2
Keywords
Convolutional neural network; object detection; cluster box; coordinate transformation; 3D object bounding box
DOI
10.1177/1729881418765507
Chinese Library Classification number
TP24 [Robotics]
Discipline classification code
080202; 1405
Abstract
Three-dimensional object detection aims to produce a three-dimensional bounding box that encloses an object at its full extent. At present, three-dimensional object detection is mainly based on red, green, blue-depth (RGB-D) images. However, it remains an open problem because three-dimensional training data are difficult to label. In this article, we present a novel three-dimensional object detection method based on two-dimensional object detection, which takes only a set of RGB images as input. First, to meet the requirements of three-dimensional object detection and to address the low localization accuracy of You Only Look Once, a modified two-dimensional object detection method based on You Only Look Once is proposed. Then, three-dimensional geometric data are reconstructed from a set of images taken from different viewing angles. Next, the modified You Only Look Once method is used to obtain the two-dimensional object bounding boxes of the forward and side views. Finally, according to the transformation between two-dimensional pixel coordinates and three-dimensional space coordinates, the two-dimensional object bounding boxes are mapped onto the reconstructed three-dimensional scene to form the three-dimensional object box. Because this method needs only two-dimensional images to train the modified You Only Look Once model, it has a wide range of applications. The experimental results show that the modified You Only Look Once model improves localization accuracy and that our algorithm can effectively realize three-dimensional object detection without depth images.
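The pixel-to-space mapping mentioned in the abstract can be illustrated with a minimal sketch. It assumes a standard pinhole camera model, which the abstract does not specify; the names `Intrinsics`, `pixel_to_camera`, and `box_corners_to_camera` are hypothetical, and the depth value is assumed to come from the reconstructed scene. It is not the authors' implementation.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Intrinsics:
    """Assumed pinhole camera parameters (hypothetical, not from the paper)."""
    fx: float  # focal length along x, in pixels
    fy: float  # focal length along y, in pixels
    cx: float  # principal point x, in pixels
    cy: float  # principal point y, in pixels


def pixel_to_camera(u: float, v: float, depth: float,
                    k: Intrinsics) -> Tuple[float, float, float]:
    """Back-project pixel (u, v) at a known depth into camera coordinates."""
    x = (u - k.cx) * depth / k.fx
    y = (v - k.cy) * depth / k.fy
    return (x, y, depth)


def box_corners_to_camera(box: Tuple[float, float, float, float],
                          depth: float,
                          k: Intrinsics) -> List[Tuple[float, float, float]]:
    """Map the four corners of a 2D box (u_min, v_min, u_max, v_max)
    onto a plane at the given depth, assumed to be read from the
    reconstructed scene."""
    u_min, v_min, u_max, v_max = box
    corners = [(u_min, v_min), (u_max, v_min), (u_max, v_max), (u_min, v_max)]
    return [pixel_to_camera(u, v, depth, k) for u, v in corners]


if __name__ == "__main__":
    k = Intrinsics(fx=500.0, fy=500.0, cx=320.0, cy=240.0)
    for corner in box_corners_to_camera((220.0, 140.0, 420.0, 340.0), 2.0, k):
        print(corner)
```

Under these assumptions, a forward-view box would fix two axes of the object's extent and a side-view box would supply the third; how the two views are combined in the paper is not described in the abstract.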
Pages: 13
Related papers
50 in total
  • [41] Principal component analysis-based object detection/recognition chip for wireless interconnected three-dimensional integration
    Ando, Hiroshi
    Kameda, Seiji
    Arizono, Daisuke
    Fuchigami, Norimitsu
    Kaya, Kouta
    Sasaki, Mamoru
    Iwata, Atushi
    JAPANESE JOURNAL OF APPLIED PHYSICS, 2008, 47 (04) : 2746 - 2748
  • [42] Facial Expression Recognition-You Only Look Once-Neighborhood Coordinate Attention Mamba: Facial Expression Detection and Classification Based on Neighbor and Coordinates Attention Mechanism
    Peng, Cheng
    Sun, Mingqi
    Zou, Kun
    Zhang, Bowen
    Dai, Genan
    Tsoi, Ah Chung
    SENSORS, 2024, 24 (21)
  • [43] Pseudo-LiDAR With Two-Dimensional Instance for Monocular Three-Dimensional Object Tracking
    Gao, Rui
    Kim, Junoh
    Chu, Phuong Minh
    Cho, Kyungeun
    IEEE ACCESS, 2025, 13 : 45771 - 45783
  • [44] SSA3D: Semantic Segmentation Assisted One-Stage Three-Dimensional Vehicle Object Detection
    Huang, Shangfeng
    Cai, Guorong
    Wang, Zongyue
    Xia, Qiming
    Wang, Ruisheng
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (09) : 14764 - 14778
  • [45] SCA-YOLOv4: you only look once with squeeze-and-excitation, coordinate attention and adaptively spatial feature fusion
    Liu, Pengfei
    Wang, Qing
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (10) : 7093 - 7106
  • [46] NeXtFusion: Attention-Based Camera-Radar Fusion Network for Improved Three-Dimensional Object Detection and Tracking
    Kalgaonkar, Priyank
    El-Sharkawy, Mohamed
    FUTURE INTERNET, 2024, 16 (04)
  • [47] Maneuvering Target Detection Based on Three-Dimensional Coherent Integration
    Zhao, Langxu
    Tao, Haihong
    Chen, Weijia
    IEEE ACCESS, 2020, 8 : 188321 - 188334
  • [48] Distance Detection Method in Virtual Three-Dimensional Hoisting Environment
    An Jianqi
    Wang Wei
    Wu Min
    He Yong
    PROCEEDINGS OF THE 31ST CHINESE CONTROL CONFERENCE, 2012, : 7233 - 7238
  • [49] Three-Dimensional Point Cloud Object Detection Using Scene Appearance Consistency Among Multi-View Projection Directions
    Sugimura, Daisuke
    Yamazaki, Tomoaki
    Hamamoto, Takayuki
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (10) : 3345 - 3357
  • [50] Three-Dimensional Object Co-Localization From Mobile LiDAR Point Clouds
    Guo, Wenzhong
    Chen, Jiawei
    Wang, Weipeng
    Luo, Huan
    Wang, Shiping
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (04) : 1996 - 2007