A novel three-dimensional object detection with the modified You Only Look Once

Cited by: 12
Authors
Zhao, Xia [1]
Jia, Haihang [1]
Ni, Yingting [1]
Affiliations
[1] Tongji Univ, Coll Elect & Informat Engn, Shanghai, Peoples R China
Source
INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS | 2018, Vol. 15, No. 2
Keywords
Convolutional neural network; object detection; cluster box; coordinate transformation; 3D object bounding box
DOI
10.1177/1729881418765507
Chinese Library Classification number
TP24 [Robotics]
Discipline classification codes
080202; 1405
Abstract
Three-dimensional object detection aims to produce a three-dimensional bounding box that encloses an object at its full extent. Currently, three-dimensional object detection is mainly based on red-green-blue-depth (RGB-D) images. However, it remains an open problem because three-dimensional training data are difficult to label. In this article, we present a novel three-dimensional object detection method based on two-dimensional object detection, which takes only a set of RGB images as input. First, to meet the requirements of three-dimensional object detection and address the low localization accuracy of You Only Look Once (YOLO), we propose a modified two-dimensional object detection method based on YOLO. Then, three-dimensional geometric data are reconstructed from a set of images taken from different viewing angles. Next, the modified YOLO method is used to obtain the two-dimensional object bounding boxes in the forward and side views. Finally, using the transformation between two-dimensional pixel coordinates and three-dimensional space coordinates, the two-dimensional bounding boxes are mapped onto the reconstructed three-dimensional scene to form the three-dimensional object box. Because the method needs only two-dimensional images to train the modified YOLO model, it has a wide range of applications. The experimental results show that the modified YOLO model improves localization accuracy and that our algorithm can effectively perform three-dimensional object detection without depth images.
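The abstract does not give the exact mapping between pixel and space coordinates, so the following is only a minimal sketch of the general idea under stated assumptions, not the authors' implementation. It assumes a standard pinhole camera model with hypothetical intrinsics (`fx`, `fy`, `cx`, `cy`) and a depth value taken from the reconstructed scene, and it assumes the forward-view and side-view boxes have already been expressed in a common, axis-aligned scene frame. The function names `pixel_to_camera` and `box3d_from_views` are illustrative and do not come from the paper.

```python
def pixel_to_camera(u, v, depth, fx, fy, cx, cy):
    """Back-project pixel (u, v) at a known depth into camera coordinates.

    Assumes an ideal pinhole camera: focal lengths fx, fy and principal
    point (cx, cy) in pixels; depth is measured along the optical axis.
    """
    x = (u - cx) * depth / fx
    y = (v - cy) * depth / fy
    return (x, y, depth)


def box3d_from_views(front_box, side_box):
    """Combine axis-aligned extents from a forward view and a side view.

    front_box: (x_min, y_min, x_max, y_max) in scene units (assumed frame)
    side_box:  (z_min, y_min, z_max, y_max) in scene units (assumed frame)
    Returns (x_min, y_min, z_min, x_max, y_max, z_max).
    """
    fx_min, fy_min, fx_max, fy_max = front_box
    sz_min, sy_min, sz_max, sy_max = side_box
    # The vertical extent is visible in both views; keep the overlap.
    y_min = max(fy_min, sy_min)
    y_max = min(fy_max, sy_max)
    # Width comes from the forward view, depth extent from the side view.
    return (fx_min, y_min, sz_min, fx_max, y_max, sz_max)


# Usage: a pixel at the principal point lies on the optical axis.
center = pixel_to_camera(320, 240, 2.0, 500.0, 500.0, 320.0, 240.0)
box = box3d_from_views((0.0, 0.0, 1.0, 2.0), (0.5, 0.1, 1.5, 1.9))
```

Taking the overlap of the two vertical extents is one possible design choice; a union or an average would be equally plausible when the two detections disagree.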
Pages: 13