A novel three-dimensional object detection with the modified You Only Look Once method

Cited by: 12
Authors
Zhao, Xia [1]
Jia, Haihang [1]
Ni, Yingting [1]
Affiliation
[1] Tongji Univ, Coll Elect & Informat Engn, Shanghai, Peoples R China
Source
INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS | 2018 / Vol. 15 / No. 02
Keywords
Convolutional neural network; object detection; cluster box; coordinate transformation; 3D object bounding box;
DOI
10.1177/1729881418765507
Chinese Library Classification number
TP24 [Robotics];
Discipline classification code
080202 ; 1405 ;
Abstract
Three-dimensional object detection aims to produce a three-dimensional bounding box that covers an object's full extent. At present, three-dimensional object detection is mainly based on red-green-blue-depth (RGB-D) images. However, it remains an open problem because three-dimensional training data are difficult to label. In this article, we present a novel three-dimensional object detection method based on two-dimensional object detection, which takes only a set of RGB images as input. First, to meet the requirements of three-dimensional object detection and to address the low localization accuracy of You Only Look Once, a modified two-dimensional object detection method based on You Only Look Once is proposed. Then, three-dimensional geometric data are reconstructed from a set of images taken from different viewing angles. In addition, the modified You Only Look Once method is used to obtain the two-dimensional object bounding boxes of the front and side views. Finally, according to the transformation between two-dimensional pixel coordinates and three-dimensional space coordinates, the two-dimensional object bounding boxes are mapped onto the reconstructed three-dimensional scene to form the three-dimensional object box. Because this method needs only two-dimensional images to train the modified You Only Look Once model, it has a wide range of applications. The experimental results show that the modified You Only Look Once model improves localization accuracy and that our algorithm can effectively perform three-dimensional object detection without depth images.
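The abstract does not spell out the transformation between pixel coordinates and space coordinates. As an illustration only, the sketch below assumes a standard pinhole camera model with a known intrinsic matrix `K` and a known depth for the box plane (for example, read from the reconstructed scene). The function names, the intrinsic values, and the sample box are hypothetical and are not taken from the paper.

```python
import numpy as np

def backproject_pixel(u, v, depth, K):
    """Map pixel (u, v) at a given depth along the optical axis
    to a 3D point in the camera frame (pinhole model, assumed)."""
    fx, fy = K[0, 0], K[1, 1]   # focal lengths in pixels
    cx, cy = K[0, 2], K[1, 2]   # principal point
    x = (u - cx) * depth / fx
    y = (v - cy) * depth / fy
    return np.array([x, y, depth], dtype=float)

def box_corners_to_3d(box, depth, K):
    """Lift the four corners of a 2D box (u_min, v_min, u_max, v_max)
    onto the plane at the given depth; returns a 4x3 array."""
    u0, v0, u1, v1 = box
    corners_2d = [(u0, v0), (u1, v0), (u1, v1), (u0, v1)]
    return np.stack([backproject_pixel(u, v, depth, K) for u, v in corners_2d])

# Hypothetical intrinsics and detection box, chosen only for illustration.
K = np.array([[500.0,   0.0, 320.0],
              [  0.0, 500.0, 240.0],
              [  0.0,   0.0,   1.0]])
corners = box_corners_to_3d((220, 140, 420, 340), depth=2.0, K=K)
print(corners)
```

Under these assumptions, repeating the lift for a front-view box and a side-view box would give the extents along all three axes; how the paper actually combines the two views is not stated in the abstract.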
Pages: 13