A Comprehensive Review on 3D Object Detection and 6D Pose Estimation With Deep Learning

被引:32
作者
Hoque, Sabera [1 ]
Arafat, Md. Yasir [1 ]
Xu, Shuxiang [1 ]
Maiti, Ananda [1 ]
Wei, Yuchen [1 ]
机构
[1] Univ Tasmania, Sch Informat & Commun Technol, Newnham, Tas 7248, Australia
关键词
Three-dimensional displays; Object detection; Pose estimation; Laser radar; Cameras; Visualization; Automobiles; Machine learning; deep neural network; computer vision; image processing; convolutional neural network; 3D object detection; 6D pose estimation; NEURAL-NETWORKS; IMAGE FEATURES; RECOGNITION; REPRESENTATION; LOCALIZATION; TRACKING; SEGMENTATION;
D O I
10.1109/ACCESS.2021.3114399
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Nowadays, computer vision with 3D (dimension) object detection and 6D (degree of freedom) pose assumptions are widely discussed and studied in the field. In the 3D object detection process, classifications are centered on the object's size, position, and direction. And in 6D pose assumptions, networks emphasize 3D translation and rotation vectors. Successful application of these strategies can have a huge impact on various machine learning-based applications, including the autonomous vehicles, the robotics industry, and the augmented reality sector. Although extensive work has been done on 3D object detection with a pose assumption from RGB images, the challenges have not been fully resolved. Our analysis provides a comprehensive review of the proposed contemporary techniques for complete 3D object detection and the recovery of 6D pose assumptions of an object. In this review research paper, we have discussed several proposed sophisticated methods in 3D object detection and 6D pose estimation, including some popular data sets, evaluation matrix, and proposed method challenges. Most importantly, this study makes an effort to offer some possible future directions in 3D object detection and 6D pose estimation. We accept the autonomous vehicle as the sample case for this detailed review. Finally, this review provides a complete overview of the latest in-depth learning-based research studies related to 3D object detection and 6D pose estimation systems and points out a comparison between some popular frameworks. To be more concise, we propose a detailed summary of the state-of-the-art techniques of modern deep learning-based object detection and pose estimation models.
引用
收藏
页码:143746 / 143770
页数:25
相关论文
共 293 条
  • [1] TaDPole: Traffic-aware Discovery Protocol for Software-Defined Wireless and Mobile Networks
    Alenezi, Faheed A. F.
    Song, Sejun
    Choi, Baek-Young
    Gebre-Amlak, Haymanot
    [J]. 2020 29TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS (ICCCN 2020), 2020,
  • [2] [Anonymous], OPEN ACCESS REPOSITO
  • [3] [Anonymous], 2021, WHAT IS LID
  • [4] [Anonymous], THESIS U TASMANIA HO
  • [5] Arafeh M, 2019, 2019 4TH INTERNATIONAL CONFERENCE ON SMART AND SUSTAINABLE TECHNOLOGIES (SPLITECH), P208, DOI [10.23919/splitech.2019.8783092, 10.1109/skima47702.2019.8982391]
  • [6] A survey of augmented reality
    Azuma, RT
    [J]. PRESENCE-VIRTUAL AND AUGMENTED REALITY, 1997, 6 (04): : 355 - 385
  • [7] GENERALIZING THE HOUGH TRANSFORM TO DETECT ARBITRARY SHAPES
    BALLARD, DH
    [J]. PATTERN RECOGNITION, 1981, 13 (02) : 111 - 122
  • [8] Digital image restoration
    Banham, MR
    Katsaggelos, AK
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 1997, 14 (02) : 24 - 41
  • [9] MonoFENet: Monocular 3D Object Detection With Feature Enhancement Networks
    Bao, Wentao
    Xu, Bin
    Chen, Zhenzhong
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 2753 - 2765
  • [10] Current challenges in autonomous driving
    Barabas, I.
    Todorut, A.
    Cordos, N.
    Molea, A.
    [J]. INTERNATIONAL CONGRESS OF AUTOMOTIVE AND TRANSPORT ENGINEERING - MOBILITY ENGINEERING AND ENVIRONMENT (CAR2017), 2017, 252