Direct pose estimation from RGB images using 3D objects

Times cited: 1
Authors
Dede, Muhammet Ali [1]
Genc, Yakup [1]
Affiliations
[1] Gebze Tech Univ, Fac Engn, Dept Comp Engn, Gebze, Turkey
Source
PAMUKKALE UNIVERSITY JOURNAL OF ENGINEERING SCIENCES-PAMUKKALE UNIVERSITESI MUHENDISLIK BILIMLERI DERGISI | 2022 / Vol. 28 / No. 02
Keywords
Augmented reality; Pose estimation; Deep learning
DOI
10.5505/pajes.2021.08566
Chinese Library Classification (CLC)
T [Industrial technology]
Discipline classification code
08
Abstract
We present a real-time monocular camera pose estimation algorithm for augmented reality applications. The proposed model is a small convolutional neural network trained to estimate the 6 degrees of freedom (6-DOF) camera pose directly from an RGB image. The model is designed to run in real time on devices with limited memory and computing power. It can estimate the camera pose in less than 1 ms while keeping accuracy comparable to the state of the art. This was made possible by employing geometrically sound loss functions and algebraic constraints. Furthermore, we introduce a new synthetic dataset to demonstrate the proposed method's capabilities.
Pages: 277-285
Page count: 9