A 6D Object Pose Estimation Method combining Self-attention Mechanism

Cited by: 1
Authors
Sun, Yifan [1]
Dai, Sumin [2]
Dang, Jianwu [1]
Yong, Jiu [1]
Affiliations
[1] Lanzhou Jiaotong Univ, Sch Elect & Informat Engn, Lanzhou, Peoples R China
[2] Beijing Fibrlink Commun Co Ltd, Beijing, Peoples R China
Source
2024 5TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATION, ICCEA 2024 | 2024
Keywords
6D pose estimation; Self-attention mechanism; Residual structure; Deep learning
DOI
10.1109/ICCEA62105.2024.10604264
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
To address the limited accuracy of 6D pose estimation algorithms, a YOLO-6D object pose estimation method that introduces an expectation-maximization self-attention mechanism is proposed. First, the CSPDarkNet53 network replaces the original DarkNet19 network, increasing network depth and improving the network's feature extraction ability. Second, the expectation-maximization self-attention module (EMAU) is combined with residual structures to form the ResEMA structure, which is incorporated into the backbone network to extract more fine-grained representations. Finally, experiments on the publicly available LineMOD dataset show that the improved algorithm is significantly more accurate, with a 6.78% increase in accuracy under the 2D projection error metric, a 9.14% increase in ADD accuracy, and a 13.44% increase in 5 cm, 5° accuracy.
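The three metrics named in the abstract can be illustrated with a minimal pure-Python sketch. It is not taken from the paper: the function names, the pinhole intrinsics tuple, and the thresholds for ADD (10% of the object diameter) and 2D projection (5 pixels) are assumptions based on the conventions commonly used with LineMOD; only the 5 cm, 5° threshold is stated in the abstract. Lengths are assumed to be in metres.

```python
import math

def transform(points, R, t):
    """Apply the rigid transform R @ p + t to each 3D point."""
    return [tuple(sum(R[i][j] * p[j] for j in range(3)) + t[i] for i in range(3))
            for p in points]

def add_error(points, R_gt, t_gt, R_est, t_est):
    """ADD: mean 3D distance between model points under the two poses."""
    gt = transform(points, R_gt, t_gt)
    est = transform(points, R_est, t_est)
    return sum(math.dist(a, b) for a, b in zip(gt, est)) / len(points)

def project(points, R, t, intrinsics):
    """Pinhole projection; intrinsics = (fx, fy, cx, cy) is an assumed layout."""
    fx, fy, cx, cy = intrinsics
    return [(fx * x / z + cx, fy * y / z + cy) for x, y, z in transform(points, R, t)]

def reprojection_error(points, R_gt, t_gt, R_est, t_est, intrinsics):
    """Mean pixel distance between the projections under the two poses."""
    gt = project(points, R_gt, t_gt, intrinsics)
    est = project(points, R_est, t_est, intrinsics)
    return sum(math.dist(a, b) for a, b in zip(gt, est)) / len(points)

def rotation_error_deg(R_gt, R_est):
    """Angle of the relative rotation, from trace(R_est @ R_gt^T)."""
    trace = sum(R_est[i][j] * R_gt[i][j] for i in range(3) for j in range(3))
    cos_angle = max(-1.0, min(1.0, (trace - 1.0) / 2.0))
    return math.degrees(math.acos(cos_angle))

def evaluate_pose(points, diameter, R_gt, t_gt, R_est, t_est, intrinsics):
    """Return pass/fail per metric; the ADD and pixel thresholds are assumptions."""
    return {
        "add": add_error(points, R_gt, t_gt, R_est, t_est) < 0.1 * diameter,
        "proj2d": reprojection_error(points, R_gt, t_gt, R_est, t_est, intrinsics) < 5.0,
        "5cm5deg": math.dist(t_gt, t_est) < 0.05
                   and rotation_error_deg(R_gt, R_est) < 5.0,
    }
```

An accuracy figure such as those reported in the abstract would then be the fraction of test images for which the corresponding entry is true.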
Pages: 1315-1319
Number of pages: 5
Related papers
21 in total
[1]  
Bao Zhiqiang, 2021, Computer Engineering and Applications, V57, P148, DOI 10.3778/j.issn.1002-8331.2001-0367
[2]   SURF: Speeded up robust features [J].
Bay, Herbert ;
Tuytelaars, Tinne ;
Van Gool, Luc .
COMPUTER VISION - ECCV 2006 , PT 1, PROCEEDINGS, 2006, 3951 :404-417
[3]  
Hinterstoisser S., 2013, COMPUTER VISION ACCV, V7724, P548, DOI 10.1007/978-3-642-37331-2_42
[4]  
Hinterstoisser S, 2011, IEEE I CONF COMP VIS, P858, DOI 10.1109/ICCV.2011.6126326
[5]   Projective reconstruction from multiple views with minimization of 2D reprojection error [J].
Hung, YS ;
Tang, WK .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2006, 66 (03) :305-317
[6]   SSD-6D: Making RGB-Based 3D Detection and 6D Pose Estimation Great Again [J].
Kehl, Wadim ;
Manhardt, Fabian ;
Tombari, Federico ;
Ilic, Slobodan ;
Navab, Nassir .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :1530-1538
[7]   Learning Analysis-by-Synthesis for 6D Pose Estimation in RGB-D Images [J].
Krull, Alexander ;
Brachmann, Eric ;
Michel, Frank ;
Yang, Michael Ying ;
Gumhold, Stefan ;
Rother, Carsten .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :954-962
[8]  
Liu J. H., 2023, University of Chinese Academy of Sciences (Innovation Academy for Microsatellites of CAS), DOI 10.44194/d.cnki.gwxwx.2023.000016
[9]  
Liu M., 2022, Journal of Jilin University (Science Edition), V60, P1176
[10]  
Liu Zeyang, 2023, Application Research of Computers, V40, P938