A 6D Object Pose Estimation Method combining Self-attention Mechanism

Cited by: 1

Authors:

Sun, Yifan [1]

Dai, Sumin [2]

Dang, Jianwu [1]

Yong, Jiu [1]

Affiliations:

[1] Lanzhou Jiaotong Univ, Sch Elect & Informat Engn, Lanzhou, Peoples R China

[2] Beijing Fibrlink Commun Co Ltd, Beijing, Peoples R China

Source:

2024 5TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATION, ICCEA 2024 | 2024

Keywords:

6D pose estimation; Self-attention mechanism; Residual structure; Deep learning;

DOI:

10.1109/ICCEA62105.2024.10604264

Chinese Library Classification (CLC) number:

TP39 [Computer applications];

Discipline classification code:

081203 ; 0835 ;

Abstract:

To improve the accuracy of 6D pose estimation, a YOLO-6D object pose estimation method that introduces the expectation-maximization self-attention mechanism is proposed. First, the original DarkNet19 network is replaced with the CSPDarkNet53 network, which increases network depth and improves the network's feature extraction ability. Second, the expectation-maximization self-attention module (EMAU) is combined with residual structures to form the ResEMA structure, which is incorporated into the backbone network to extract finer-grained representations. Finally, experiments on the publicly available LineMOD dataset show that the improved algorithm significantly improves accuracy, with a 6.78% increase in accuracy under the 2D projection error metric, a 9.14% increase in ADD accuracy, and a 13.44% increase in 5 cm, 5° accuracy.
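The three metrics named in the abstract can be illustrated with a minimal sketch. This is not the authors' evaluation code. It assumes poses are given as a 3x3 rotation matrix plus a translation in metres, and that the camera is a pinhole model with intrinsics `(fx, fy, cx, cy)`. All function names are hypothetical. The pass thresholds often used on LineMOD (ADD below 10% of the model diameter, mean reprojection error below 5 pixels) are common conventions and are not stated in the abstract, so only the 5 cm, 5° check is hard-coded here.

```python
import math

def transform(R, t, p):
    """Apply rotation R (3x3 nested list) and translation t to a 3D point p."""
    return [sum(R[i][j] * p[j] for j in range(3)) + t[i] for i in range(3)]

def add_error(R_gt, t_gt, R_pred, t_pred, model_points):
    """ADD: mean 3D distance between model points under the two poses."""
    dists = [math.dist(transform(R_gt, t_gt, p), transform(R_pred, t_pred, p))
             for p in model_points]
    return sum(dists) / len(dists)

def rotation_error_deg(R_gt, R_pred):
    """Angle (degrees) of the relative rotation R_pred * R_gt^T."""
    # trace(R_pred @ R_gt^T) equals the element-wise product sum of the two matrices.
    trace = sum(R_pred[i][j] * R_gt[i][j] for i in range(3) for j in range(3))
    cos_angle = max(-1.0, min(1.0, (trace - 1.0) / 2.0))  # clamp against round-off
    return math.degrees(math.acos(cos_angle))

def translation_error(t_gt, t_pred):
    """Euclidean distance between the two translations (same unit as the input)."""
    return math.dist(t_gt, t_pred)

def projection_error_px(K, R_gt, t_gt, R_pred, t_pred, model_points):
    """Mean 2D distance (pixels) between projections of the model points."""
    fx, fy, cx, cy = K  # assumed pinhole intrinsics, no lens distortion

    def project(R, t, p):
        x, y, z = transform(R, t, p)
        return (fx * x / z + cx, fy * y / z + cy)

    dists = [math.dist(project(R_gt, t_gt, p), project(R_pred, t_pred, p))
             for p in model_points]
    return sum(dists) / len(dists)

def within_5cm_5deg(R_gt, t_gt, R_pred, t_pred):
    """5 cm, 5 degree criterion; translations are assumed to be in metres."""
    return (translation_error(t_gt, t_pred) < 0.05
            and rotation_error_deg(R_gt, R_pred) < 5.0)

def rot_z(deg):
    """Helper for the example: rotation about the z axis by `deg` degrees."""
    c, s = math.cos(math.radians(deg)), math.sin(math.radians(deg))
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]

# Toy data: corners of a 10 cm cube placed 1 m in front of the camera.
IDENTITY = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
CUBE = [[sx * 0.05, sy * 0.05, sz * 0.05]
        for sx in (-1, 1) for sy in (-1, 1) for sz in (-1, 1)]
T_GT = [0.0, 0.0, 1.0]
T_PRED = [0.01, 0.0, 1.0]  # 1 cm translation error along x

if __name__ == "__main__":
    print("ADD (m):", add_error(IDENTITY, T_GT, rot_z(3.0), T_PRED, CUBE))
    print("rotation error (deg):", rotation_error_deg(IDENTITY, rot_z(3.0)))
    print("5cm5deg pass:", within_5cm_5deg(IDENTITY, T_GT, rot_z(3.0), T_PRED))
```

A reported accuracy figure under each metric is then the fraction of test images whose predicted pose falls below the chosen threshold for that metric.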

Pages: 1315-1319

Page count: 5