Keypoint-Guided Efficient Pose Estimation and Domain Adaptation for Micro Aerial Vehicles

被引：1

作者：

Zheng, Ye ^{[1
,2
]}

Zheng, Canlun ^{[1
]}

Shen, Jiahao ^{[1
]}

Liu, Peidong ^{[1
]}

Zhao, Shiyu ^{[1
,3
]}

机构：

[1] Westlake Univ, Sch Engn, Hangzhou 310024, Peoples R China

[2] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Peoples R China

[3] Westlake Univ, Res Ctr Ind Future, Hangzhou 310024, Peoples R China

来源：

IEEE TRANSACTIONS ON ROBOTICS | 2024年 / 40卷

关键词：

Pose estimation; Three-dimensional displays; Task analysis; Estimation; Training; Location awareness; Computational modeling; 6-D pose estimation; micro aerial vehicles (MAVs); unsupervised domain adaptation; RELATIVE LOCALIZATION; DRONE FLOCKING; TRACKING;

D O I：

10.1109/TRO.2024.3400938

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

Visual detection of micro aerial vehicles (MAVs) is an important problem in many tasks such as vision-based swarming of MAVs. This article studies vision-based 6-D pose estimation to detect a 3-D bounding box of a target MAV, and then, estimate its 3-D position and 3-D attitude. The 3-D attitude information is critical to better estimate the target's velocity since the attitude and motion are dynamically coupled. In this article, we propose a novel 6-D pose estimation method, whose novelties are threefold. First, we propose a novel centroid point-guided keypoint localization network that outperforms the state-of-the-art methods in terms of both accuracy and efficiency. Second, while there are no publicly available real-world datasets for 6-D pose estimation for MAVs up to now, we propose a high-quality dataset based on an automatic dataset collection method. Third, since the dataset is collected in an indoor environment but detection tasks are usually in outdoor environments, we propose a self-training-based unsupervised domain adaption method to transfer the method from indoor to outdoor. Finally, we show that the estimated 6-D pose especially the 3-D attitude can significantly help improve the target's velocity estimation.

引用

页码：2967 / 2983

页数：17

共 54 条

[1]

Albanis Georgios, 2020, Computer Vision - ECCV 2020 Workshops. Proceedings. Lecture Notes in Computer Science (LNCS 12536), P663, DOI 10.1007/978-3-030-66096-3_44

[2]

Ben-David Shai, 2012, Algorithmic Learning Theory. 23rd International Conference (ALT 2012). Proceedings, P139, DOI 10.1007/978-3-642-34106-9_14

[3]

Bochkovskiy A, 2020, Arxiv, DOI arXiv:2004.10934

[4]

Brachmann E, 2014, LECT NOTES COMPUT SC, V8690, P536, DOI 10.1007/978-3-319-10605-2_35

[5]

Bukschat Y, 2020, Arxiv, DOI [arXiv:2011.04307, DOI 10.48550/ARXIV.2011.04307]

[6] Occlusion-Robust Object Pose Estimation with Holistic Representation [J].

Chen, Bo ;

Chin, Tat-Jun ;

Klimavicius, Marius .

2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, :2223-2233

[7] Navigation and Control of Unconventional VTOL UAVs in Forward-Flight With Explicit Wind Velocity Estimation [J].

Cohen, Mitchell R. ;

Forbes, James Richard .

IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (02) :1151-1158

[8]

Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

[9] SO-Pose: Exploiting Self-Occlusion for Direct 6D Pose Estimation [J].

Di, Yan ;

Manhardt, Fabian ;

Wang, Gu ;

Ji, Xiangyang ;

Navab, Nassir ;

Tombari, Federico .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :12376-12385

[10] JS']JST: Joint Self-training for Unsupervised Domain Adaptation on 2D&3D Object Detection [J].

Ding, Guangyao ;

Zhang, Meiying ;

Li, E. ;

Hao, Qi .

2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022,

← 1 2 3 4 5 6 →