Keypoint-Guided Efficient Pose Estimation and Domain Adaptation for Micro Aerial Vehicles

被引:1
作者
Zheng, Ye [1 ,2 ]
Zheng, Canlun [1 ]
Shen, Jiahao [1 ]
Liu, Peidong [1 ]
Zhao, Shiyu [1 ,3 ]
机构
[1] Westlake Univ, Sch Engn, Hangzhou 310024, Peoples R China
[2] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Peoples R China
[3] Westlake Univ, Res Ctr Ind Future, Hangzhou 310024, Peoples R China
关键词
Pose estimation; Three-dimensional displays; Task analysis; Estimation; Training; Location awareness; Computational modeling; 6-D pose estimation; micro aerial vehicles (MAVs); unsupervised domain adaptation; RELATIVE LOCALIZATION; DRONE FLOCKING; TRACKING;
D O I
10.1109/TRO.2024.3400938
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Visual detection of micro aerial vehicles (MAVs) is an important problem in many tasks such as vision-based swarming of MAVs. This article studies vision-based 6-D pose estimation to detect a 3-D bounding box of a target MAV, and then, estimate its 3-D position and 3-D attitude. The 3-D attitude information is critical to better estimate the target's velocity since the attitude and motion are dynamically coupled. In this article, we propose a novel 6-D pose estimation method, whose novelties are threefold. First, we propose a novel centroid point-guided keypoint localization network that outperforms the state-of-the-art methods in terms of both accuracy and efficiency. Second, while there are no publicly available real-world datasets for 6-D pose estimation for MAVs up to now, we propose a high-quality dataset based on an automatic dataset collection method. Third, since the dataset is collected in an indoor environment but detection tasks are usually in outdoor environments, we propose a self-training-based unsupervised domain adaption method to transfer the method from indoor to outdoor. Finally, we show that the estimated 6-D pose especially the 3-D attitude can significantly help improve the target's velocity estimation.
引用
收藏
页码:2967 / 2983
页数:17
相关论文
共 54 条
[1]  
Albanis Georgios, 2020, Computer Vision - ECCV 2020 Workshops. Proceedings. Lecture Notes in Computer Science (LNCS 12536), P663, DOI 10.1007/978-3-030-66096-3_44
[2]  
Ben-David Shai, 2012, Algorithmic Learning Theory. 23rd International Conference (ALT 2012). Proceedings, P139, DOI 10.1007/978-3-642-34106-9_14
[3]  
Bochkovskiy A, 2020, Arxiv, DOI arXiv:2004.10934
[4]  
Brachmann E, 2014, LECT NOTES COMPUT SC, V8690, P536, DOI 10.1007/978-3-319-10605-2_35
[5]  
Bukschat Y, 2020, Arxiv, DOI [arXiv:2011.04307, DOI 10.48550/ARXIV.2011.04307]
[6]   Occlusion-Robust Object Pose Estimation with Holistic Representation [J].
Chen, Bo ;
Chin, Tat-Jun ;
Klimavicius, Marius .
2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, :2223-2233
[7]   Navigation and Control of Unconventional VTOL UAVs in Forward-Flight With Explicit Wind Velocity Estimation [J].
Cohen, Mitchell R. ;
Forbes, James Richard .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (02) :1151-1158
[8]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[9]   SO-Pose: Exploiting Self-Occlusion for Direct 6D Pose Estimation [J].
Di, Yan ;
Manhardt, Fabian ;
Wang, Gu ;
Ji, Xiangyang ;
Navab, Nassir ;
Tombari, Federico .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :12376-12385
[10]   JS']JST: Joint Self-training for Unsupervised Domain Adaptation on 2D&3D Object Detection [J].
Ding, Guangyao ;
Zhang, Meiying ;
Li, E. ;
Hao, Qi .
2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022,