PIXGAN-Drone: 3D Avatar of Human Body Reconstruction From Multi-View 2D Images

Times cited: 0
Authors
Rasheed, Ali Salim [1, 2]
Jabberi, Marwa [2]
Hamdani, Tarek M. [2, 3]
Alimi, Adel M. [2, 4]
Affiliations
[1] Univ Informat Technol & Commun, Coll Engn, Dept Media Technol & Commun Engn, Baghdad 00964, Iraq
[2] Univ Sfax, Natl Engn Sch Sfax ENIS, Res Grp Intelligent Machines ReGIM Lab, Sfax 3038, Tunisia
[3] Univ Monastir, Higher Inst Comp Sci Mahdia ISIMa, Monastir 5147, Tunisia
[4] Univ Johannesburg, Fac Engn & Built Environm, Dept Elect & Elect Engn Sci, Johannesburg 3038, South Africa
Keywords
Three-dimensional displays; Image reconstruction; Avatars; Solid modeling; Drones; Feature extraction; Generative adversarial networks; Reconstruction algorithms; Rendering (computer graphics); 3D human avatar; 3D reconstruction; PIX2PIXGAN; body model rendering; drone active tracking
DOI
10.1109/ACCESS.2024.3404554
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Research is under way on training Generative Adversarial Networks (GANs) on 2D datasets to generate 3D human body avatars. Promising results in this area have significantly advanced numerous applications, including virtual reality, sports analysis, cinematography, and surveillance. Aerial photography sensors combined with drone active tracking can remove occlusions and enable 3D avatar body reconstruction by avoiding obstacles and capturing high-resolution, information-rich multi-view RGB photos. Training failures in 3D avatar reconstruction techniques lead to distortions and loss of features in the reconstructed 3D models for several reasons, including limited viewpoint coverage, visible occlusions, and texture disappearance. This work presents PIXGAN-Drone, a photo-realistic 3D avatar reconstruction system for the human body from multi-view photos. To create high-resolution 2D models, this recently developed end-to-end trainable deep neural network technique integrates aerial photography sensors (a steady autonomous circular motion system) coupled with active tracking drones into the Pix2Pix GAN training framework. Conditional image-to-image translation and dynamic aerial views make it possible to create accurate and realistic 3D models. Tests on several datasets show that our approach outperforms state-of-the-art approaches on a variety of metrics (Chamfer, P2S, and CED). Our reconstructed 3D human avatars scored 0.0293, 0.0271, and 0.0232 on RenderPeople; 0.0133, 0.0136, and 0.0050 on People Snapshot (indoor); 0.0154, 0.0101, and 0.0063 on People Snapshot (outdoor); and 0.0316, 0.0275, and 0.0216 on Custom data-drone (collected dataset).
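For readers unfamiliar with the Chamfer metric named in the abstract, the following is a minimal illustrative sketch of one common symmetric definition (mean nearest-neighbour distance in both directions). The abstract does not state which variant the authors use (squared or unsquared, summed or averaged), so the function names, the toy point sets, and the choice of variant here are all assumptions for illustration only.

```python
import math


def nearest_dist(p, cloud):
    """Euclidean distance from point p to its nearest neighbour in cloud."""
    return min(math.dist(p, q) for q in cloud)


def chamfer_distance(a, b):
    """Symmetric Chamfer distance between two point sets.

    Assumed variant: mean nearest-neighbour distance from a to b,
    plus the same quantity from b to a. Other papers square the
    distances or sum instead of averaging.
    """
    if not a or not b:
        raise ValueError("both point sets must be non-empty")
    a_to_b = sum(nearest_dist(p, b) for p in a) / len(a)
    b_to_a = sum(nearest_dist(q, a) for q in b) / len(b)
    return a_to_b + b_to_a


# Hypothetical toy data: two points sampled from a reconstruction
# and two from a ground-truth surface.
recon = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
truth = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.5)]
print(chamfer_distance(recon, truth))
```

Lower values indicate that the two point sets lie closer together, which is why smaller scores in the abstract correspond to better reconstructions. This brute-force version is quadratic in the number of points; practical evaluations typically use a spatial index.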
Pages: 74762-74776
Number of pages: 15
Related papers
81 in total
[1]   Tex2Shape: Detailed Full Human Body Geometry From a Single Image [J].
Alldieck, Thiemo ;
Pons-Moll, Gerard ;
Theobalt, Christian ;
Magnor, Marcus .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :2293-2303
[2]   Video Based Reconstruction of 3D People Models [J].
Alldieck, Thiemo ;
Magnor, Marcus ;
Xu, Weipeng ;
Theobalt, Christian ;
Pons-Moll, Gerard .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :8387-8397
[3]   Vision-Based Navigation Techniques for Unmanned Aerial Vehicles: Review and Challenges [J].
Arafat, Muhammad Yeasir ;
Alam, Muhammad Morshed ;
Moh, Sangman .
DRONES, 2023, 7 (02)
[4]   Doppelgangers: Learning to Disambiguate Images of Similar Structures [J].
Cai, Ruojin ;
Tung, Joseph ;
Wang, Qianqian ;
Averbuch-Elor, Hadar ;
Hariharan, Bharath ;
Snavely, Noah .
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, :34-44
[5]   Integrated Surveying, from Laser Scanning to UAV Systems, for Detailed Documentation of Architectural and Archeological Heritage [J].
Calisi, Daniele ;
Botta, Stefano ;
Cannata, Alessandro .
DRONES, 2023, 7 (09)
[6]   MFGAN: towards a generic multi-kernel filter based adversarial generator for image restoration [J].
Chahi, Abderrazak ;
Kas, Mohamed ;
Kajo, Ibrahim ;
Ruichek, Yassine .
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (03) :1113-1136
[7]   Control of a Novel Parallel Mechanism for the Stabilization of Unmanned Aerial Vehicles [J].
Chamas, Mohamad Haidar ;
Amine, Semaan ;
Gazo Hanna, Eddie ;
Mokhiamar, Ossama .
APPLIED SCIENCES-BASEL, 2023, 13 (15)
[8]  
Chen AJ, 2023, Arxiv, DOI arXiv:2210.01346
[9]   A self-rotating, single-actuated UAV with extended sensor field of view for autonomous navigation [J].
Chen, Nan ;
Kong, Fanze ;
Xu, Wei ;
Cai, Yixi ;
Li, Haotian ;
He, Dongjiao ;
Qin, Youming ;
Zhang, Fu .
SCIENCE ROBOTICS, 2023, 8 (76)
[10]   Rad-cGAN v1.0: Radar-based precipitation nowcasting model with conditional generative adversarial networks for multiple dam domains [J].
Choi, Suyeon ;
Kim, Yeonjoo .
GEOSCIENTIFIC MODEL DEVELOPMENT, 2022, 15 (15) :5967-5985