REVERSE DOMAIN ADAPTATION FOR INDOOR CAMERA POSE REGRESSION

Authors
Acharya, Debaditya [1]
Khoshelham, Kourosh [2, 3]
Affiliations
[1] RMIT Univ, Geospatial Sci, Melbourne, Vic 3000, Australia
[2] Univ Melbourne, Dept Infrastruct Engn, Parkville, Vic 3010, Australia
[3] Bldg 4-0 CRC, Caulfield, Vic 3145, Australia
Keywords
Domain adaptation; GAN; deep learning; Indoor localization; 3D building models; camera pose regression; BIM;
DOI
10.5194/isprs-annals-X-1-W1-2023-453-2023
Chinese Library Classification (CLC) number
K85 [Cultural Relics and Archaeology];
Discipline classification code
0601;
Abstract
Synthetic images have been used to mitigate the scarcity of annotated data for training deep learning approaches, followed by domain adaptation that reduces the gap between synthetic and real images. One such approach uses Generative Adversarial Networks (GANs) such as CycleGAN to bridge the domain gap: the synthetic images are translated into real-looking synthetic images, which are then used to train the deep learning models. In this article, we explore the less intuitive alternative strategy of domain adaptation in the reverse direction, i.e., real-to-synthetic adaptation. We train the deep learning models directly on synthetic data, and then during inference we apply domain adaptation with CycleGAN to convert the real images to synthetic-looking real images. This strategy reduces the amount of data conversion required during training, can potentially generate artefact-free images compared to the harder synthetic-to-real case, and can improve the performance of deep learning models. We demonstrate the success of this strategy in indoor localisation by experimenting with camera pose regression. The experimental results indicate that the proposed real-to-synthetic adaptation improves localisation accuracy compared to synthetic-to-real adaptation.
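The inference-time order of operations described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: `localise`, `toy_real_to_synth`, and `toy_pose_regressor` are hypothetical names, the two toy functions merely stand in for a trained CycleGAN real-to-synthetic generator and a pose regression network trained on synthetic images, and the position-plus-quaternion pose representation is an assumption, since the abstract does not state one.

```python
import math
from typing import Callable, List, Sequence, Tuple

Image = List[List[float]]  # toy grayscale image with values in [0, 1]
Pose = Tuple[Tuple[float, ...], Tuple[float, ...]]  # (position, unit quaternion), assumed


def localise(
    real_image: Image,
    real_to_synth: Callable[[Image], Image],
    pose_regressor: Callable[[Image], Tuple[Sequence[float], Sequence[float]]],
) -> Pose:
    """Reverse adaptation at inference: translate the real image, then regress the pose."""
    # Step 1: real -> synthetic-looking image (the role played by the CycleGAN generator).
    synth_like = real_to_synth(real_image)
    # Step 2: pose regression with a model that only ever saw synthetic images.
    position, quaternion = pose_regressor(synth_like)
    # Normalise the quaternion so it represents a valid rotation.
    norm = math.sqrt(sum(q * q for q in quaternion))
    if norm == 0.0:
        raise ValueError("pose regressor returned a zero quaternion")
    return tuple(position), tuple(q / norm for q in quaternion)


# Hypothetical stand-ins so the sketch runs without trained networks.
def toy_real_to_synth(image: Image) -> Image:
    # Placeholder for a trained generator: removes texture by thresholding.
    return [[1.0 if px > 0.5 else 0.0 for px in row] for row in image]


def toy_pose_regressor(image: Image) -> Tuple[Sequence[float], Sequence[float]]:
    # Placeholder for a trained network: mean intensity as x, fixed y and z,
    # and an unnormalised identity rotation.
    mean = sum(sum(row) for row in image) / (len(image) * len(image[0]))
    return (mean, 0.0, 1.5), (2.0, 0.0, 0.0, 0.0)


position, orientation = localise(
    [[0.2, 0.9], [0.7, 0.1]], toy_real_to_synth, toy_pose_regressor
)
```

In this arrangement only the query images pass through the generator, which is how the strategy avoids converting the whole synthetic training set.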
Pages: 453-460
Number of pages: 8