Cross-Domain Indoor Visual Place Recognition for Mobile Robot via Generalization Using Style Augmentation

被引:3
作者
Wozniak, Piotr [1 ]
Ozog, Dominik [1 ]
机构
[1] Rzeszow Univ Technol, Fac Elect & Comp Engn, Dept Comp & Control Engn, Al Powstancow Warszawy 12, PL-35959 Rzeszow, Poland
关键词
visual place recognition; CNNs; multi-domain learning; domain generalization; transfer learning;
D O I
10.3390/s23136134
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
The article presents an algorithm for the multi-domain visual recognition of an indoor place. It is based on a convolutional neural network and style randomization. The authors proposed a scene classification mechanism and improved the performance of the models based on synthetic and real data from various domains. In the proposed dataset, a domain change was defined as a camera model change. A dataset of images collected from several rooms was used to show different scenarios, human actions, equipment changes, and lighting conditions. The proposed method was tested in a scene classification problem where multi-domain data were used. The basis was a transfer learning approach with an extension style applied to various combinations of source and target data. The focus was on improving the unknown domain score and multi-domain support. The results of the experiments were analyzed in the context of data collected on a humanoid robot. The article shows that the average score was the highest for the use of multi-domain data and data style enhancement. The method of obtaining average results for the proposed method reached the level of 92.08%. The result obtained by another research team was corrected.
引用
收藏
页数:19
相关论文
共 53 条
  • [1] Al-Qizwini M, 2017, IEEE INT VEH SYM, P89, DOI 10.1109/IVS.2017.7995703
  • [2] MixVPR: Feature Mixing for Visual Place Recognition
    Ali-bey, Amar
    Chaib-draa, Brahim
    Giguere, Philippe
    [J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2997 - 3006
  • [3] Arandjelovic R, 2016, Arxiv, DOI arXiv:1511.07247
  • [4] Barros T, 2022, Arxiv, DOI arXiv:2106.10458
  • [5] Baumgartl H., 2020, P 53 HAW INT C SYST, DOI [10.24251/HICSS.2020.069, DOI 10.24251/HICSS.2020.069]
  • [6] SURF: Speeded up robust features
    Bay, Herbert
    Tuytelaars, Tinne
    Van Gool, Luc
    [J]. COMPUTER VISION - ECCV 2006 , PT 1, PROCEEDINGS, 2006, 3951 : 404 - 417
  • [7] Chatfield K, 2014, Arxiv, DOI arXiv:1405.3531
  • [8] Domain Adaptive Faster R-CNN for Object Detection in the Wild
    Chen, Yuhua
    Li, Wen
    Sakaridis, Christos
    Dai, Dengxin
    Van Gool, Luc
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3339 - 3348
  • [9] Multi-Scale Fully Convolutional Network-Based Semantic Segmentation for Mobile Robot Navigation
    Dang, Thai-Viet
    Bui, Ngoc-Tam
    [J]. ELECTRONICS, 2023, 12 (03)
  • [10] Dara Suresh, 2018, 2018 Second International Conference on Electronics, Communication and Aerospace Technology (ICECA), P1795, DOI 10.1109/ICECA.2018.8474912