Cross-Domain Indoor Visual Place Recognition for Mobile Robot via Generalization Using Style Augmentation

被引：3

作者：

Wozniak, Piotr ^{[1
]}

Ozog, Dominik ^{[1
]}

机构：

[1] Rzeszow Univ Technol, Fac Elect & Comp Engn, Dept Comp & Control Engn, Al Powstancow Warszawy 12, PL-35959 Rzeszow, Poland

来源：

SENSORS | 2023年 / 23卷 / 13期

关键词：

visual place recognition; CNNs; multi-domain learning; domain generalization; transfer learning;

D O I：

10.3390/s23136134

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

The article presents an algorithm for the multi-domain visual recognition of an indoor place. It is based on a convolutional neural network and style randomization. The authors proposed a scene classification mechanism and improved the performance of the models based on synthetic and real data from various domains. In the proposed dataset, a domain change was defined as a camera model change. A dataset of images collected from several rooms was used to show different scenarios, human actions, equipment changes, and lighting conditions. The proposed method was tested in a scene classification problem where multi-domain data were used. The basis was a transfer learning approach with an extension style applied to various combinations of source and target data. The focus was on improving the unknown domain score and multi-domain support. The results of the experiments were analyzed in the context of data collected on a humanoid robot. The article shows that the average score was the highest for the use of multi-domain data and data style enhancement. The method of obtaining average results for the proposed method reached the level of 92.08%. The result obtained by another research team was corrected.

引用

页数：19

共 53 条

[1] Al-Qizwini M, 2017, IEEE INT VEH SYM, P89, DOI 10.1109/IVS.2017.7995703
[2] MixVPR: Feature Mixing for Visual Place Recognition
Ali-bey, Amar
Chaib-draa, Brahim
Giguere, Philippe
[J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2997 - 3006
[3] Arandjelovic R, 2016, Arxiv, DOI arXiv:1511.07247
[4] Barros T, 2022, Arxiv, DOI arXiv:2106.10458
[5] Baumgartl H., 2020, P 53 HAW INT C SYST, DOI [10.24251/HICSS.2020.069, DOI 10.24251/HICSS.2020.069]
[6] SURF: Speeded up robust features
Bay, Herbert
Tuytelaars, Tinne
Van Gool, Luc
[J]. COMPUTER VISION - ECCV 2006 , PT 1, PROCEEDINGS, 2006, 3951 : 404 - 417
[7] Chatfield K, 2014, Arxiv, DOI arXiv:1405.3531
[8] Domain Adaptive Faster R-CNN for Object Detection in the Wild
Chen, Yuhua
Li, Wen
Sakaridis, Christos
Dai, Dengxin
Van Gool, Luc
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3339 - 3348
[9] Multi-Scale Fully Convolutional Network-Based Semantic Segmentation for Mobile Robot Navigation
Dang, Thai-Viet
Bui, Ngoc-Tam
[J]. ELECTRONICS, 2023, 12 (03)
[10] Dara Suresh, 2018, 2018 Second International Conference on Electronics, Communication and Aerospace Technology (ICECA), P1795, DOI 10.1109/ICECA.2018.8474912

← 1 2 3 4 5 6 →