A Novel Approach to Image Retrieval for Vision-Based Positioning Utilizing Graph Topology

Cited by: 1

Authors:

Elashry, Abdelgwad ^{[1]}

Toth, Charles ^{[2]}

Affiliations:

[1] Elsewedy Univ Technol, Comp Engn, Cairo, Egypt

[2] Ohio State Univ, Dept Civil Environm & Geodet Engn, Columbus, OH USA

Source:

ISPRS ANNALS OF THE PHOTOGRAMMETRY, REMOTE SENSING AND SPATIAL INFORMATION SCIENCES: VOLUME X-2-2024 | 2024

Keywords:

Vision-based positioning; Image retrieval; Graph topology; The Bag of Visual Words; Fingerprinting;

DOI:

10.5194/isprs-annals-X-2-2024-49-2024

Chinese Library Classification:

V [Aeronautics, Astronautics];

Discipline classification code:

08 ; 0825 ;

Abstract:

This research introduces a novel approach to improving vision-based positioning in the absence of GNSS signals. Specifically, we address the challenge posed by obstacles that alter image information or features, which makes it difficult to retrieve the correct match for the query image from the database. The Bag of Visual Words (BoVW) is a widely used image retrieval technique, but it represents each image with a single histogram vector over a vocabulary of visual words; obstacles can introduce new features into the query image and thereby produce different visual words. Our study overcomes this limitation by clustering the features of each image using the k-means method and generating a graph for each resulting class. Each node (key point) in a graph obtains additional information from its direct neighbors using functions employed in graph neural networks, which act as a feedforward network with constant parameters. This process generates new node embeddings, and global pooling is then applied to produce one vector per graph, so that each image is represented by a set of graph vectors based on objects or feature classes. When obstacles cover one or more graphs, the remaining graphs of the query image still provide sufficient information to retrieve the most relevant image from the database. Our approach was applied to indoor positioning, with the database collected in Bolz Hall at The Ohio State University. Traditional BoVW techniques struggle to retrieve most query images correctly because obstacles such as people or recently deployed objects alter image features. In contrast, our approach improves image retrieval by representing each image with multiple graph vectors, the number of which depends on the number of objects in the image. This helps prevent or mitigate changes in image features caused by obstacles covering or adding features to the image, as demonstrated in the results.
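The pipeline described in the abstract (k-means over key-point features, one graph per class, neighbor aggregation with constant parameters, global pooling) can be illustrated with a minimal sketch. It is not the authors' implementation. Everything the abstract leaves open is an assumption here: edges connect each key point to its nearest neighbors in pixel space, aggregation and pooling are plain means, and k-means uses a deterministic farthest-first initialization. All function names are hypothetical.

```python
import math

def mean_vec(vectors):
    """Element-wise mean of a non-empty list of equal-length vectors."""
    return tuple(sum(col) / len(vectors) for col in zip(*vectors))

def kmeans_labels(points, k, iters=10):
    """Lloyd's k-means with farthest-first initialization (an assumption)."""
    centers = [points[0]]
    while len(centers) < k:
        # Next center: the point farthest from all centers chosen so far.
        centers.append(max(points, key=lambda p: min(math.dist(p, c) for c in centers)))
    labels = []
    for _ in range(iters):
        labels = [min(range(k), key=lambda c: math.dist(p, centers[c])) for p in points]
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # keep the previous center if a class becomes empty
                centers[c] = mean_vec(members)
    return labels

def nearest_neighbors(positions, n_neighbors):
    """For each node, the indices of its closest nodes by pixel distance."""
    result = []
    for i, p in enumerate(positions):
        others = sorted((j for j in range(len(positions)) if j != i),
                        key=lambda j: math.dist(p, positions[j]))
        result.append(others[:n_neighbors])
    return result

def graph_vectors(positions, descriptors, k, n_neighbors=2):
    """Return one pooled vector per feature class (graph) of a single image."""
    labels = kmeans_labels(descriptors, k)
    vectors = []
    for c in range(k):
        idx = [i for i, lab in enumerate(labels) if lab == c]
        if not idx:
            continue
        pos = [positions[i] for i in idx]
        desc = [descriptors[i] for i in idx]
        nbrs = nearest_neighbors(pos, n_neighbors)
        # Fixed-parameter aggregation: average each node with its direct neighbors.
        embedded = [mean_vec([desc[i]] + [desc[j] for j in nbrs[i]])
                    for i in range(len(idx))]
        # Global mean pooling: one vector for the whole graph.
        vectors.append(mean_vec(embedded))
    return vectors

# Toy input: six key points whose 2-D descriptors form two well-separated groups.
positions = [(0, 0), (1, 0), (0, 1), (5, 5), (6, 5), (5, 6)]
descriptors = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0),
               (10.0, 10.0), (9.0, 10.0), (10.0, 9.0)]
vectors = graph_vectors(positions, descriptors, k=2)
print(vectors)
```

Because each image yields several vectors rather than one histogram, a query with one class hidden by an obstacle can still be compared against a database image through its remaining vectors. How those vectors are matched and scored is not specified in the abstract and is left out of this sketch.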

References

Pages: 49-56

Page count: 8

17 entries in total

[1]

Al Chanti D, 2018, Arxiv, DOI arXiv:1810.00360

[2]

[Anonymous], 2007, P INT WORKSHOP WORKS

[3]

Arthur D, 2007, PROCEEDINGS OF THE EIGHTEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P1027

[4]

Barnes Joel, 2004, P 60 ANN M I NAV

[5] Improving bag-of-visual-words image retrieval with predictive clustering trees [J]. Dimitrovski, Ivica; Kocev, Dragi; Loskovska, Suzana; Dzeroski, Saso. INFORMATION SCIENCES, 2016, 329: 851-865

[6]

El Ashry Abd Elgwad M, 2018, INT C EL ENG, V11, P1

[7]

Hamilton WL, 2018, Arxiv, DOI arXiv:1709.05584

[8]

Mostafa MM, 2017, P INT TECH M I NAVIG, P856

[9]

Qader W. A., 2019, IEEE INT C NEUR NETW, P200, DOI 10.1109/IEC47844.2019.8950616

[10] A Probabilistic Approach to WLAN User Location Estimation [J]. Roos T.; Myllymäki P.; Tirri H.; Misikangas P.; Sievänen J. International Journal of Wireless Information Networks, 2002, 9 (03): 155-164
