Semantic signatures for large-scale visual localization

被引：0

作者：

Li Weng

Valérie Gouet-Brunet

Bahman Soheilian

机构：

[1] Hangzhou Dianzi University,Department of Automation (Artificial Intelligence)

[2] Univ. Gustave Eiffel,LaSTIG Lab.

[3] ENSG,undefined

[4] IGN,undefined

来源：

Multimedia Tools and Applications | 2021年 / 80卷

关键词：

Database search; Information retrieval; Visual localization; Semantic feature; Urban computing;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Visual localization is a useful alternative to standard localization techniques. It works by utilizing cameras. In a typical scenario, features are extracted from captured images and compared with geo-referenced databases. Location information is then inferred from the matching results. Conventional schemes mainly use low-level visual features. These approaches offer good accuracy but suffer from scalability issues. In order to assist localization in large urban areas, this work explores a different path by utilizing high-level semantic information. It is found that object information in a street view can facilitate localization. A novel descriptor scheme called “semantic signature” is proposed to summarize this information. A semantic signature consists of type and angle information of visible objects at a spatial location. Several metrics and protocols are proposed for signature comparison and retrieval. They illustrate different trade-offs between accuracy and complexity. Extensive simulation results confirm the potential of the proposed scheme in large-scale applications. This paper is an extended version of a conference paper in CBMI’18. A more efficient retrieval protocol is presented with additional experiment results.

引用

页码：22347 / 22372

页数：25

共 50 条

[1] Semantic signatures for large-scale visual localization
Weng, Li
Gouet-Brunet, Valerie
Soheilian, Bahman
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (15) : 22347 - 22372
[2] VirtualLoc: Large-scale Visual Localization Using Virtual Images
Xiong, Yuan
Wang, Jingru
Zhou, Zhong
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (03)
[3] A Visual Backchannel for Large-Scale Events
Doerk, Marian
Gruen, Daniel
Williamson, Carey
Carpendale, Sheelagh
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2010, 16 (06) : 1129 - 1138
[4] Are Large-Scale 3D Models Really Necessary for Accurate Visual Localization?
Torii, Akihiko
Taira, Hajime
Sivic, Josef
Pollefeys, Marc
Okutomi, Masatoshi
Pajdla, Tomas
Sattler, Torsten
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (03) : 814 - 829
[5] Accurate and Robust Visual Localization System in Large-Scale Appearance-Changing Environments
Yu, Yang
Yun, Peng
Xue, Bohuan
Jiao, Jianhao
Fan, Rui
Liu, Ming
IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2022, 27 (06) : 5222 - 5232
[6] A large-scale dataset for indoor visual localization with high-precision ground truth
Liu, Yuchen
Gao, Wei
Hu, Zhanyi
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2022, 41 (02) : 129 - 135
[7] LASH: Large-Scale Academic Deep Semantic Hashing
Guo, Jia-Nan
Mao, Xian-Ling
Lan, Tian
Tu, Rong-Xin
Wei, Wei
Huang, Heyan
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (02) : 1734 - 1746
[8] Improving large-scale search engines with semantic annotations
Fuentes-Lorenzo, Damaris
Fernandez, Norberto
Fisteus, Jesus A.
Sanchez, Luis
EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (06) : 2287 - 2296
[9] Audio-visual large-scale video copy detection
Liu, Yang
Xu, Changsheng
Lu, Hanqing
INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 2011, 88 (18) : 3803 - 3816
[10] EfiLoc: large-scale visual indoor localization with efficient correlation between sparse features and 3D points
Li, Ning
Ai, Haojun
VISUAL COMPUTER, 2022, 38 (06) : 2091 - 2106

← 1 2 3 4 5 →