Semantic signatures for large-scale visual localization

被引:0
|
作者
Li Weng
Valérie Gouet-Brunet
Bahman Soheilian
机构
[1] Hangzhou Dianzi University,Department of Automation (Artificial Intelligence)
[2] Univ. Gustave Eiffel,LaSTIG Lab.
[3] ENSG,undefined
[4] IGN,undefined
来源
Multimedia Tools and Applications | 2021年 / 80卷
关键词
Database search; Information retrieval; Visual localization; Semantic feature; Urban computing;
D O I
暂无
中图分类号
学科分类号
摘要
Visual localization is a useful alternative to standard localization techniques. It works by utilizing cameras. In a typical scenario, features are extracted from captured images and compared with geo-referenced databases. Location information is then inferred from the matching results. Conventional schemes mainly use low-level visual features. These approaches offer good accuracy but suffer from scalability issues. In order to assist localization in large urban areas, this work explores a different path by utilizing high-level semantic information. It is found that object information in a street view can facilitate localization. A novel descriptor scheme called “semantic signature” is proposed to summarize this information. A semantic signature consists of type and angle information of visible objects at a spatial location. Several metrics and protocols are proposed for signature comparison and retrieval. They illustrate different trade-offs between accuracy and complexity. Extensive simulation results confirm the potential of the proposed scheme in large-scale applications. This paper is an extended version of a conference paper in CBMI’18. A more efficient retrieval protocol is presented with additional experiment results.
引用
收藏
页码:22347 / 22372
页数:25
相关论文
共 50 条
  • [1] Semantic signatures for large-scale visual localization
    Weng, Li
    Gouet-Brunet, Valerie
    Soheilian, Bahman
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (15) : 22347 - 22372
  • [2] VirtualLoc: Large-scale Visual Localization Using Virtual Images
    Xiong, Yuan
    Wang, Jingru
    Zhou, Zhong
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (03)
  • [3] A Visual Backchannel for Large-Scale Events
    Doerk, Marian
    Gruen, Daniel
    Williamson, Carey
    Carpendale, Sheelagh
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2010, 16 (06) : 1129 - 1138
  • [4] Are Large-Scale 3D Models Really Necessary for Accurate Visual Localization?
    Torii, Akihiko
    Taira, Hajime
    Sivic, Josef
    Pollefeys, Marc
    Okutomi, Masatoshi
    Pajdla, Tomas
    Sattler, Torsten
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (03) : 814 - 829
  • [5] Accurate and Robust Visual Localization System in Large-Scale Appearance-Changing Environments
    Yu, Yang
    Yun, Peng
    Xue, Bohuan
    Jiao, Jianhao
    Fan, Rui
    Liu, Ming
    IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2022, 27 (06) : 5222 - 5232
  • [6] A large-scale dataset for indoor visual localization with high-precision ground truth
    Liu, Yuchen
    Gao, Wei
    Hu, Zhanyi
    INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2022, 41 (02) : 129 - 135
  • [7] LASH: Large-Scale Academic Deep Semantic Hashing
    Guo, Jia-Nan
    Mao, Xian-Ling
    Lan, Tian
    Tu, Rong-Xin
    Wei, Wei
    Huang, Heyan
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (02) : 1734 - 1746
  • [8] Improving large-scale search engines with semantic annotations
    Fuentes-Lorenzo, Damaris
    Fernandez, Norberto
    Fisteus, Jesus A.
    Sanchez, Luis
    EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (06) : 2287 - 2296
  • [9] Audio-visual large-scale video copy detection
    Liu, Yang
    Xu, Changsheng
    Lu, Hanqing
    INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 2011, 88 (18) : 3803 - 3816
  • [10] EfiLoc: large-scale visual indoor localization with efficient correlation between sparse features and 3D points
    Li, Ning
    Ai, Haojun
    VISUAL COMPUTER, 2022, 38 (06) : 2091 - 2106