Learning Quintuplet Loss for Large-Scale Visual Geolocalization

被引:4
|
作者
Zhai, Qiang [1 ]
Huang, Rui [2 ]
Cheng, Hong [1 ]
Zhan, Huiqin [1 ]
Li, Jun [3 ]
Liu, Zicheng [4 ]
机构
[1] Univ Elect Sci & Technol China, Ctr Robot, Chengdu, Peoples R China
[2] Univ Elect Sci & Technol China, Ctr Robot, Sch Automat Engn, Chengdu, Peoples R China
[3] Tsinghua Univ, Sch Vehicle & Transportat, Beijing, Peoples R China
[4] Microsoft Res Redmond, Redmond, WA USA
基金
美国国家科学基金会;
关键词
Feature extraction; Measurement; Task analysis; Training data; Learning systems; Visualization; Image recognition; visual geo-localization; triplet loss; quintuplet loss; deep neural network;
D O I
10.1109/MMUL.2020.2996941
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
With the maturity of artificial intelligence technology, large-scale visual geolocalization (LSVGL) is increasingly important in urban computing, where the task is to accurately and efficiently recognize the geolocation of a given query image. The main challenge of LSVGL faced by many experiments due to the appearance of real-word places may differ in various ways while perspective deviation almost inevitably exists between training images and query images because of the arbitrary perspective. To cope with this situation, in this article, we in-depth analyze the limitation of triplet loss, which is the most commonly used metric learning loss in state-of-the-art LSVGL framework and propose a new quintuplet loss by embedding all the potential positive samples to the primitive triplet loss. Extensive experiments are conducted to verify the effectiveness of the proposed approach and the results demonstrate that our new loss can enhance various LSVGL methods.
引用
收藏
页码:34 / 43
页数:10
相关论文
共 50 条
  • [1] Large-Scale Geolocalization of Overhead Imagery
    Divecha, Mehul
    Newsam, Shawn
    24TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2016), 2016,
  • [2] ON ADVERSARIAL ROBUSTNESS OF LARGE-SCALE AUDIO VISUAL LEARNING
    Li, Juncheng B.
    Qu, Shuhui
    Li, Xinjian
    Huang, Po-Yao
    Metze, Florian
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 231 - 235
  • [3] Efficient Large-Scale Visual Representation Learning and Evaluation
    Dolev, Eden
    Awad, Alaa
    Roberts, Denisa Olteanu
    Ebrahimzadeh, Zahra
    Mejran, Marcin
    Malpani, Vaibhav
    Yavuz, Mahir
    REVOLUTIONIZING FASHION AND RETAIL, 2025, 1299 : 97 - 111
  • [4] Discriminative Learning of Relaxed Hierarchy for Large-scale Visual Recognition
    Gao, Tianshi
    Koller, Daphne
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2011, : 2072 - 2079
  • [5] Three Guidelines of Online Learning for Large-Scale Visual Recognition
    Ushiku, Yoshitaka
    Hidaka, Masatoshi
    Harada, Tatsuya
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 3574 - 3581
  • [6] Learning Compact Visual Attributes for Large-Scale Image Classification
    Su, Yu
    Jurie, Frederic
    COMPUTER VISION - ECCV 2012, PT III, 2012, 7585 : 51 - 60
  • [7] Fast Learning Discriminative Dictionaries for Large-scale Visual Recognition
    Zhao, Tianyi
    Qu, Yanyun
    Fan, Jianping
    2015 IEEE 17TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2015,
  • [8] Geolocalization of Large-Scale DAS Channels Using a GPS-Tracked Moving Vehicle
    Biondi, Ettore
    Wang, Xin
    Williams, Ethan F.
    Zhan, Zhongwen
    SEISMOLOGICAL RESEARCH LETTERS, 2023, 94 (01) : 318 - 330
  • [9] Novel Considerations in the ML/AI Modeling of Large-Scale Learning Loss
    Elizondo, Mirna
    Yu, June
    Payan, Daniel
    Feng, Li
    Tesic, Jelena
    IEEE ACCESS, 2025, 13 : 7780 - 7792
  • [10] A Visual Backchannel for Large-Scale Events
    Doerk, Marian
    Gruen, Daniel
    Williamson, Carey
    Carpendale, Sheelagh
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2010, 16 (06) : 1129 - 1138