Learning Quintuplet Loss for Large-Scale Visual Geolocalization

被引：4

作者：

Zhai, Qiang ^{[1
]}

Huang, Rui ^{[2
]}

Cheng, Hong ^{[1
]}

Zhan, Huiqin ^{[1
]}

Li, Jun ^{[3
]}

Liu, Zicheng ^{[4
]}

机构：

[1] Univ Elect Sci & Technol China, Ctr Robot, Chengdu, Peoples R China

[2] Univ Elect Sci & Technol China, Ctr Robot, Sch Automat Engn, Chengdu, Peoples R China

[3] Tsinghua Univ, Sch Vehicle & Transportat, Beijing, Peoples R China

[4] Microsoft Res Redmond, Redmond, WA USA

来源：

IEEE MULTIMEDIA | 2020年 / 27卷 / 03期

基金：

美国国家科学基金会;

关键词：

Feature extraction; Measurement; Task analysis; Training data; Learning systems; Visualization; Image recognition; visual geo-localization; triplet loss; quintuplet loss; deep neural network;

D O I：

10.1109/MMUL.2020.2996941

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

With the maturity of artificial intelligence technology, large-scale visual geolocalization (LSVGL) is increasingly important in urban computing, where the task is to accurately and efficiently recognize the geolocation of a given query image. The main challenge of LSVGL faced by many experiments due to the appearance of real-word places may differ in various ways while perspective deviation almost inevitably exists between training images and query images because of the arbitrary perspective. To cope with this situation, in this article, we in-depth analyze the limitation of triplet loss, which is the most commonly used metric learning loss in state-of-the-art LSVGL framework and propose a new quintuplet loss by embedding all the potential positive samples to the primitive triplet loss. Extensive experiments are conducted to verify the effectiveness of the proposed approach and the results demonstrate that our new loss can enhance various LSVGL methods.

引用

页码：34 / 43

页数：10

共 50 条

[1] Large-Scale Geolocalization of Overhead Imagery
Divecha, Mehul
Newsam, Shawn
24TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2016), 2016,
[2] ON ADVERSARIAL ROBUSTNESS OF LARGE-SCALE AUDIO VISUAL LEARNING
Li, Juncheng B.
Qu, Shuhui
Li, Xinjian
Huang, Po-Yao
Metze, Florian
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 231 - 235
[3] Efficient Large-Scale Visual Representation Learning and Evaluation
Dolev, Eden
Awad, Alaa
Roberts, Denisa Olteanu
Ebrahimzadeh, Zahra
Mejran, Marcin
Malpani, Vaibhav
Yavuz, Mahir
REVOLUTIONIZING FASHION AND RETAIL, 2025, 1299 : 97 - 111
[4] Discriminative Learning of Relaxed Hierarchy for Large-scale Visual Recognition
Gao, Tianshi
Koller, Daphne
2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2011, : 2072 - 2079
[5] Three Guidelines of Online Learning for Large-Scale Visual Recognition
Ushiku, Yoshitaka
Hidaka, Masatoshi
Harada, Tatsuya
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 3574 - 3581
[6] Learning Compact Visual Attributes for Large-Scale Image Classification
Su, Yu
Jurie, Frederic
COMPUTER VISION - ECCV 2012, PT III, 2012, 7585 : 51 - 60
[7] Fast Learning Discriminative Dictionaries for Large-scale Visual Recognition
Zhao, Tianyi
Qu, Yanyun
Fan, Jianping
2015 IEEE 17TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2015,
[8] Geolocalization of Large-Scale DAS Channels Using a GPS-Tracked Moving Vehicle
Biondi, Ettore
Wang, Xin
Williams, Ethan F.
Zhan, Zhongwen
SEISMOLOGICAL RESEARCH LETTERS, 2023, 94 (01) : 318 - 330
[9] Novel Considerations in the ML/AI Modeling of Large-Scale Learning Loss
Elizondo, Mirna
Yu, June
Payan, Daniel
Feng, Li
Tesic, Jelena
IEEE ACCESS, 2025, 13 : 7780 - 7792
[10] A Visual Backchannel for Large-Scale Events
Doerk, Marian
Gruen, Daniel
Williamson, Carey
Carpendale, Sheelagh
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2010, 16 (06) : 1129 - 1138

← 1 2 3 4 5 →