A news picture geo-localization pipeline based on deep learning and street view images

被引:5
作者
Chu, Tianyou [1 ]
Chen, Yumin [1 ]
Su, Heng [1 ]
Xu, Zhenzhen [1 ]
Chen, Guodong [1 ]
Zhou, Annan [1 ]
机构
[1] Wuhan Univ, Sch Resource & Environm Sci, 129 Luoyu Rd, Wuhan, Peoples R China
关键词
Street view images; geo-localization; image retrieval; social media; VISUAL PLACE RECOGNITION; GEOGRAPHICAL DISPARITIES; KERNELS;
D O I
10.1080/17538947.2022.2121437
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
Numerous news or event pictures are taken and shared on the internet every day that have abundant information worth being mined, but only a small fraction of them are geotagged. The visual content of the news image hints at clues of the geographical location because they are usually taken at the site of the incident, which provides a prerequisite for geo-localization. This paper proposes an automated pipeline based on deep learning for the geo-localization of news pictures in a large-scale urban environment using geotagged street view images as a reference dataset. The approach obtains location information by constructing an attention-based feature extraction network. Then, the image features are aggregated, and the candidate street view image results are retrieved by the selective matching kernel function. Finally, the coordinates of the news images are estimated by the kernel density prediction method. The pipeline is tested in the news pictures in Hong Kong. In the comparison experiments, the proposed pipeline shows stable performance and generalizability in the large-scale urban environment. In addition, the performance analysis of components in the pipeline shows the ability to recognize localization features of partial areas in pictures and the effectiveness of the proposed solution in news picture geo-localization.
引用
收藏
页码:1485 / 1505
页数:21
相关论文
共 57 条
[1]  
Arandjelovic R, 2018, IEEE T PATTERN ANAL, V40, P1437, DOI [10.1109/TPAMI.2017.2711011, 10.1109/CVPR.2016.572]
[2]   All about VLAD [J].
Arandjelovic, Relja ;
Zisserman, Andrew .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :1578-1585
[3]   An Efficient and Scalable Collection of Fly-Inspired Voting Units for Visual Place Recognition in Changing Environments [J].
Arcanjo, Bruno ;
Ferrarini, Bruno ;
Milford, Michael ;
McDonald-Maier, Klaus D. ;
Ehsan, Shoaib .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02) :2527-2534
[4]   Aggregating Deep Convolutional Features for Image Retrieval [J].
Babenko, Artem ;
Lempitsky, Victor .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1269-1277
[5]  
Berton G. Mereu R. Trivigno G. Masone C. Csurka G. Sattler T. Caputo B., 2022, P IEEE CVF C COMP VI, P5396
[6]   Facebook and Whatsapp as disaster management tools during the Chennai (India) floods of 2015 [J].
Bhuvana, N. ;
Aram, I. Arul .
INTERNATIONAL JOURNAL OF DISASTER RISK REDUCTION, 2019, 39
[7]   Crowd-sourced pictures geo-localization method based on street view images and 3D reconstruction [J].
Cheng, Liang ;
Yuan, Yi ;
Xia, Nan ;
Chen, Song ;
Chen, Yanming ;
Yang, Kang ;
Ma, Lei ;
Li, Manchun .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 141 :72-85
[8]   Use of Tencent Street View Imagery for Visual Perception of Streets [J].
Cheng, Liang ;
Chu, Sensen ;
Zong, Wenwen ;
Li, Shuyi ;
Wu, Jie ;
Li, Manchun .
ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2017, 6 (09)
[9]   A Grid Feature-Point Selection Method for Large-Scale Street View Image Retrieval Based on Deep Local Features [J].
Chu, Tianyou ;
Chen, Yumin ;
Huang, Liheng ;
Xu, Zhigiang ;
Tan, Huangyuan .
REMOTE SENSING, 2020, 12 (23) :1-18
[10]   Automatic Caption Generation for News Images [J].
Feng, Yansong ;
Lapata, Mirella .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (04) :797-812