On-Device Mobile Visual Location Recognition by Integrating Vision and Inertial Sensors

被引:68
作者
Guan, Tao [1 ]
He, Yunfeng [1 ]
Gao, Juan [1 ]
Yang, Jianzhong [1 ]
Yu, Junqing [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan 430074, Peoples R China
基金
中国国家自然科学基金;
关键词
Mobile visual location recognition; on-device; vector quantization; vision and inertial sensors integration;
D O I
10.1109/TMM.2013.2265674
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper deals with the problem of city scale on-device mobile visual location recognition by fusing the inertial sensors and computer vision techniques. The main contributions are as follows: Firstly, we design an efficient vector quantization strategy by combining the Transform Coding (TC) and Residual Vector Quantization (RVQ). Our method can compress a visual descriptor into only several bytes while providing reasonable searching accuracy, which makes the managing of city scale image database directly on mobile devices come true. Secondly, we integrate the information from inertial sensors into the Vector of Locally Aggregated Descriptors (VLAD) generation and image similarity evaluation processes. Our method is not only fast enough for on-device implementation, but it also can improve the location recognition accuracy obviously. Thirdly, we also release a set of 1.295 million geo-tagged street view images with the information from inertial sensors, as well as a difficult set of query images. These resources can be used as a new benchmark to facilitate further research in the area. Experimental results prove the validity of the proposed methods for on-device mobile visual location recognition applications.
引用
收藏
页码:1688 / 1699
页数:12
相关论文
共 24 条
[1]  
[Anonymous], P 11 EUR C COMP VIS
[2]  
Baatz G., 2010, P 11 EUR C COMP VIS
[3]   Speeded-Up Robust Features (SURF) [J].
Bay, Herbert ;
Ess, Andreas ;
Tuytelaars, Tinne ;
Van Gool, Luc .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2008, 110 (03) :346-359
[4]   Transform Coding for Fast Approximate Nearest Neighbor Search in High Dimensions [J].
Brandt, Jonathan .
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, :1815-1822
[5]   Compressed Histogram of Gradients: A Low-Bitrate Descriptor [J].
Chandrasekhar, Vijay ;
Takacs, Gabriel ;
Chen, David M. ;
Tsai, Sam S. ;
Reznik, Yuriy ;
Grzeszczuk, Radek ;
Girod, Bernd .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2012, 96 (03) :384-399
[6]  
Chen D., 2011, P AS C SIGN SYST COM
[7]  
Chen DM, 2011, PROC CVPR IEEE, P737, DOI 10.1109/CVPR.2011.5995610
[8]   Tree Histogram Coding for Mobile Image Matching [J].
Chen, David M. ;
Tsai, Sam S. ;
Chandrasekhar, Vijay ;
Takacs, Gabriel ;
Singh, Jatinder ;
Girod, Bernd .
DCC 2009: 2009 DATA COMPRESSION CONFERENCE, PROCEEDINGS, 2008, :143-+
[9]   Integrated Content and Context Analysis for Mobile Landmark Recognition [J].
Chen, Tao ;
Yap, Kim-Hui ;
Chau, Lap-Pui .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2011, 21 (10) :1476-1486
[10]   Approximate Nearest Neighbor Search by Residual Vector Quantization [J].
Chen, Yongjian ;
Guan, Tao ;
Wang, Cheng .
SENSORS, 2010, 10 (12) :11259-11273