Mobile Visual Search Using Image and Text Features

被引:0
|
作者
Tsai, Sam S. [1 ]
Chen, Huizhong [1 ]
Chen, David [1 ]
Vedantham, Ramakrishna [2 ]
Grzeszczuk, Radek [2 ]
Girod, Bernd [1 ]
机构
[1] Stanford Univ, Dept Elect Engn, Stanford, CA 94305 USA
[2] Nokia Res Ctr, Palo Alto, CA 94304 USA
来源
2011 CONFERENCE RECORD OF THE FORTY-FIFTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS (ASILOMAR) | 2011年
关键词
mobile visual search; image retrieval; document retrieval; document analysis;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We present a mobile visual search system that utilizes both text and low bit-rate image features. Using a cameraphone, a user can snap a picture of a document image and search for the document in online databases. From the query image, the title text is detected and recognized and image features are extracted and compressed, as well. Both types of information are sent from the cameraphone client to a server. The server uses the recognized title to retrieve candidate documents from online databases. Then, image features are used to select the correct document(s). We show that by using a novel geometric verification method that incorporates both text and image feature information, we can reduce the missed positives up to 50%. The proposed method can also speed up the geometric process, enabling a larger set of verified titles, resulting in a superior performance compared to previous schemes.
引用
收藏
页码:845 / 849
页数:5
相关论文
共 50 条
  • [41] Feature grouping and local soft match for mobile visual search
    Liu, Xianglong
    Lang, Bo
    Xu, Yi
    Cheng, Bo
    PATTERN RECOGNITION LETTERS, 2012, 33 (03) : 239 - 246
  • [42] TRANSMITTING INFORMATIVE COMPONENTS OF FISHER CODES FOR MOBILE VISUAL SEARCH
    Zhang, Guixuan
    Zeng, Zhi
    Zhang, Shuwu
    Guo, Qinzhen
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 1136 - 1140
  • [43] Progressive transmission based on wavelet used in mobile visual search
    Miao, Siyang
    Li, Zhiyang
    Qu, Wenyu
    Du, Yegang
    Wang, Songhe
    Qi, Heng
    INTERNATIONAL JOURNAL OF EMBEDDED SYSTEMS, 2014, 6 (2-3) : 114 - 123
  • [44] DEEP-BASED FISHER VECTOR FOR MOBILE VISUAL SEARCH
    Huang, Chen
    Zhang, Shengchuan
    Lin, Xianming
    Liu, Xiangrong
    Ji, Rongrong
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 3430 - 3434
  • [45] SORTING LOCAL DESCRIPTORS FOR LOWBIT RATE MOBILE VISUAL SEARCH
    Chen, Jie
    Duan, Ling-Yu
    Ji, Rongrong
    Yao, Hongxun
    Gao, Wen
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 1029 - 1032
  • [46] Fast verification via statistical geometric for mobile visual search
    Miaohui Zhang
    Shaozi Li
    Xianming Lin
    Songzhi Su
    Rongrong Ji
    Multimedia Systems, 2016, 22 : 525 - 534
  • [47] Fast verification via statistical geometric for mobile visual search
    Zhang, Miaohui
    Li, Shaozi
    Lin, Xianming
    Su, Songzhi
    Ji, Rongrong
    MULTIMEDIA SYSTEMS, 2016, 22 (04) : 525 - 534
  • [48] A framework for rapid visual image search using single-trial brain evoked responses
    Huang, Yonghong
    Erdogmus, Deniz
    Pavel, Misha
    Mathan, Santosh
    Hild, Kenneth E., II
    NEUROCOMPUTING, 2011, 74 (12-13) : 2041 - 2051
  • [49] DIMENSIONALITY REDUCTION OF VISUAL FEATURES USING SPARSE PROJECTORS FOR CONTENT-BASED IMAGE RETRIEVAL
    Negrel, Romain
    Picard, David
    Gosselin, Philippe-Henri
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 2192 - 2196
  • [50] Chinese image captioning with fusion encoder and visual keyword search
    Zou, Yang
    Liao, Shiyu
    Wang, Qifei
    IET IMAGE PROCESSING, 2024, 18 (11) : 3055 - 3069