An Interactive Approach for Filtering Out Junk Images From Keyword-Based Google Search Results

被引:11
作者
Gao, Yuli [1 ]
Peng, Jinye [2 ]
Luo, Hangzai [3 ]
Keim, Daniel A. [4 ]
Fan, Jianping [5 ]
机构
[1] Hewlett Packard Labs, Palo Alto, CA 94304 USA
[2] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710072, Peoples R China
[3] E China Normal Univ, Inst Software Engn, Shanghai 200062, Peoples R China
[4] Univ Konstanz, Dept Comp & Informat Sci, D-78464 Constance, Germany
[5] Univ N Carolina, Dept Comp Sci, Charlotte, NC 28223 USA
基金
美国国家科学基金会;
关键词
Hyperbolic image visualization; incremental kernel learning; junk image filtering; mixture-of-kernels; user-system interaction; RELEVANCE-FEEDBACK; RETRIEVAL;
D O I
10.1109/TCSVT.2009.2026968
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The keyword-based Google Images search engine is now becoming very popular for online image search. Unfortunately, only the text terms that are explicitly or implicitly linked with the images are used for image indexing but the associated text terms may not have exact correspondence with the underlying image semantics, thus the keyword-based Google Images search engine may return large amounts of junk images which are irrelevant to the given keyword-based queries. Based on this observation, we have developed an interactive approach to filter out the junk images from keyword-based Google Images search results and our approach consists of the following major components. a) A kernel-based image clustering technique is developed to partition the returned images into multiple clusters and outliers. b) Hyperbolic visualization is incorporated to display large amounts of returned images according to their nonlinear visual similarity contexts, so that users can assess the relevance between the returned images and their real query intentions interactively and select one or multiple images to express their query intentions and personal preferences precisely. c) An incremental kernel learning algorithm is developed to translate the users' query intentions and personal preferences for updating the mixture-of-kernels and generating better hypotheses to achieve more accurate clustering of the returned images and filter out the junk images more effectively. Experiments on diverse keyword-based queries from Google Images search engine have obtained very positive results. Our junk image filtering system is released for public evaluation at: http://www.cs.uncc.edu/similar to jfan/google_demo/.
引用
收藏
页码:1851 / 1865
页数:15
相关论文
共 33 条
  • [1] [Anonymous], P ACM INT C MULT
  • [2] [Anonymous], P ICML
  • [3] [Anonymous], P EUR C COMP VIS ECC
  • [4] [Anonymous], P ACM INT C MULT
  • [5] [Anonymous], P IEEE C COMP VIS PA
  • [6] [Anonymous], P IEEE ICIP
  • [7] [Anonymous], P IEEE COMP VIS PATT
  • [8] [Anonymous], P ACM INT C MULT, DOI DOI 10.1145/1101149.1101167
  • [9] [Anonymous], J MACH LEARNING RES
  • [10] [Anonymous], P IEEE COMP VIS PATT