Semisupervised SVM Batch Mode Active Learning with Applications to Image Retrieval

被引:115
作者
Hoi, Steven C. H. [1 ]
Jin, Rong [2 ]
Zhu, Jianke
Lyu, Michael R. [3 ]
机构
[1] Nanyang Technol Univ, Sch Comp Engn, Div Informat Syst, Singapore 639798, Singapore
[2] Michigan State Univ, Dept Comp Sci & Engn, E Lansing, MI 48824 USA
[3] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Hong Kong, Hong Kong, Peoples R China
基金
美国国家科学基金会;
关键词
Algorithms; Design; Experimentation; Performance; Content-based image retrieval; support vector machines; active learning; semisupervised learning; batch mode active learning; human-computer interaction; RELEVANCE-FEEDBACK;
D O I
10.1145/1508850.1508854
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Support vector machine (SVM) active learning is one popular and successful technique for relevance feedback in content-based image retrieval (CBIR). Despite the success, conventional SVM active learning has two main drawbacks. First, the performance of SVM is usually limited by the number of labeled examples. It often suffers a poor performance for the small-sized labeled examples, which is the case in relevance feedback. Second, conventional approaches do not take into account the redundancy among examples, and could select multiple examples that are similar (or even identical). In this work, we propose a novel scheme for explicitly addressing the drawbacks. It first learns a kernel function from a mixture of labeled and unlabeled data, and therefore alleviates the problem of small-sized training data. The kernel will then be used for a batch mode active learning method to identify the most informative and diverse examples via a min-max framework. Two novel algorithms are proposed to solve the related combinatorial optimization: the first approach approximates the problem into a quadratic program, and the second solves the combinatorial optimization approximately by a greedy algorithm that exploits the merits of submodular functions. Extensive experiments with image retrieval using both natural photo images and medical images show that the proposed algorithms are significantly more effective than the state-of-the-art approaches. A demo is available at http://msm.cais.ntu.edu.sg/LSCBIR/.
引用
收藏
页数:29
相关论文
共 49 条
  • [1] [Anonymous], 2006, IEEE T NEURAL NETWOR
  • [2] [Anonymous], 2005, P INT C MACH LEARN I
  • [3] [Anonymous], 2006, WORLD WIDE WEB, DOI DOI 10.1145/1135777.1135870
  • [4] [Anonymous], P IEEE C COMP VIS PA
  • [5] [Anonymous], P ACM SIGKDD
  • [6] Boyd S., 2004, CONVEX OPTIMIZATION, DOI DOI 10.1017/CBO9780511804441
  • [7] BRINKER K, 2003, P INT C MACH LEARN I
  • [8] COHN DA, 1995, P NIPS
  • [9] Dagli CK, 2006, LECT NOTES COMPUT SC, V4071, P123
  • [10] Goh K., 2004, PROC ACM INT C MULTI, P564