Learning by expansion: Exploiting social media for image classification with few training examples

被引:8
作者
Wang, Sheng-Yuan [1 ]
Liao, Wei-Shing [1 ]
Hsieh, Liang-Chi [1 ]
Chen, Yan-Ying [1 ]
Hsu, Winston H. [1 ]
机构
[1] Natl Taiwan Univ, Taipei 10764, Taiwan
关键词
Object recognition; Image classification; Web image search; Crowdsourcing; Semantic query expansion; SCENE;
D O I
10.1016/j.neucom.2011.05.043
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Witnessing the sheer amount of user-contributed photos and videos, we argue to leverage such freely available image collections as the training images for image classification. We propose an image expansion framework to mine more semantically related training images from the auxiliary image collection provided with very few training examples. The expansion is based on a semantic graph considering both visual and (noisy) textual similarities in the auxiliary image collections, where we also consider scalability issues (e.g., MapReduce) as constructing the graph. We found the expanded images not only reduce the time-consuming (manual) annotation efforts but also further improve the classification accuracy since more visually diverse training images are included. Experimenting in certain benchmarks, we show that the expanded training images improve image classification significantly. Furthermore, we achieve more than 27% relative improvement in accuracy compared to the state-of-the-art training image crowdsourcing approaches by exploiting media sharing services (such as Flickr) for additional training images. (c) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:117 / 125
页数:9
相关论文
共 32 条
[1]  
[Anonymous], ACM MULTIMEDIA
[2]  
[Anonymous], 2007, P 6 ACM INT C IM VID, DOI [DOI 10.1145/1282280.1282340, 10.1145/1282280.1282340]
[3]  
[Anonymous], 2008, CVPR
[4]  
[Anonymous], 2003, ICCV
[5]  
Berg TamaraL., 2006, CVPR, DOI DOI 10.1109/CVPR.2006.57
[6]  
Berg TL, 2004, PROC CVPR IEEE, P848
[7]  
Blum A., 1998, Proceedings of the Eleventh Annual Conference on Computational Learning Theory, P92, DOI 10.1145/279943.279962
[8]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[9]  
Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137
[10]  
Delalleau Olivier., 2006, SEMISUPERVISED LEARN, P333