An Informed Framework for Training Classifiers from Social Media

Cited by: 0
Authors
Cheng, Dong Seon [1]
Abdulhak, Sami Abduljalil [2]
Affiliations
[1] Hankuk Univ Foreign Studies, Dept Comp Sci & Engn, 81 Oedae Ro, Yongin 449791, Gyeonggi Do, South Korea
[2] Univ Verona, Dept Comp Sci, Str Le Grazie 15, I-37134 Verona, Italy
Keywords
training sets; image classification; Shannon entropy; social media
DOI
10.3390/e18040130
Chinese Library Classification (CLC) number
O4 [Physics]
Subject classification code
0702
Abstract
Extracting information from social media has become a major focus of companies and researchers in recent years. Aside from the study of the social aspects, it has also been found feasible to exploit the collaborative strength of crowds to help solve classical machine learning problems like object recognition. In this work, we focus on the generally underappreciated problem of building effective datasets for training classifiers by automatically assembling data from social media. We detail some of the challenges of this approach and outline a framework that uses expanded search queries to retrieve more qualified data. In particular, we concentrate on collaboratively tagged media on the social platform Flickr, and on the problem of image classification to evaluate our approach. Finally, we describe a novel entropy-based method to incorporate an information-theoretic principle to guide our framework. Experimental validation against well-known public datasets shows the viability of this approach and marks an improvement over the state of the art in terms of simplicity and performance.
Pages: 15