Personalized Deep Learning for Tag Recommendation

被引:49
作者
Nguyen, Hanh T. H. [1 ]
Wistuba, Martin [1 ]
Grabocka, Josif [1 ]
Drumond, Lucas Rego [1 ]
Schmidt-Thieme, Lars [1 ]
机构
[1] Univ Hildesheim, Informat Syst & Machine Learning Lab, Univ Pl 1, D-31141 Hildesheim, Germany
来源
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2017, PT I | 2017年 / 10234卷
关键词
Image tagging; Convolutional Neural Networks; Personalized tag recommendation;
D O I
10.1007/978-3-319-57454-7_15
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Social media services deploy tag recommendation systems to facilitate the process of tagging objects which depends on the information of both the user's preferences and the tagged object. However, most image tag recommender systems do not consider the additional information provided by the uploaded image but rely only on textual information, or make use of simple low-level image features. In this paper, we propose a personalized deep learning approach for the image tag recommendation that considers the user's preferences, as well as visual information. We employ Convolutional Neural Networks (CNNs), which already provide excellent performance for image classification and recognition, to obtain visual features from images in a supervised way. We provide empirical evidence that features selected in this fashion improve the capability of tag recommender systems, compared to the current state of the art that is using hand-crafted visual features, or is solely based on the tagging history information. The proposed method yields up to at least two percent accuracy improvement in two real world datasets, namely NUS-WIDE and Flickr-PTR.
引用
收藏
页码:186 / 197
页数:12
相关论文
共 24 条
[11]   Gradient-based learning applied to document recognition [J].
Lecun, Y ;
Bottou, L ;
Bengio, Y ;
Haffner, P .
PROCEEDINGS OF THE IEEE, 1998, 86 (11) :2278-2324
[12]  
Li Xirong., 2008, Proceedings of the 1st ACM International Conference on Multimedia Information Retrieval, MIR '08, P180
[13]  
McParlane Philip J., 2014, MultiMedia Modeling. 20th Anniversary International Conference, MMM 2014. Proceedings: LNCS 8325, P133, DOI 10.1007/978-3-319-04114-8_12
[14]   WORDNET - A LEXICAL DATABASE FOR ENGLISH [J].
MILLER, GA .
COMMUNICATIONS OF THE ACM, 1995, 38 (11) :39-41
[15]   Tagging photos using users' vocabularies [J].
Qian, Xueming ;
Liu, Xiaoxiao ;
Zheng, Chao ;
Du, Youtian ;
Hou, Xingsong .
NEUROCOMPUTING, 2013, 111 :144-153
[16]  
Rae Adam., 2010, Adaptivity, Personalization and Fusion of Heterogeneous Information, P92
[17]  
Rendle Steffen, 2010, Proceedings 2010 10th IEEE International Conference on Data Mining (ICDM 2010), P995, DOI 10.1109/ICDM.2010.127
[18]  
Rendle S, 2009, UAI 2009, P452, DOI DOI 10.5555/1795114.1795167
[19]  
Rendle S, 2009, KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, P727
[20]  
Rendle Steffen, 2010, P 3 ACM INT C WEB SE, P81, DOI DOI 10.1145/1718487.1718498