Personalized Tag Recommendation for Images Using Deep Transfer Learning

被引:23
作者
Nguyen, Hanh T. H. [1 ]
Wistuba, Martin [1 ]
Schmidt-Thieme, Lars [1 ]
机构
[1] Univ Hildesheim, Informat Syst & Machine Learning Lab, Univ Pl 1, D-31141 Hildesheim, Germany
来源
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2017, PT II | 2017年 / 10535卷
关键词
Image tagging; Convolutional neural networks; Personalized tag recommendation; Factorization models;
D O I
10.1007/978-3-319-71246-8_43
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image tag recommendation in social media systems provides the users with personalized tag suggestions which facilitate the users' tagging task and enable automatic organization and many image retrieval tasks. Factorization models are a widely used approach for personalized tag recommendation and achieve good results. These methods rely on the user's tagging preferences only and ignore the contents of the image. However, it is obvious that especially the contents of the image, such as the objects appearing in the image, colors, shapes or other visual aspects, strongly influence the user's tagging decisions. We present a personalized content-aware image tag recommendation approach that combines both historical tagging information and image-based features in a factorization model. Employing transfer learning, we apply state of the art deep learning image classification and object detection techniques to extract powerful features from the images. Both, image information and tagging history, are fed to an adaptive factorization model to recommend tags. Empirically, we can demonstrate that the visual and object-based features can improve the performance up to 1.5% over the state of the art.
引用
收藏
页码:705 / 720
页数:16
相关论文
共 19 条
[1]  
[Anonymous], 2008, P 17 INT C WORLD WID
[2]  
[Anonymous], 2012, RECOMMENDER SYSTEMS
[3]  
[Anonymous], 2013, ARXIV13124894
[4]  
Chua T.-S., 2009, ACM INT C IM VID RET, p48:1
[5]  
Garg N, 2008, RECSYS'08: PROCEEDINGS OF THE 2008 ACM CONFERENCE ON RECOMMENDER SYSTEMS, P67
[6]  
Jäschke R, 2007, LECT NOTES ARTIF INT, V4702, P506
[7]  
Joseph RK, 2016, CRIT POL ECON S ASIA, P1
[8]   Real-time computerized annotation of pictures [J].
Li, Jia ;
Wang, James Z. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (06) :985-1002
[9]  
Li Xirong., 2008, Proceedings of the 1st ACM International Conference on Multimedia Information Retrieval, MIR '08, P180
[10]   WORDNET - A LEXICAL DATABASE FOR ENGLISH [J].
MILLER, GA .
COMMUNICATIONS OF THE ACM, 1995, 38 (11) :39-41