Rating Image Aesthetics Using Deep Learning

被引:165
作者
Lu, Xin [1 ]
Lin, Zhe [2 ]
Jin, Hailin [2 ]
Yang, Jianchao [2 ]
Wang, James. Z. [1 ]
机构
[1] Penn State Univ, Coll Informat Sci & Technol, University Pk, PA 16802 USA
[2] Adobe Syst Inc, Adobe Res, San Jose, CA 95110 USA
关键词
Automatic feature learning; deep neural networks; image aesthetics; QUALITY;
D O I
10.1109/TMM.2015.2477040
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper investigates unified feature learning and classifier training approaches for image aesthetics assessment. Existing methods built upon handcrafted or generic image features and developed machine learning and statistical modeling techniques utilizing training examples. We adopt a novel deep neural network approach to allow unified feature learning and classifier training to estimate image aesthetics. In particular, we develop a double-column deep convolutional neural network to support heterogeneous inputs, i.e., global and local views, in order to capture both global and local characteristics of images. In addition, we employ the style and semantic attributes of images to further boost the aesthetics categorization performance. Experimental results show that our approach produces significantly better results than the earlier reported results on the AVA dataset for both the generic image aesthetics and content-based image aesthetics. Moreover, we introduce a 1.5-million image dataset (IAD) for image aesthetics assessment and we further boost the performance on the AVA test set by training the proposed deep neural networks on the IAD dataset.
引用
收藏
页码:2021 / 2034
页数:14
相关论文
共 32 条
[1]  
Agostinelli F., 2013, Advances in Neural Information Processing Systems (NIPS), V1, P1493
[2]  
[Anonymous], 2011, P 19 ACM INT C MULTI
[3]  
[Anonymous], 2006, Proceedings of the Conference on Computer Vision and Pattern Recognition
[4]  
[Anonymous], 2010, ACM MULTIMEDIA 2010
[5]  
[Anonymous], 2014, P BMVC
[6]  
[Anonymous], 2010, IEEE Transactions on Knowledge and Data Engineering, DOI [DOI 10.1109/TKDE.2009.191, 10.1109/TKDE.2009.191, 10.1007/978-981-15-5971-6_83]
[7]  
Arnheim R., 1954, Art and Visual Perception: A Psychology of the Creative Eye
[8]  
Ciresan D, 2012, PROC CVPR IEEE, P3642, DOI 10.1109/CVPR.2012.6248110
[9]  
Collobert Ronan, 2008, INT C MACHINE LEARNI, P160, DOI DOI 10.1145/1390156.1390177
[10]  
Datta R, 2006, LECT NOTES COMPUT SC, V3953, P288, DOI 10.1007/11744078_23