Distribution-Oriented Aesthetics Assessment With Semantic-Aware Hybrid Network

被引:69
作者
Cui, Chaoran [1 ]
Liu, Huihui [2 ]
Lian, Tao [3 ]
Nie, Liqiang [2 ]
Zhu, Lei [4 ]
Yin, Yilong [5 ]
机构
[1] Shandong Univ Finance & Econ, Sch Comp Sci & Technol, Jinan 250014, Shandong, Peoples R China
[2] Shandong Univ, Sch Comp Sci & Technol, Jinan 250101, Shandong, Peoples R China
[3] Taiyuan Univ Technol, Coll Data Sci, Taiyuan 030024, Shanxi, Peoples R China
[4] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan 250358, Shandong, Peoples R China
[5] Shandong Univ, Sch Software, Jinan 250101, Shandong, Peoples R China
基金
中国国家自然科学基金;
关键词
Image aesthetics assessment; label distribution learning; fully convolutional networks; semantic fusion; PHOTO;
D O I
10.1109/TMM.2018.2875357
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Image aesthetics assessment has emerged as a hot topic in recent years due to its potential in numerous high-level vision applications. In this paper, distinguished from existing studies relying on a single label, we propose quantifying image aesthetics by a distribution over multiple quality levels. The distribution-based representation characterizes the disagreement among users' aesthetic preferences regarding the same image, and is also compatible with the traditional task of aesthetic label prediction. Our framework is developed based on fully convolutional networks and enables inputs of varying sizes. In this way, we circumvent the fixed-size constraint of prevalent convolutional neural networks, and avoid the risk of impairing the intrinsic aesthetic appeal of images. Moreover, given the fact that aesthetic perceiving is coupled with semantic understanding, we present a novel semantic-aware hybrid NEtwork (SANE), which harvests the information from object categorization and scene recognition to enhance image aesthetics assessment. Experiments on two benchmark datasets have well verified the effectiveness of our approach in both scenarios of aesthetic distribution prediction and aesthetic label prediction, and highlighted the benefits of input preserving as well as semantic understanding for images.
引用
收藏
页码:1209 / 1220
页数:12
相关论文
共 48 条
[1]  
[Anonymous], P 3 INT C LEARNING R
[2]  
[Anonymous], 2015, PROC CVPR IEEE
[3]  
[Anonymous], PROC 17TH ACM INT CO
[4]  
[Anonymous], PROC 21ST INT CONF M
[5]  
[Anonymous], 2014, P 2 INT C LEARNING R
[6]   Automated Aesthetic Analysis of Photographic Images [J].
Aydin, Tunc Ozan ;
Smolic, Aljoscha ;
Gross, Markus .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2015, 21 (01) :31-42
[7]   Fully-Convolutional Siamese Networks for Object Tracking [J].
Bertinetto, Luca ;
Valmadre, Jack ;
Henriques, Joao F. ;
Vedaldi, Andrea ;
Torr, Philip H. S. .
COMPUTER VISION - ECCV 2016 WORKSHOPS, PT II, 2016, 9914 :850-865
[8]  
Choi K, 2017, INT CONF ACOUST SPEE, P2392, DOI 10.1109/ICASSP.2017.7952585
[9]   Distribution-oriented Aesthetics Assessment for Image Search [J].
Cui, Chaoran ;
Fang, Huidi ;
Deng, Xiang ;
Nie, Xiushan ;
Dai, Hongshuai ;
Yin, Yilong .
SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, :1013-1016
[10]  
Datta R, 2006, LECT NOTES COMPUT SC, V3953, P288, DOI 10.1007/11744078_23