Learning Visual Balance from Large-scale Datasets of Aesthetically Highly Rated Images

Cited by: 14
Authors
Jahanian, Ali [1]
Vishwanathan, S. V. N. [2, 3]
Allebach, Jan P. [1]
Affiliations
[1] Purdue Univ, Sch Elect & Comp Engn, W Lafayette, IN 47907 USA
[2] Purdue Univ, Dept Comp Sci, W Lafayette, IN 47907 USA
[3] Purdue Univ, Dept Stat, W Lafayette, IN 47907 USA
Source
HUMAN VISION AND ELECTRONIC IMAGING XX | 2015 / Vol. 9394
Keywords
Visual balance; Arnheim's theory of visual rightness; layout; aesthetics; automatic visual design; the Rule of Thirds; symmetry; design mining; SPATIAL COMPOSITION; GOLDEN SECTION; PHOTOGRAPHS; PERCEPTION; ART;
DOI
10.1117/12.2084548
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The concept of visual balance is innate for humans, and influences how we perceive visual aesthetics and cognize harmony. Although visual balance is a vital principle of design and is taught in schools of design, it is barely quantified. On the other hand, with the emergence of automatic/semi-automatic visual designs for self-publishing, learning visual balance and modeling it computationally may improve the aesthetics of such designs. In this paper, we present how the quest to understand visual balance inspired us to revisit one of the well-known theories in visual arts, the so-called theory of "visual rightness", elucidated by Arnheim. We define Arnheim's hypothesis as a design mining problem with the goal of learning visual balance from the work of professionals. We collected a dataset of 120K images that are aesthetically highly rated, from a professional photography website. We then computed factors that contribute to visual balance based on the notion of visual saliency. We fitted a mixture of Gaussians to the saliency maps of the images, and obtained the hotspots of the images. Our inferred Gaussians align with Arnheim's hotspots, and confirm his theory. Moreover, the results support the viability of the center of mass, symmetry, as well as the Rule of Thirds in our dataset.
Pages: 9