Learning to Photograph: A Compositional Perspective

Cited by: 43

Authors:

Ni, Bingbing ^{[1]}

Xu, Mengdi ^{[2]}

Cheng, Bin ^{[2]}

Wang, Meng ^{[3]}

Yan, Shuicheng ^{[2]}

Tian, Qi ^{[4]}

Affiliations:

[1] Adv Digital Sci Ctr, Singapore 138632, Singapore

[2] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore 117576, Singapore

[3] Hefei Univ Technol, Hefei, Peoples R China

[4] Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX 78249 USA

Source:

IEEE TRANSACTIONS ON MULTIMEDIA | 2013 / Vol. 15 / No. 05

Funding:

National Science Foundation (USA); National Natural Science Foundation of China;

Keywords:

Generative model; maximum a posteriori; photo composition; spatial context; view recommendation; IMAGE; HISTOGRAMS;

DOI:

10.1109/TMM.2013.2241042

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

In this paper, we present an intelligent photography system that recommends the most user-favored view rectangle for arbitrary camera input, from a photographic compositional perspective. Automating this process is difficult because human aesthetic judgment is subjective and image content varies widely, so heuristic compositional rules lack generality. Motivated by the recent prevalence of photo-sharing websites, e.g., Flickr.com, we develop a learning-based framework that discovers the underlying aesthetic photographic compositional structures from a large set of user-favored photographs shared online and uses the knowledge implicitly shared among professional photographers for aesthetically optimal view recommendation. In particular, we propose an Omni-Range Context method that uses generative mixture models to explicitly encode the spatial and geometric distributions of various visual elements in the photograph, as well as the co-occurrence characteristics of visual element pairs. Searching for the optimal view rectangle is then formulated as maximum a posteriori (MAP) estimation by imposing the trained prior distributions along with additional photographic constraints. The proposed system has the potential to operate in near real time. Comprehensive user studies demonstrate the effectiveness of the proposed framework for aesthetically optimal view recommendation.

Pages: 1138-1151

Page count: 14

