CNN-Based Classification of Illustrator Style in Graphic Novels: Which Features Contribute Most?

被引:3
作者
Laubrock, Jochen [1 ]
Dubray, David [1 ]
机构
[1] Univ Potsdam, Potsdam, Germany
来源
MULTIMEDIA MODELING, MMM 2019, PT II | 2019年 / 11296卷
关键词
Convolutional Neural Network; Classification; Graphic Novels; Stylometry;
D O I
10.1007/978-3-030-05716-9_61
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Can classification of graphic novel illustrators be achieved by convolutional neural network (CNN) features evolved for classifying concepts on photographs? Assuming that basic features at lower network levels generically represent invariants of our environment, they should be reusable. However, features at what level of abstraction are characteristic of illustrator style? We tested transfer learning by classifying roughly 50,000 digitized pages from about 200 comic books of the Graphic Narrative Corpus (GNC, [6]) by illustrator. For comparison, we also classified Manga109 [18] by book. We tested the predictability of visual features by experimentally varying which of the mixed layers of Inception V3 [29] was used to train classifiers. Overall, the top-1 test-set classification accuracy in the artist attribution analysis increased from 92% for mixed-layer 0 to over 97% when adding mixed-layers higher in the hierarchy. Above mixed-layer 5, there were signs of overfitting, suggesting that texture-like mid-level vision features were sufficient. Experiments varying input material show that page layout and coloring scheme are important contributors. Thus, stylistic classification of comics artists is possible re-using pre-trained CNN features, given only a limited amount of additional training material. We propose that CNN features are general enough to provide the foundation of a visual stylometry, potentially useful for comparative art history.
引用
收藏
页码:684 / 695
页数:12
相关论文
共 32 条
[1]  
[Anonymous], 2013, DISTANT READING
[2]   Image Style Classification Based on Learnt Deep Correlation Features [J].
Chu, Wei-Ta ;
Wu, Yi-Ling .
IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (09) :2491-2502
[3]   Manga FaceNet: Face Detection in Manga based on Deep Neural Network [J].
Chu, Wei-Ta ;
Li, Wei-Wei .
PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR'17), 2017, :417-420
[4]   Comparison of deep neural networks to spatio-temporal cortical dynamics of human visual object recognition reveals hierarchical correspondence [J].
Cichy, Radoslaw Martin ;
Khosla, Aditya ;
Pantazis, Dimitrios ;
Torralba, Antonio ;
Oliva, Aude .
SCIENTIFIC REPORTS, 2016, 6
[5]   In Search of Art [J].
Crowley, Elliot J. ;
Zisserman, Andrew .
COMPUTER VISION - ECCV 2014 WORKSHOPS, PT I, 2015, 8925 :54-70
[6]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[7]  
Dunst A., 2018, EMPIRICAL COMICS RES, P239
[8]   The Graphic Narrative Corpus (GNC): Design, Annotation, and Analysis for the Digital Humanities [J].
Dunst, Alexander ;
Hartel, Rita ;
Laubrock, Jochen .
2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2017), VOL 3, 2017, :15-20
[9]   Texture and art with deep neural networks [J].
Gatys, Leon A. ;
Ecker, Alexander S. ;
Bethge, Matthias .
CURRENT OPINION IN NEUROBIOLOGY, 2017, 46 :178-186
[10]   Image Style Transfer Using Convolutional Neural Networks [J].
Gatys, Leon A. ;
Ecker, Alexander S. ;
Bethge, Matthias .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2414-2423