Artwork Style Recognition Using Vision Transformers and MLP Mixer

被引:4
作者
Iliadis, Lazaros Alexios [1 ]
Nikolaidis, Spyridon [1 ]
Sarigiannidis, Panagiotis [2 ]
Wan, Shaohua [3 ]
Goudos, Sotirios K. [1 ]
机构
[1] Aristotle Univ Thessaloniki, Sch Phys, ELEDIA AUTH, Thessaloniki 54124, Greece
[2] Univ Western Macedonia, Dept Informat & Telecommun Engn, Kozani 50100, Greece
[3] Zhongnan Univ Econ & Law, Sch Informat & Safety Engn, Wuhan 430073, Peoples R China
关键词
vision transformers; computer vision; deep learning; artistic style recognition; CONVOLUTIONAL NEURAL-NETWORKS; CLASSIFICATION;
D O I
10.3390/technologies10010002
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Through the extensive study of transformers, attention mechanisms have emerged as potentially more powerful than sequential recurrent processing and convolution. In this realm, Vision Transformers have gained much research interest, since their architecture changes the dominant paradigm in Computer Vision. An interesting and difficult task in this field is the classification of artwork styles, since the artistic style of a painting is a descriptor that captures rich information about the painting. In this paper, two different Deep Learning architectures-Vision Transformer and MLP Mixer (Multi-layer Perceptron Mixer)-are trained from scratch in the task of artwork style recognition, achieving over 39% prediction accuracy for 21 style classes on the WikiArt paintings dataset. In addition, a comparative study between the most common optimizers was conducted obtaining useful information for future studies.
引用
收藏
页数:12
相关论文
共 26 条
  • [1] Altenburger P, 1996, J BIOTECHNOL, V47, P39, DOI 10.1016/0168-1656(96)01376-4
  • [2] Advanced Service Search Model for Higher Network Navigation Using Small World Networks
    Amin, Farhan
    Choi, Gyu Sang
    [J]. IEEE ACCESS, 2021, 9 : 70584 - 70595
  • [3] Towards Trust and Friendliness Approaches in the Social Internet of Things
    Amin, Farhan
    Ahmad, Awais
    Choi, Gyu Sang
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (01):
  • [4] Classification of Artistic Styles Using Binarized Features Derived from a Deep Neural Network
    Bar, Yaniv
    Levy, Noga
    Wolf, Lior
    [J]. COMPUTER VISION - ECCV 2014 WORKSHOPS, PT I, 2015, 8925 : 71 - 84
  • [5] Carion N., 2020, EUROPEAN C COMPUTER, V12346, P213, DOI 10.1007/978-3-030-58452-8_13
  • [6] Fine-tuning Convolutional Neural Networks for fine art classification
    Cetinic, Eva
    Lipic, Tomislav
    Grgic, Sonja
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2018, 114 : 107 - 118
  • [7] Choi D., 2020, ARXIV191005446
  • [8] Daskalakis C., 2018, P INT C LEARN REPR, P1
  • [9] Dosovitskiy A., 2021, P ICLR 2021 9 INT C
  • [10] Elgammal A., 2018, P AAAI PAL ALT CA US