Deep Convolutional Neural Networks with Transfer Learning for Visual Sentiment Analysis

被引:0
作者
Devi, K. Usha Kingsly [1 ]
Gomathi, V [2 ]
机构
[1] Anna Univ, Dept Elect & Commun Engn, Reg Campus, Tirunelveli, Tamil Nadu, India
[2] Natl Engn Coll, Dept Comp Sci & Engn, Kovilpatti, Tamil Nadu, India
关键词
Visual sentiment analysis; CNN; Affective computing; Transfer learning; Data augmentation; ResNet; DenseNet; IMAGE; FEATURES;
D O I
10.1007/s11063-022-11082-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The objective of visual sentiment analysis is to predict the positive or negative sentiment polarity evoked by images by analysing the image contents. The task of automatically recognizing sentiments in still images is inherently more challenging than other visual recognition tasks such as scene recognition, object classification, and semantic image classification since, it involves higher level of abstraction in the human cognition perspective. Sentiment classification in still images requires effective handling of large intra-class variance, scalability, subjectivity while it is also ambiguous as an image can evoke multiple sentiments. To address these issues many of the existing works focus on improving the image sentiment representation. The emergence of convolutional neural networks (CNN) has resulted in impressive performance on computer vision related tasks. The significant contribution of this work includes an exhaustive analysis on four pre-trained CNN architectural models, namely, AlexNet, GoogleNet, ResNet50, and DenseNet201 along with five data augmentation methods on five affective datasets, IAPSa, ArtPhoto, abstract paintings, MART, and EmoROI. Data augmentation is proven to provide better performance for smaller datasets. Five-fold cross validation was performed to train and evaluate the four models with data augmentation and the results demonstrate that the proposed framework is able to achieve improved performances compared to conventional techniques.
引用
收藏
页码:5087 / 5120
页数:34
相关论文
共 59 条
  • [1] Using filter banks in Convolutional Neural Networks for texture classification
    Andrearczyk, Vincent
    Whelan, Paulf.
    [J]. PATTERN RECOGNITION LETTERS, 2016, 84 : 63 - 69
  • [2] [Anonymous], 2016, PROC CVPR IEEE, DOI [DOI 10.1109/TASC.2016.2522821, DOI 10.1109/CVPR.2016.566]
  • [3] [Anonymous], MART DATASET
  • [4] Aquino NMR, 2017, BRAZILIAN SOC COMPUT
  • [5] Bo Pang, 2008, Foundations and Trends in Information Retrieval, V2, P1, DOI 10.1561/1500000001
  • [6] Borth D., 2013, Proceedings of the 21st ACM International Conference on Multimedia, P223, DOI DOI 10.1145/2502081.2502282
  • [7] Campos V, 2015, P 1 INT WORKSH AFF S, P57
  • [8] From pixels to sentiment: Fine-tuning CNNs for visual sentiment prediction
    Campos, Victor
    Jou, Brendan
    Giro-i-Nieto, Xavier
    [J]. IMAGE AND VISION COMPUTING, 2017, 65 : 15 - 22
  • [9] Chen CH, 2015, PROC CVPR IEEE, P4110, DOI 10.1109/CVPR.2015.7299038
  • [10] Chen T., 2014, ARXIV