Multimodal Deep Learning Framework for Sentiment Analysis from Text-Image Web Data

被引：10

作者：

Thuseethan, Selvarajah ^{[1
]}

Janarthan, Sivasubramaniam ^{[1
]}

Rajasegarar, Sutharshan ^{[1
]}

Kumari, Priya ^{[1
]}

Yearwood, John ^{[1
]}

机构：

[1] Deakin Univ, Sch Informat Technol, Geelong, Vic, Australia

来源：

2020 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2020) | 2020年

关键词：

Sentiment Analysis; Multimodal Features; Web Data; Deep Learning; Affective Computing;

D O I：

10.1109/WIIAT50758.2020.00039

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Understanding people's sentiments from data published on the web presents a significant research problem and has a variety of applications, such as learning the context, prediction of election results and opinion about an incident. So far, sentiment analysis from web data has focused primarily on a single modality, such as text or image. However, the readily available multiple modal information, such as image and different forms of texts, as a combination can help to estimate the sentiments more accurately. Further, blindly combining the visual and textual features increases the complexity of the model, which ultimately reduces the sentiment analysis performance as it often fails to capture the correct interrelationships between different modalities. Hence, in this study, a sentiment analysis framework that carefully fuses the salient visual cues and high attention textual cues is proposed, exploiting the interrelationships between multimodal web data. A multimodal deep association learner is stacked to learn the relationships between learned salient visual features and textual features. Further, to automatically learn the discriminative features from the image and text, two streams of unimodal deep feature extractors are proposed to extract the visual and textual features that are most relevant to the sentiments. Finally, the sentiment is estimated using the features that are combined using a late fusion mechanism. The extensive evaluations show that our proposed framework achieved promising results for sentiment analysis using web data, in comparison to existing unimodal approaches and multimodal approaches that blindly combine the visual and textual features.

引用

页码：267 / 274

页数：8

共 47 条

[1] Agarwal Ayush, 2019, 2019 4th International Conference on Big Data, Cloud Computing, Data Science & Engineering (BCD), P19, DOI 10.1109/BCD.2019.8885108
[2] [Anonymous], 2014, ARXIV14091556
[3] Hybrid N-gram model using Naive Bayes for classification of political sentiments on Twitter
Awwalu, Jamilu
Abu Bakar, Azuraliza
Yaakub, Mohd Ridzwan
[J]. NEURAL COMPUTING & APPLICATIONS, 2019, 31 (12) : 9207 - 9220
[4] Affective Computing and Sentiment Analysis
Cambria, Erik
[J]. IEEE INTELLIGENT SYSTEMS, 2016, 31 (02) : 102 - 107
[5] Explaining Recommendations Based on Feature Sentiments in Product Reviews
Chen, Li
Wang, Feng
[J]. IUI'17: PROCEEDINGS OF THE 22ND INTERNATIONAL CONFERENCE ON INTELLIGENT USER INTERFACES, 2017, : 17 - 28
[6] Chen Minghai, 2017, P 19 ACM INT C MULTI, P163, DOI [10.1145/3136755.3136801, DOI 10.1145/3136755.3136801]
[7] Chen T, 2015, AAAI CONF ARTIF INTE, P30
[8] Conneau A, 2017, 15TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2017), VOL 1: LONG PAPERS, P1107
[9] Costa Pereira Jose, 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), P4583, DOI 10.1109/ICASSP.2014.6854470
[10] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

← 1 2 3 4 5 →