Multimodal model for the Spanish sentiment analysis in a tourism domain

被引:0
|
作者
Monsalve-Pulido, Julian [1 ]
Parra, Carlos Alberto [2 ]
Aguilar, Jose [3 ,4 ,5 ]
机构
[1] Univ Pedag & Tecnol Colombia, GIMI, Tunja, Colombia
[2] Pontificia Univ Javeriana, Bogota, Colombia
[3] Univ Los Andes, CEMISID, Merida, Venezuela
[4] Univ EAFIT, CIDITIC, Medellin, Colombia
[5] IMDEA Networks Inst, Madrid, Spain
关键词
Multimodal model; Sentiment analysis; Opinion mining; Spanish language; Tourism;
D O I
10.1007/s13278-024-01202-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The problem of sentiment analysis of tourism data focuses on the analysis of the multimodal characteristics of the data generated digitally by tourists on each platform or social network. Generally, their opinions have multimodal characteristics, since they combine text, images or numbers (ratings), which represents an important challenge in sentiment analysis that requires new models or multimodal data classification techniques. This work proposes a multimodal sentiment analysis model for data in Spanish in the tourism domain composed of four main phases (extraction, classification, fusion, visualization), and a transversal phase to evaluate the quality of the multimodal sentiment analysis process. Thus, the multimodal sentiment analysis model integrates a data quality model to improve multimodal sentiment analysis tasks, but in addition, the linguistic resource "SenticNet 5" is adapted to Spanish. The model was validated by applying various classification metrics, and the classification results were compared to a manually labeled dataset (TASS) using two machine learning classification algorithms. The first was Random Forest, where the manually labeled dataset has a 50% F1 score compared to the adapted SenticNet automatically generated dataset, which has a 71% F1 score measure and a 70% accuracy. The classification generated by SenticNet is 21% higher than that of the TASS data set. The second algorithm applied was Support Vector Machine (SVM), which classified the SenticNet-generated dataset with an F1 score of 72% versus the manually created dataset with 57.7% (14.3% more effective). In the fusion tests of the multimodal sentiment inputs, the accuracy results for text were 65%, for images 33%, and the fusion of both was 71%. In general, it was identified that the opinions made by users composed of text in Spanish and images improve polarity identification if an independent classification is carried out, and then apply a polarity fusion process.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Trustworthy Multimodal Fusion for Sentiment Analysis in Ordinal Sentiment Space
    Xie, Zhuyang
    Yang, Yan
    Wang, Jie
    Liu, Xiaorong
    Li, Xiaofan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (08) : 7657 - 7670
  • [22] Twitter sentiment mining: A multi domain analysis
    Shahheidari, Saeideh
    Dong, Hai
    Bin Daud, Md Nor Ridzuan
    2013 SEVENTH INTERNATIONAL CONFERENCE ON COMPLEX, INTELLIGENT, AND SOFTWARE INTENSIVE SYSTEMS (CISIS), 2013, : 144 - 149
  • [23] A Spanish Political Tweets Fine-Tuned Sentiment Analysis Model
    Jimenez-Bravo, Diego M.
    Lozano Murciego, Alvaro
    Bajo, Javier
    De La Iglesia, Daniel H.
    Pinzon, Cristian
    NEW TRENDS IN DISRUPTIVE TECHNOLOGIES, TECH ETHICS AND ARTIFICIAL INTELLIGENCE, DITTET 2022, 2023, 1430 : 91 - 102
  • [24] VAE-Based Adversarial Multimodal Domain Transfer for Video-Level Sentiment Analysis
    Wang, Yanan
    Wu, Jianming
    Furumai, Kazuaki
    Wada, Shinya
    Kurihara, Satoshi
    IEEE ACCESS, 2022, 10 : 51315 - 51324
  • [25] TGMoE: A Text Guided Mixture-of-Experts Model for Multimodal Sentiment Analysis
    Zhao, Xueliang
    Wang, Mingyang
    Tan, Yingchun
    Wang, Xianjie
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (08) : 1227 - 1234
  • [26] Multimodal Sentiment Analysis Model Integrating Multi-features and Attention Mechanism
    Lyu X.
    Tian C.
    Zhang L.
    Du Y.
    Zhang X.
    Cai Z.
    Data Analysis and Knowledge Discovery, 2024, 8 (05) : 91 - 101
  • [27] Semisupervised Hierarchical Subspace Learning Model for Multimodal Social Media Sentiment Analysis
    Han, Xue
    Cheng, Honlin
    Ding, Jike
    Yan, Suqin
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2024, 70 (01) : 3446 - 3454
  • [28] CubeMLP: A MLP-based Model for Multimodal Sentiment Analysis and Depression Estimation
    Sun, Hao
    Wang, Hongyi
    Liu, Jiaqing
    Chen, Yen-Wei
    Lin, Lanfen
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 3722 - 3729
  • [29] Sentiment Analysis Based Information Architecture Model for Peruvian Sustainable Tourism SMEs
    Zapata, Gianpierre
    Murga, Javier
    Raymundo, Carlos
    Dominguez, Francisco
    Mogerza, Javier
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON TOURISM RESEARCH, ICTR 2018, 2018, : 176 - 184
  • [30] Sentitext: a sentiment analysis system for Spanish
    Moreno Ortiz, Antonio
    Perez Pozo, Alvaro
    Torres Sanchez, Sergio
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2010, (45): : 297 - 298