Multimodal model for the Spanish sentiment analysis in a tourism domain

被引:0
|
作者
Monsalve-Pulido, Julian [1 ]
Parra, Carlos Alberto [2 ]
Aguilar, Jose [3 ,4 ,5 ]
机构
[1] Univ Pedag & Tecnol Colombia, GIMI, Tunja, Colombia
[2] Pontificia Univ Javeriana, Bogota, Colombia
[3] Univ Los Andes, CEMISID, Merida, Venezuela
[4] Univ EAFIT, CIDITIC, Medellin, Colombia
[5] IMDEA Networks Inst, Madrid, Spain
关键词
Multimodal model; Sentiment analysis; Opinion mining; Spanish language; Tourism;
D O I
10.1007/s13278-024-01202-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The problem of sentiment analysis of tourism data focuses on the analysis of the multimodal characteristics of the data generated digitally by tourists on each platform or social network. Generally, their opinions have multimodal characteristics, since they combine text, images or numbers (ratings), which represents an important challenge in sentiment analysis that requires new models or multimodal data classification techniques. This work proposes a multimodal sentiment analysis model for data in Spanish in the tourism domain composed of four main phases (extraction, classification, fusion, visualization), and a transversal phase to evaluate the quality of the multimodal sentiment analysis process. Thus, the multimodal sentiment analysis model integrates a data quality model to improve multimodal sentiment analysis tasks, but in addition, the linguistic resource "SenticNet 5" is adapted to Spanish. The model was validated by applying various classification metrics, and the classification results were compared to a manually labeled dataset (TASS) using two machine learning classification algorithms. The first was Random Forest, where the manually labeled dataset has a 50% F1 score compared to the adapted SenticNet automatically generated dataset, which has a 71% F1 score measure and a 70% accuracy. The classification generated by SenticNet is 21% higher than that of the TASS data set. The second algorithm applied was Support Vector Machine (SVM), which classified the SenticNet-generated dataset with an F1 score of 72% versus the manually created dataset with 57.7% (14.3% more effective). In the fusion tests of the multimodal sentiment inputs, the accuracy results for text were 65%, for images 33%, and the fusion of both was 71%. In general, it was identified that the opinions made by users composed of text in Spanish and images improve polarity identification if an independent classification is carried out, and then apply a polarity fusion process.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] Informal Multilingual Multi-domain Sentiment Analysis
    Stajner, Tadej
    Novalija, Inna
    Mladenic, Dunja
    INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2013, 37 (04): : 373 - 380
  • [42] FinSSLx: A Sentiment Analysis Model for the Financial Domain Using Text Simplification
    Maia, Macedo
    Freitas, Andre
    Handschuh, Siegfried
    2018 IEEE 12TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2018, : 318 - 319
  • [43] A Model for Cross-Domain Opinion Target Extraction in Sentiment Analysis
    Pak, Muhammet Yasin
    Gunal, Serkan
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2022, 42 (03): : 1215 - 1239
  • [44] Multi-Level Attention Map Network for Multimodal Sentiment Analysis
    Xue, Xiaojun
    Zhang, Chunxia
    Niu, Zhendong
    Wu, Xindong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (05) : 5105 - 5118
  • [45] Sentiment Analysis of Japanese Tourism Online Reviews
    Chuanming Yu
    Xingyu Zhu
    Bolin Feng
    Lin Cai
    Lu An
    Journal of Data and Information Science, 2019, (01) : 89 - 113
  • [46] Sentiment Analysis in Tourism: Capitalizing on Big Data
    Alaei, Ali Reza
    Becken, Susanne
    Stantic, Bela
    JOURNAL OF TRAVEL RESEARCH, 2019, 58 (02) : 175 - 191
  • [47] A case study of Spanish text transformations for twitter sentiment analysis
    Tellez, Eric S.
    Miranda-Jimenez, Sabino
    Graff, Mario
    Moctezuma, Daniela
    Siordia, Oscar S.
    Villasenor, Elio A.
    EXPERT SYSTEMS WITH APPLICATIONS, 2017, 81 : 457 - 471
  • [48] LIWC-Based Sentiment Analysis in Spanish Product Reviews
    Lopez-Lopez, Estanislao
    del Pilar Salas-Zarate, Maria
    Almela, Angela
    Angel Rodriguez-Garcia, Miguel
    Valencia-Garcia, Rafael
    Alor-Hernandez, Giner
    DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE, 11TH INTERNATIONAL CONFERENCE, 2014, 290 : 379 - 386
  • [49] Automated Sentiment Analysis in Tourism: Comparison of Approaches
    Kirilenko, Andrei P.
    Stepchenkova, Svetlana O.
    Kim, Hany
    Li, Xiang
    JOURNAL OF TRAVEL RESEARCH, 2018, 57 (08) : 1012 - 1025
  • [50] Sentiment Analysis of Japanese Tourism Online Reviews
    Yu, Chuanming
    Zhu, Xingyu
    Feng, Bolin
    Cai, Lin
    An, Lu
    JOURNAL OF DATA AND INFORMATION SCIENCE, 2019, 4 (01) : 89 - 113