Cross lingual transfer learning for sentiment analysis of Italian TripAdvisor reviews

被引:8
|
作者
Catelli, Rosario [1 ]
Bevilacqua, Luca [5 ]
Mariniello, Nicola [5 ]
di Carlo, Vladimiro Scotto [5 ]
Magaldi, Massimo [5 ]
Fujita, Hamido [2 ,3 ,4 ]
De Pietro, Giuseppe [1 ]
Esposito, Massimo [1 ]
机构
[1] Natl Res Council CNR, Inst High Performance Comp & Networking ICAR, Naples, Italy
[2] Ho Chi Minh City Univ Technol HUTECH, Fac Informat Technol, Ho Chi Minh City, Vietnam
[3] Natl Taipei Univ Technol, Taipei, Taiwan
[4] I Somet Inc Assoc, Morioka, Iwate, Japan
[5] Engn Ingn Informat SpA, Naples, Italy
关键词
Transfer learning; Sentiment analysis; Italian dataset; BERT; TripAdvisor; Reviews;
D O I
10.1016/j.eswa.2022.118246
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Over the years, the attention of the scientific world towards the techniques of sentiment analysis has increased considerably, driven by industry. The arrival of the Google BERT language model has confirmed the superiority of models based on a particular structure of artificial neural network called Transformer, from which many variants have resulted. These models are generally pre-trained on large text corpora and only later specialized according to the precise task to be faced on much smaller amounts of data. For these reasons, countless versions were developed to meet the specific needs of each language, especially in the case of languages with relatively few datasets available. At the same time, models that were pre-trained for multiple languages became widespread, providing greater flexibility of use in exchange for lower performance. This study shows how the use of techniques to transfer learning from languages with high resources to languages with low resources provides an important performance increase: a multilingual BERT model fine tuned on a mixed English/Italian dataset (using for the English a literature dataset and for the Italian a reviews dataset created ad-hoc from the well-known platform TripAdvisor), provides much higher performance than models specific to Italian. Overall, the results obtained by comparing the different possible approaches indicate which one is the most promising to pursue in order to obtain the best results in low resource scenarios.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] Comparative Sentiment Analysis on a Set of Movie Reviews Using Deep Learning Approach
    Chakraborty, Koyel
    Bhattacharyya, Siddhartha
    Bag, Rajib
    Hassanien, Aboul Ella
    INTERNATIONAL CONFERENCE ON ADVANCED MACHINE LEARNING TECHNOLOGIES AND APPLICATIONS (AMLTA2018), 2018, 723 : 311 - 318
  • [42] Sentiment classification and aspect-based sentiment analysis on yelp reviews using deep learning and word embeddings
    Alamoudi, Eman Saeed
    Alghamdi, Norah Saleh
    JOURNAL OF DECISION SYSTEMS, 2021, 30 (2-3) : 259 - 281
  • [43] Aspect-Based Sentiment Analysis of Drug Reviews Applying Cross-Domain and Cross-Data Learning
    Grasser, Felix
    Kallumadi, Surya
    Malberg, Hagen
    Zaunseder, Sebastian
    DH '18: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON DIGITAL HEALTH, 2018, : 121 - 125
  • [44] Deep Learning Model for Sentiment Analysis in Multi-lingual Corpus
    Medrouk, Lisa
    Pappa, Anna
    NEURAL INFORMATION PROCESSING, ICONIP 2017, PT I, 2017, 10634 : 205 - 212
  • [45] Cross-Lingual Transfer Learning for Statistical Type Inference
    Li, Zhiming
    Xie, Xiaofei
    Li, Haoliang
    Xu, Zhengzi
    Li, Yi
    Liu, Yang
    PROCEEDINGS OF THE 31ST ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS, ISSTA 2022, 2022, : 239 - 250
  • [46] A review on multi-lingual sentiment analysis by machine learning methods
    Sagnika S.
    Pattanaik A.
    Mishra B.S.P.
    Meher S.K.
    Journal of Engineering Science and Technology Review, 2020, 13 (02) : 154 - 166
  • [47] Sentiment Analysis and Classification of Restaurant Reviews using Machine Learning
    Zahoor, Kanwal
    Bawany, Narmeen Zakaria
    Hamid, Soomaiya
    2020 21ST INTERNATIONAL ARAB CONFERENCE ON INFORMATION TECHNOLOGY (ACIT), 2020,
  • [48] CROSS-LINGUAL TRANSFER LEARNING FOR SPOKEN LANGUAGE UNDERSTANDING
    Quynh Ngoc Thi Do
    Gaspers, Judith
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5956 - 5960
  • [49] Sentiment Analysis on Reviews: Understanding eWOM Using Deep Learning
    Che, Pak Hou
    Chen, Caleb Huanyong
    PROCEEDINGS OF 2020 CHINA MARKETING INTERNATIONAL CONFERENCE (WEB CONFERENCING): MARKETING AND MANAGEMENT IN THE DIGITAL AGE, 2020, : 732 - 740
  • [50] Comparing deep learning architectures for sentiment analysis on drug reviews
    Colon-Ruiz, Cristobal
    Segura-Bedmar, Isabel
    JOURNAL OF BIOMEDICAL INFORMATICS, 2020, 110