Proposing sentiment analysis model based on BERT and XLNet for movie reviews

被引:5
作者
Danyal, Mian Muhammad [1 ]
Khan, Sarwar Shah [2 ,3 ]
Khan, Muzammil [2 ]
Ullah, Subhan [1 ]
Mehmood, Faheem [4 ]
Ali, Ijaz [3 ]
机构
[1] City Univ Sci & Informat Technol, Dept Comp Sci, Peshawar 25000, Pakistan
[2] Univ Swat, Dept Comp & Software Technol, Swat 19130, Pakistan
[3] Iqra Natl Univ, Dept Comp Sci, Swat 19200, Pakistan
[4] Air Univ Islamabad, Dept Comp Sci, Islamabad 44320, Pakistan
关键词
XLNet; BERT; Sentiment analysis; IMDB dataset; Rotten tomatoes dataset; Machine learning;
D O I
10.1007/s11042-024-18156-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Movie reviews are a valuable source of information for potential viewers. However, reading all of the reviews can be time-consuming and overwhelming. Summarizing all of the reviews will help you make the correct choice without wasting time reading all of the reviews. Sentiment analysis, or opinion mining, can extract subjective information from movie reviews, such as the reviewer's overall opinion of the movie, its strengths and weaknesses, and the reviewer's recommendations. This information can help potential viewers make informed decisions about whether or not to watch a movie. XLNet and Bidirectional Encoder Representations from Transformers (BERT) are pre-trained advanced language models that learn bidirectional relationships between words, improving performance on many natural language processing tasks. BERT uses a masked language modeling objective, while XLNet uses a permutation language modeling objective. This experiment is based on the proposed method for XLNet and BERT, two advanced techniques and popular baseline techniques using the Internet Movie Database (IMDB) Dataset of 50K reviews and the Rotten Tomatoes dataset. We pre-processed both datasets using data cleaning, the removal of duplicate reviews, lemmatization, and handling of chat words to improve baseline technique results. The results indicate that XLNet achieved the highest accuracy on both datasets. As a result of the research experiment, sentiment analysis provides insights into how emotions and attitudes are expressed in movie reviews that can be used to predict a movie's performance based on their overall sentiment.
引用
收藏
页码:64315 / 64339
页数:25
相关论文
共 33 条
[1]  
Abimanyu AJ., 2023, Build Inf Technol Sci (BITS), V5, P162
[2]  
Abinash N, 2023, NeuroQuantology, V21, P1084
[3]  
Agrawal Tanay., 2021, Hyperparameter Optimization in Machine Learning
[4]   A review of feature selection techniques in sentiment analysis [J].
Ahmad, Siti Rohaidah ;
Abu Bakar, Azuraliza ;
Yaakub, Mohd Ridzwan .
INTELLIGENT DATA ANALYSIS, 2019, 23 (01) :159-189
[5]  
Amolik A., 2015, Int J Eng Technol, V7, P2038
[6]  
Andersson V, 2017, Machine Learning in Logistics: Machine Learning Algorithms: Data Preprocessing and Machine Learning Algorithms
[7]  
[Anonymous], 2020, Rotten Tomatoes Movie Reviews dataset
[8]  
Aziz MM, 2023, Building of Informatics, Technology and Science, V4, P1714, DOI [10.47065/bits.v4i4.2644, DOI 10.47065/BITS.V4I4.2644]
[9]  
Banik N, 2018, 2018 INTERNATIONAL CONFERENCE ON BANGLA SPEECH AND LANGUAGE PROCESSING (ICBSLP)
[10]   A comprehensive survey on sentiment analysis: Approaches, challenges and trends [J].
Birjali, Marouane ;
Kasri, Mohammed ;
Beni-Hssane, Abderrahim .
KNOWLEDGE-BASED SYSTEMS, 2021, 226