Framework for Improved Sentiment Analysis via Random Minority Oversampling for User Tweet Review Classification

被引:17
作者
Almuayqil, Saleh Naif [1 ]
Humayun, Mamoona [1 ]
Jhanjhi, N. Z. [2 ]
Almufareh, Maram Fahaad [1 ]
Javed, Danish [2 ]
机构
[1] Jouf Univ, Coll Comp & Informat Sci, Dept Informat Syst, Sakakah 72311, Saudi Arabia
[2] Taylors Univ, Sch Comp Sci SCS, Subang Jaya 47500, Malaysia
关键词
sentiment analysis (SA); sentiment classification; resampling; random minority oversampling; random majority under sampling; deep learning (DL); machine learning (ML); term frequency inverse document frequency (TF-IDF);
D O I
10.3390/electronics11193058
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Social networks such as twitter have emerged as social platforms that can impart a massive knowledge base for people to share their unique ideas and perspectives on various topics and issues with friends and families. Sentiment analysis based on machine learning has been successful in discovering the opinion of the people using redundantly available data. However, recent studies have pointed out that imbalanced data can have a negative impact on the results. In this paper, we propose a framework for improved sentiment analysis through various ordered preprocessing steps with the combination of resampling of minority classes to produce greater performance. The performance of the technique can vary depending on the dataset as its initial focus is on feature selection and feature combination. Multiple machine learning algorithms are utilized for the classification of tweets into positive, negative, or neutral. Results have revealed that random minority oversampling can provide improved performance and it can tackle the issue of class imbalance.
引用
收藏
页数:17
相关论文
共 39 条
[21]  
IEEE Thailand Section and Institute of Electrical and Electronics Engineers, 2018, ICEAST 2018 P 4 INT
[22]   A systematic literature review on machine learning applications for consumer sentiment analysis using online reviews [J].
Jain, Praphula Kumar ;
Pamula, Rajendra ;
Srivastava, Gautam .
COMPUTER SCIENCE REVIEW, 2021, 41 (41)
[23]   KSCB: a novel unsupervised method for text sentiment analysis [J].
Jiang, Weili ;
Zhou, Kangneng ;
Xiong, Chenchen ;
Du, Guodong ;
Ou, Chubin ;
Zhang, Junpeng .
APPLIED INTELLIGENCE, 2023, 53 (01) :301-311
[24]   A hybrid model integrating deep learning with investor sentiment analysis for stock price prediction [J].
Jing, Nan ;
Wu, Zhao ;
Wang, Hefei .
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 178
[25]   Multi-Scale Network for Thoracic Organs Segmentation [J].
Khalil, Muhammad Ibrahim ;
Tehsin, Samabia ;
Humayun, Mamoona ;
Jhanjhi, N. Z. ;
AlZain, Mohammed A. .
CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 70 (02) :3251-3265
[26]   SVM and k-Means Hybrid Method for Textual Data Sentiment Analysis [J].
Korovkinas, Konstantinas ;
Danenas, Paulius ;
Garsva, Gintautas .
BALTIC JOURNAL OF MODERN COMPUTING, 2019, 7 (01) :47-60
[27]   Multi-class sentiment classification: The experimental comparisons of feature selection and machine learning algorithms [J].
Liu, Yang ;
Bi, Jian-Wu ;
Fan, Zhi-Ping .
EXPERT SYSTEMS WITH APPLICATIONS, 2017, 80 :323-339
[28]  
Mukherjee A., 2019, P 2019 IEEE 10 INT C
[29]   Sentiment Analysis of Customers' Reviews Using a Hybrid Evolutionary SVM-Based Approach in an Imbalanced Data Distribution [J].
Obiedat, Ruba ;
Qaddoura, Raneem ;
Al-Zoubi, Ala' M. ;
Al-Qaisi, Laila ;
Harfoushi, Osama ;
Alrefai, Mo'ath ;
Faris, Hossam .
IEEE ACCESS, 2022, 10 :22260-22273
[30]  
Rahmanda R., 2022, J RESTI, V6, P599, DOI [10.29207/resti.v6i4.4186, DOI 10.29207/RESTI.V6I4.4186]