An efficient hybrid filter and evolutionary wrapper approach for sentiment analysis of various topics on Twitter

被引:52
作者
Hassonah, Mohammad A. [1 ]
Al-Sayyed, Rizik [1 ]
Rodan, Ali [1 ,2 ]
Al-Zoubi, Ala' M. [3 ]
Aljarah, Ibrahim [1 ]
Faris, Hossam [1 ]
机构
[1] Univ Jordan, King Abdullah II Sch Informat Technol, Amman, Jordan
[2] Higher Coll Technol, Abu Dhabi, U Arab Emirates
[3] Univ Granada, Sch Informat & Telecommun Engn, Granada, Spain
关键词
Sentiment analysis; Support Vector Machine; SVM; Feature selection; Multi-Verse Optimizer; ReliefF; Social network; SUPPORT VECTOR MACHINE; FEATURE-SELECTION; CLASSIFICATION; OPTIMIZATION; ALGORITHM; SYSTEM;
D O I
10.1016/j.knosys.2019.105353
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment Analysis is currently considered as one of the most attractive research topics in Natural Language Processing (NLP) field. The main objective of sentiment analysis is to identify the opinions and emotions of the users through written contents. While there are different studies that have approached this field using various techniques, it is still considered a challenging topic with many difficulties that are yet to be solved, such as having modern accents, slang words, spelling and grammatical mistakes, and other issues that cannot be overcome with traditional methods and sentiment lexicons. In this work, we propose a hybrid machine learning approach to enhance sentiment analysis; as we build a classification model based on three classes, which are positive, neutral, and negative emotions, using Support Vector Machines (SVM) classifier, while combining two feature selection techniques using the ReliefF and Multi-Verse Optimizer (MVO) algorithms. We also extract more than 6900 tweets from Twitter social network to test our work. Our hybrid method is compared against other classifiers and methods in terms of accuracy. Results show that our proposed method outperforms other techniques and classifiers, by obtaining better results in most of the datasets while reducing the number of features by up to 96.85% from the original feature set. We also categorize the extracted features into Objective, Subjective and Emoticon words to analyze them during the first and the final feature selection processes and find any existing relations. Very similar results are obtained by both feature selection techniques; due to a number of factors that are explained in this paper. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页数:19
相关论文
共 84 条
[1]   Effective Sentimental Analysis and Opinion Mining of Web Reviews Using Rule Based Classifiers [J].
Ahmed, Shoiab ;
Danti, Ajit .
COMPUTATIONAL INTELLIGENCE IN DATA MINING, VOL 1, CIDM 2015, 2016, 410 :171-179
[2]   Feature selection and ensemble construction: A two-step method for aspect based sentiment analysis [J].
Akhtar, Md Shad ;
Gupta, Deepak ;
Ekbal, Asif ;
Bhattacharyya, Pushpak .
KNOWLEDGE-BASED SYSTEMS, 2017, 125 :116-135
[3]  
Al-Mosimi T., 2017, J INF SCI
[4]  
Al-Zoubi A., 2019, J INF SCI
[5]  
Al-Zoubi AM, 2020, ALGO INTELL SY, P11, DOI 10.1007/978-981-32-9990-0_2
[6]   Identifying beta-thalassemia carriers using a data mining approach: The case of the Gaza Strip, Palestine [J].
AlAgha, Alaa S. ;
Faris, Hossam ;
Hammo, Bassam H. ;
Al-Zoubi, Ala M. .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2018, 88 :70-83
[7]  
Alrefai M., 2018, International Journal of Advanced Science and Technology, P13, DOI DOI 10.14257/IJAST.2018.119.02
[8]  
Alsaffar A, 2014, I C INF TECH MULTIM, P270, DOI 10.1109/ICIMU.2014.7066643
[9]  
[Anonymous], 2018, TWITTER SENTIMENT AN
[10]  
[Anonymous], 2012, Mining text data, DOI DOI 10.1007/978-1-4614-3223-4_13