Exploiting Stacked Autoencoders for Improved Sentiment Analysis

被引:13
作者
Ahmed, Kanwal [1 ]
Nadeem, Muhammad Imran [1 ]
Li, Dun [1 ]
Zheng, Zhiyun [1 ]
Ghadi, Yazeed Yasin [2 ]
Assam, Muhammad [3 ]
Mohamed, Heba G. [4 ]
机构
[1] Zhengzhou Univ, Sch Comp & Artificial Intelligence, Zhengzhou 450001, Peoples R China
[2] Al Ain Univ, Dept Comp Sci & Software Engn, Al Ain, U Arab Emirates
[3] Univ Sci & Technol Bannu, Dept Software Engn, Bannu 28100, Pakistan
[4] Princess Nourah bint Abdulrahman Univ, Coll Engn, Dept Elect Engn, POB 84428, Riyadh 11671, Saudi Arabia
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 23期
关键词
data mining; natural language processing; text mining; text analysis; web mining; GENETIC ALGORITHM; NETWORKS;
D O I
10.3390/app122312380
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Sentiment analysis is an ongoing research field within the discipline of data mining. The majority of academics employ deep learning models for sentiment analysis due to their ability to self-learn and process vast amounts of data. However, the performance of deep learning models depends on the values of the hyperparameters. Determining suitable values for hyperparameters is a cumbersome task. The goal of this study is to increase the accuracy of stacked autoencoders for sentiment analysis using a heuristic optimization approach. In this study, we propose a hybrid model GA(SAE)-SVM using a genetic algorithm (GA), stacked autoencoder (SAE), and support vector machine (SVM) for fine-grained sentiment analysis. Features are extracted using continuous bag-of-words (CBOW), and then input into the SAE. In the proposed GA(SAE)-SVM, the hyperparameters of the SAE algorithm are optimized using GA. The features extracted by SAE are input into the SVM for final classification. A comparison is performed with a random search and grid search for parameter optimization. GA optimization is faster than grid search, and selects more optimal values than random search, resulting in improved accuracy. We evaluate the performance of the proposed model on eight benchmark datasets. The proposed model outperformed when compared to the baseline and state-of-the-art techniques.
引用
收藏
页数:24
相关论文
共 94 条
[1]   Selecting Attributes for Sentiment Classification Using Feature Relation Networks [J].
Abbasi, Ahmed ;
France, Stephen ;
Zhang, Zhu ;
Chen, Hsinchun .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2011, 23 (03) :447-462
[2]   Energy choices in Alaska: Mining people's perception and attitudes from geotagged tweets [J].
Abdar, Moloud ;
Basiri, Mohammad Ehsan ;
Yin, Junjun ;
Habibnezhad, Mahmoud ;
Chi, Guangqing ;
Nemati, Shahla ;
Asadi, Somayeh .
RENEWABLE & SUSTAINABLE ENERGY REVIEWS, 2020, 124 (124)
[3]   Sentiment analysis through recurrent variants latterly on convolutional neural network of Twitter [J].
Abid, Fazeel ;
Alam, Muhammad ;
Yasir, Muhammad ;
Li, Chen .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 95 :292-308
[4]   Sentiment classification within online social media using whale optimization algorithm and social impact theory based optimization [J].
Akyol, Sinem ;
Alatas, Bilal .
PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2020, 540
[5]   A Hybrid Semantic Knowledgebase-Machine Learning Approach for Opinion Mining [J].
Alfrjani, Rowida ;
Osman, Taha ;
Cosma, Georgina .
DATA & KNOWLEDGE ENGINEERING, 2019, 121 :88-108
[6]  
[Anonymous], MULT SENT DAT
[7]  
[Anonymous], 2013, International Journal of Emerging Sciences, DOI DOI 10.14355/ijes.2013.0305.05
[8]  
[Anonymous], 2011, P 28 INT C MACH LEAR
[9]  
[Anonymous], Large Movie Review Dataset
[10]  
[Anonymous], CORN CIS COMP SCI