A multiobjective weighted voting ensemble classifier based on differential evolution algorithm for text sentiment classification

被引:274
作者
Onan, Aytug [1 ,2 ]
Korukoglu, Serdar [2 ]
Bulut, Hasan [2 ]
机构
[1] Celal Bayar Univ, Dept Comp Engn, TR-45140 Muradiye, Manisa, Turkey
[2] Ege Univ, Dept Comp Engn, TR-35100 Izmir, Turkey
关键词
Sentiment analysis; Ensemble learning; Weighted majority voting; Multiobjective optimization; GENETIC ALGORITHM; OPTIMIZATION; SELECTION; RECOGNITION; PREDICTION; FRAMEWORK; MACHINE;
D O I
10.1016/j.eswa.2016.06.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Typically performed by supervised machine learning algorithms, sentiment analysis is highly useful for extracting subjective information from text documents online. Most approaches that use ensemble learning paradigms toward sentiment analysis involve feature engineering in order to enhance the predictive performance. In response, we sought to develop a paradigm of a multiobjective, optimization-based weighted voting scheme to assign appropriate weight values to classifiers and each output class based on the predictive performance of classification algorithms, all to enhance the predictive performance of sentiment classification. The proposed ensemble method is based on static classifier selection involving majority voting error and forward search, as well as a multiobjective differential evolution algorithm. Based on the static classifier selection scheme, our proposed ensemble method incorporates Bayesian logistic regression, naive Bayes, linear discriminant analysis, logistic regression, and support vector machines as base learners, whose performance in terms of precision and recall values determines weight adjustment. Our experimental analysis of classification tasks, including sentiment analysis, software defect prediction, credit risk modeling, spam filtering, and semantic mapping, suggests that the proposed classification scheme can predict better than conventional ensemble learning methods such as AdaBoost, bagging, random subspace, and majority voting. Of all datasets examined, the laptop dataset showed the best classification accuracy (98.86%). (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1 / 16
页数:16
相关论文
共 98 条
[1]   A novel SVM-kNN-PSO ensemble method for intrusion detection system [J].
Aburomman, Abdulla Amin ;
Reaz, Mamun Bin Ibne .
APPLIED SOFT COMPUTING, 2016, 38 :360-372
[2]   Genetic algorithms and Darwinian approaches in financial applications: A survey [J].
Aguilar-Rivera, Ruben ;
Valenzuela-Rendon, Manuel ;
Rodriguez-Ortiz, J. J. .
EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (21) :7684-7697
[3]   An integrated fuzzy AHP and fuzzy MOORA approach to the problem of industrial engineering sector choosing [J].
Akkaya, Gokay ;
Turanoglu, Betul ;
Oztas, Sinan .
EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (24) :9565-9573
[4]  
[Anonymous], 2008, WEBOLOGY
[5]  
[Anonymous], 2015, SOCIAL NETWORK ANAL, DOI [DOI 10.1109/wcsp.2015.7340981, DOI 10.1117/1.JPE.5.057612]
[6]  
[Anonymous], 2006, International Journal of Hybrid Intelligent Systems, DOI [10.3233/HIS-2006-3104, DOI 10.3233/HIS-2006-3104]
[7]   Comprehensive Study on Lexicon-based Ensemble Classification Sentiment Analysis [J].
Augustyniak, Lukasz ;
Szymanski, Piotr ;
Kajdanowicz, Tomasz ;
Tuliglowicz, Wlodzimierz .
ENTROPY, 2016, 18 (01)
[8]  
Bache K., 2013, UCI Machine Learning Repository
[9]   Heterogeneous classifiers fusion for dynamic breast cancer diagnosis using weighted vote based ensemble [J].
Bashir, Saba ;
Qamar, Usman ;
Khan, Farhan Hassan .
QUALITY & QUANTITY, 2015, 49 (05) :2061-2076
[10]   BagMOOV: A novel ensemble for heart disease prediction bootstrap aggregation with multi-objective optimized voting [J].
Bashir, Saba ;
Qamar, Usman ;
Khan, Farhan Hassan .
AUSTRALASIAN PHYSICAL & ENGINEERING SCIENCES IN MEDICINE, 2015, 38 (02) :305-323