Lexicon-based approach outperforms Supervised Machine Learning approach for Urdu Sentiment Analysis in multiple domains

Cited by: 66
Authors
Mukhtar, Neelam [1]
Khan, Mohammad Abid [1]
Chiragh, Nadia [2]
Affiliations
[1] Univ Peshawar, Dept Comp Sci, Peshawar, Kpk, Pakistan
[2] Univ Agr, Peshawar, Pakistan
Keywords
Supervised Machine Learning approach; Lexicon-based approach; Urdu Sentiment Lexicon; Urdu Sentiment Analyzer; Ontology
DOI
10.1016/j.tele.2018.08.003
Chinese Library Classification number
G25 [Library science, librarianship]; G35 [Information science, information work]
Discipline classification code
1205; 120501
Abstract
The Web lets people express their views and opinions on different topics through reviews and blogs. Considerable value can be drawn from these reviews and blogs by fusing sentiment knowledge. In this research, Sentiment Analysis of Urdu blogs from multiple domains is performed using two widely used approaches: the Lexicon-based approach and the Supervised Machine Learning approach. Three well-known classifiers (Support Vector Machine, Decision Tree and K-Nearest Neighbor) are used for the Supervised Machine Learning approach, whereas a wide-coverage Urdu Sentiment Lexicon and an efficient Urdu Sentiment Analyzer are used for the Lexicon-based approach. In both approaches, information is fused from two sources to perform Sentiment Analysis. For the Lexicon-based approach, the two sources are the wide-coverage Urdu Sentiment Lexicon and the efficient Urdu Sentiment Analyzer. For the Supervised Machine Learning approach, the two sources are the un-annotated data and the annotated data along with important attributes. Based on the experiments performed with both approaches, it is concluded that the Lexicon-based approach outperforms the Supervised Machine Learning approach not only in terms of Accuracy, Precision, Recall and F-measure but also in terms of the time and effort required.
Pages: 2173-2183
Page count: 11