Constructing automatic domain-specific sentiment lexicon using KNN search via terms discrimination vectors

被引:3
作者
Alqasemi F. [1 ]
Abdelwahab A. [1 ]
Abdelkader H. [1 ]
机构
[1] Information Systems Department, Menoufia University, Menoufia
关键词
KNN; lexicon-based SA; natural language processing; Sentiment analysis; sentiment lexicon; sentiment seeds;
D O I
10.1080/1206212X.2017.1409477
中图分类号
学科分类号
摘要
Web textual data content is a viable source for decision-makers’ knowledge, so are text analytic applications. Sentiment analysis (SA) is one of text mining fields, in which text is analyzed to recognize text writer implied opinion. In this paper, a new approach had been presented for automatic Arabic language sentiment lexicon constructing. Popular KNN search algorithm is utilized for this objective. Cosine distance between seeds terms and corpus terms is employed in KNN search query. Generated lexicon terms are launched from sentiment seeds and seeds terms are augmented via Arabic-specific NLP-based algorithm, which is helped to enhance seeds terms selection process. Term discrimination vector (TDV) is the main part of KNN query inputs TDV components are computed for each corpus term and it is constituted by four term weight techniques. According to the experimental results, TDV accomplished better results than TF-IDF traditional method with lower computation cost. Also, constructed lexicons outperformed premade lexicons accuracy results. © 2017, © 2017 Informa UK Limited, trading as Taylor & Francis Group.
引用
收藏
页码:127 / 137
页数:10
相关论文
共 31 条
[1]  
Liu B., Sentiment analysis and opinion mining, Synth Lect Hum Lang Technol, 16, pp. 1-167, (2012)
[2]  
Alqasemi F., Abdelwahab A., Abelkader H., Adapting domain-specific sentiment lexicon using new NLP-based method in Arabic language, Int J Comput Syst, 3, pp. 188-193, (2016)
[3]  
Taboada M., Brooke J., Tofiloski M., Et al., Lexicon-based methods for sentiment analysis, Comput Linguist, 37, pp. 267-307, (2011)
[4]  
Bai A., Hammer H., Yazidi A., Et al., Constructing sentiment lexicons in norwegian from a large text corpus, IEEE 17th 2014 International Conference on Computational Science and Engineering (CSE), (2014)
[5]  
Mahyoub F., Siddiqui M., Dahab M., Building an Arabic sentiment lexicon using semi-supervised learning, J King Saud Univ-Comput Inf Sci, 26, pp. 417-424, (2014)
[6]  
Habash N., Introduction to Arabic natural language processing, Synth Lect Hum Lang Technol, 10, pp. 1-167, (2010)
[7]  
Abdelwahab A., Alqasemi F., abelkader H., Enhancing the performance of sentiment analysis supervised learning using sentiments keywords based technique, Seventh International Conference on Computer Science and Information Technology (CCSIT), (2017)
[8]  
Abdulla N., Majdalawi R., Mohammed S., Et al., Automatic lexicon construction for Arabic sentiment analysis, IEEE International Conference on Future Internet of Things and Cloud (FiCloud), (2014)
[9]  
Elarnaoty M., AbdelRahman S., Fahmy A., A machine learning approach for opinion holder extraction in Arabic language
[10]  
El-Halees A., Arabic opinion mining using combined classification approach, Proceedings of the International Arab Conference on Information Technology, ACIT, (2011)