A polarity calculation approach for lexicon-based Turkish sentiment analysis

被引:11
作者
Yurtalan, Gokhan [1 ]
Koyuncu, Murat [2 ]
Turhan, Cigdem [3 ]
机构
[1] HAVELSAN Inc, Ankara, Turkey
[2] Atilim Univ, Fac Engn, Informat Syst Engn, Ankara, Turkey
[3] Atilim Univ, Fac Engn, Software Engn, Ankara, Turkey
关键词
Sentiment analysis; lexicon-based; Turkish language; opinion mining;
D O I
10.3906/elk-1803-92
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment analysis attempts to resolve the senses or emotions that a writer or speaker intends to send across to the people about an object or event. It generally uses natural language processing and/or artificial intelligence techniques for processing electronic documents and mining the opinion specified in the content. In recent years, researchers have conducted many successful sentiment analysis studies for the English language which consider many words and word groups that set emotion polarities arising from the English grammar structure, and then use datasets to test their performance. However, there are only a limited number of studies for the Turkish language, and these studies have lower performance results compared to those studies for English. The reasons for this can be incorrect translation of datasets from English into Turkish and ignoring the special grammar structures in the latter. In this study, special Turkish words and linguistic constructs which affect the polarity of a sentence are determined with the aid of a Turkish linguist, and an appropriate lexicon-based polarity determination and calculation approach is introduced for this language. The proposed methodology is tested using different datasets collected from Twitter, and the test results show that the proposed system achieves better accuracy than the previously developed lexical-based sentiment analysis systems for Turkish. The authors conclude that especially analysis of word groups increases the overall performance of the system significantly.
引用
收藏
页码:1325 / 1339
页数:15
相关论文
共 33 条
[1]  
Akba Firat, 2014, Proceedings of the European Conference on Data Mining 2014 and International Conferences on Intelligent Systems and Agents 2014 and Theory and Practice in Modern Computing 2014, P180
[2]   Sentiment analysis with Twitter [J].
Akgul, Eyup Sercan ;
Ertano, Caner ;
Diri, Banu .
PAMUKKALE UNIVERSITY JOURNAL OF ENGINEERING SCIENCES-PAMUKKALE UNIVERSITESI MUHENDISLIK BILIMLERI DERGISI, 2016, 22 (02) :106-110
[3]  
Akin AA., 2007, Structure, V10, P1
[4]   Predicting consumer sentiments from online text [J].
Bai, Xue .
DECISION SUPPORT SYSTEMS, 2011, 50 (04) :732-742
[5]  
Balahur A, 2014, LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, P4265
[6]  
Çetin M, 2013, SIG PROCESS COMMUN
[7]  
Çoban Ö, 2015, SIG PROCESS COMMUN, P2388, DOI 10.1109/SIU.2015.7130362
[8]   Sentiment analysis in Turkish at different granularity levels [J].
Dehkharghani, Rahim ;
Yanikoglu, Berrin ;
Saygin, Yucel ;
Oflazer, Kemal .
NATURAL LANGUAGE ENGINEERING, 2017, 23 (04) :535-559
[9]   SentiTurkNet: a Turkish polarity lexicon for sentiment analysis [J].
Dehkharghani, Rahim ;
Saygin, Yucel ;
Yanikoglu, Berrin ;
Oflazer, Kemal .
LANGUAGE RESOURCES AND EVALUATION, 2016, 50 (03) :667-685
[10]   Unsupervised method for sentiment analysis in online texts [J].
Fernandez-Gavilanes, Milagros ;
Alvarez-Lopez, Tamara ;
Juncal-Martinez, Jonathan ;
Costa-Montenegro, Enrique ;
Gonzalez-Castano, Francisco Javier .
EXPERT SYSTEMS WITH APPLICATIONS, 2016, 58 :57-75