Sentiment Analysis in Turkish Based on Weighted Word Embeddings

被引:12
作者
Onan, Aytug [1 ]
机构
[1] Izmir Katip Celebi Univ, Bilgisayar Muhendisligi Bolumu, Izmir, Turkey
来源
2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU) | 2020年
关键词
sentiment analysis; word embeddings; vector pooling;
D O I
10.1109/siu49456.2020.9302182
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In the era of big data, natural language processing becomes an important research discipline, owing to the immense quantity of text documents and the progresses in machine learning. Natural language processing has been succesfully employed in many different areas, including machine translation, search engines, virtual assistants, spam filtering, question answering and sentiment analysis. Recent studies in the field of natural language processing indicate that word embedding based representation, in which words have been represented in dense spaces through fixed length vectors, can yield promising results. In this study, we evaluate the predictive performance of 36 word embedding based representation obtained by three word embedding methods (i.e., word2vec, fastText and DOC2vec), two basic weighting functions (i.e., inverse document frequency and smooth inverse document frequency) and three vector pooling schemes (namely, weighted sum, center based approach and delta rule). Experimental analysis indicates that word2vec based representation in conjunction with inverse document frequency based weighting and center based pooling, yields promising results for sentiment analysis in Turkish.
引用
收藏
页数:4
相关论文
共 18 条
[1]  
Aci C.I., 2018, INT J INFORM TECHOLO, V12, P219
[2]  
Adali E, TURKIYE BILISIM VAKF, V5, P1
[3]  
Akin A. A., 2007, Structure, V10, P1
[4]  
Arora Sanjeev, 2017, ICLR
[5]  
Ayata D, 2017, SIG PROCESS COMMUN
[6]  
Bojanowski Piotr, 2016, Transactions of the Association for Computational Linguistics
[7]   An evaluation of document clustering and topic modelling in two online social networks: Twitter and Reddit [J].
Curiskis, Stephan A. ;
Drake, Barry ;
Osborn, Thomas R. ;
Kennedy, Paul J. .
INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (02)
[8]   Representation learning for very short texts using weighted word embedding aggregation [J].
De Boom, Cedric ;
Van Canneyt, Steven ;
Demeester, Thomas ;
Dhoedt, Bart .
PATTERN RECOGNITION LETTERS, 2016, 80 :150-156
[9]  
Djaballah KA, 2019, 2019 SIXTH INTERNATIONAL CONFERENCE ON SOCIAL NETWORKS ANALYSIS, MANAGEMENT AND SECURITY (SNAMS), P223, DOI [10.1109/snams.2019.8931827, 10.1109/SNAMS.2019.8931827]
[10]  
Gungor O, 2018, P 26 SIGN PROC COMM, P1