High Performance Twitter Sentiment Analysis Using CUDA Based Distance Kernel on GPUs

被引:6
作者
Bozkurt, Ferhat [1 ]
Coban, Onder [2 ]
Gunay, Faruk Baturalp [1 ]
Yucel Altay, Seyma [1 ]
机构
[1] Ataturk Univ, Fac Engn, Dept Comp Engn, TR-25240 Erzurum, Turkey
[2] Adiyaman Univ, Fac Engn, Dept Comp Engn, TR-02040 Adiyaman, Turkey
来源
TEHNICKI VJESNIK-TECHNICAL GAZETTE | 2019年 / 26卷 / 05期
关键词
CUDA; k-NN; LDA; parallel computing; sentiment analysis; twitter;
D O I
10.17559/TV-20180123005000
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Sentiment analysis techniques are widely used for extracting feelings of users in different domains such as social media content, surveys, and user reviews. This is mostly performed by using classical text classification techniques. One of the major challenges in this field is having a large and sparse feature space that stems from sparse representation of texts. The high dimensionality of the feature space creates a serious problem in terms of time and performance for sentiment analysis. This is particularly important when selected classifier requires intense calculations as in k-NN. To cope with this problem, we used sentiment analysis techniques for Turkish Twitter feeds using the NVIDIA's CUDA technology. We employed our CUDA-based distance kernel implementation for k-NN which is a widely used lazy classifier in this field. We conducted our experiments on four machines with different computing capacities in terms of GPU and CPU configuration to analyze the impact on speed-up.
引用
收藏
页码:1218 / 1227
页数:10
相关论文
共 57 条
[1]  
Agarwal Apoorv., 2011, P WORKSHOP LANGUAGES
[2]  
Alpaydin E, 2014, ADAPT COMPUT MACH LE, P1
[3]  
[Anonymous], SCALING MACHINE LEAR
[4]  
[Anonymous], 2014, MUHENDISLIK BILIMLER
[5]  
[Anonymous], PROGRAMMING MASSIVEL
[6]  
[Anonymous], 2010, 2010 IEEE 2 S WEB SO, DOI DOI 10.1109/SWS.2010.5607480
[7]  
[Anonymous], 2011, P AM C INF SYST AMCI
[8]  
[Anonymous], BALKAN J ELECT COMPU
[9]  
[Anonymous], DATA MINING CONCEPTS
[10]  
[Anonymous], ARCHITECTURE