Social Media Sentiment Analysis Using K-Means and Naive Bayes Algorithm

被引:0
作者
Zul, Muhammad Ihsan [1 ]
Yulia, Feoni [2 ]
Nurmalasari, Dini [3 ]
机构
[1] Politekn Caltex Riau, Informat Engn, Pekanbaru, Indonesia
[2] Politekn Caltex Riau, Informat Syst, Pekanbaru, Indonesia
[3] Politekn Caltex Riau, Comp Engn, Pekanbaru, Indonesia
来源
2018 2ND INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATICS (ICON EEI): TOWARD THE MOST EFFICIENT WAY OF MAKING AND DEALING WITH FUTURE ELECTRICAL POWER SYSTEM AND BIG DATA ANALYSIS | 2018年
关键词
k-means; naive bayes; sentiwordnet; sentiment analysis; text mining; k-fold cross validation; confusion matrix;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Opinions are a major influence when making decisions for individuals or organizations. A collection of opinions can be extracted to gain useful knowledge. This knowledge is used as a source of information which can be used as a consideration in decision making. The extraction of knowledge from text has been known as text mining. Text mining has any kinds of algorithm to extract information from collected text, such as K-Means, K-Nearest Neighbors, Naive Bayes, and the others. One of the sources of opinion is from social media, especially Facebook and Twitter. On Facebook and Twitter, many people have been writing their opinions about many things. This very much data are difficult to analyze thoroughly. In this paper, K-Means and Naive Bayes algorithm are developed to analyze public opinions or sentiments. Outlier removal is also added to this analysis. Opinions are taken from Facebook and Twitter. The accuracy of the system is tested 10 times at k different points for each k value (k=6, 7, 8, 9 and 10). As the result, the combination of K-Means and Naive Bayes has lower accuracy than the accuracy produced by Naive Bayes without the combination of K-Means, but almost same accuracies. The accuracy of Naive Bayes algorithm is from 80.526%-82.500%, while the combination of Naive Bayes and K-Means has 80.323%-81.523% accuracy.
引用
收藏
页码:24 / 29
页数:6
相关论文
共 9 条
  • [1] [Anonymous], INT J COMPUTER SCI A
  • [2] Buana P.W., 2012, INT J COMPUTER APPL, V50, P37
  • [3] Dey Lopamudra, 2016, International Journal of Information Engineering and Electronic Business, V8, P54, DOI 10.5815/ijieeb.2016.04.07
  • [4] Gurusamy V., 2017, INT J ADV RES COMPUT, V7
  • [5] Jotheeswaran J., 2013, J THEORETICAL APPL I, V58
  • [6] Liu B, 2012, SENTIMENT ANAL OPINI, DOI DOI 10.2200/S00416ED1V01Y201204HLT016
  • [7] Narayanan V, 2013, LECT NOTES COMPUT SC, V8206, P194, DOI 10.1007/978-3-642-41278-3_24
  • [8] IMPLEMENTATION OF K-MEANS CLUSTERING METHOD FOR ELECTRONIC LEARNING MODEL
    Sari, Herlina Latipa
    Suranti, Dewi
    Zulita, Leni Natalia
    [J]. INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICONICT), 2017, 930
  • [9] An Efficient Global K-means Clustering Algorithm
    Xie, Juanying
    Jiang, Shuai
    Xie, Weixin
    Gao, Xinbo
    [J]. JOURNAL OF COMPUTERS, 2011, 6 (02) : 271 - 279