A Text Mining Application of Emotion Classifications of Twitter's Users Using Naive Bayes Method

被引:0
作者
Wikarsa, Liza [1 ]
Thahir, Sherly Novianti [1 ]
机构
[1] Univ De La Salle Manado, Informat Engn, Manado, Indonesia
来源
PROCEEDING OF 2015 1ST INTERNATIONAL CONFERENCE ON WIRELESS AND TELEMATICS (ICWT) | 2015年
关键词
text mining; Twitter; emotion; classification; naive bayes;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Twitter is one of social media with more than 500 million users and 400 million tweets per day. In any written tweet of twitter users it contains various emotions. Most research on the use of social media classifies sentiments into three categories that are positive, negative, and neutral. However, none of these studies has developed an application that can detect user emotions in the social media, particularly on Twitter. Hence, this research developed a text mining application to detect emotions of Twitter users that are classified into six emotions, namely happiness, sadness, anger, disgust, fear, and surprise. Three main phases of the text mining utilized in this application were preprocessing, processing, and validation. Activities conducted in the preprocessing phase were case folding, cleansing, stop-word removal, emoticons conversion, negation conversion, and tokenization to the training data and the test data based on the sentiment analysis that performed morphological analysis to build several models. In the processing phase, it performed weighting and classification using the Naive Bayes algorithm on the validated model. The process for measuring the level of accuracy generated by the application using 10-fold cross validation was done in the validation phase. The findings showed that this application is able to achieve 83% accuracy for 105 tweets. In order to get a higher accuracy, one requires a better model in training data.
引用
收藏
页数:6
相关论文
共 12 条
  • [1] Adedoyin-Olowe M., 2013, A survey of data mining techniques for social media analysis
  • [2] [Anonymous], 2014, DRAMATIC LANGUAGES
  • [3] [Anonymous], 2003, EMOTIONS REVEALED RE
  • [4] [Anonymous], 2010, Text Mining: Applications and Theory
  • [5] Farber D., 2012, Twitter hits 400 million tweets per day, mostly mobile
  • [6] Gundecha P., 2012, INFORMS
  • [7] Han J, 2012, MOR KAUF D, P1
  • [8] Irfan R., 2004, KNOWL ENG REV, P1
  • [9] Juju D., 2010, BROADING PROMOTION S
  • [10] Patil T., 2013, J IEEE COMPUTER SOC, V4, P31661