Extraction of emotions from multilingual text using intelligent text processing and computational linguistics

被引:79
作者
Jain, Vinay Kumar [1 ]
Kumar, Shishir [1 ]
Fernandes, Steven Lawrence [2 ]
机构
[1] Jaypee Univ Engn & Guna MP, Dept Comp Sci & Engn, Guna, MP, India
[2] Sahyadri Coll Engn & Management, Dept Elect & Commun Engn, Mangalore, Karnataka, India
关键词
Emotion extraction; Machine learning; Text mining; Twitter; Classification; Natural language processing; TWITTER; ELECTION; US;
D O I
10.1016/j.jocs.2017.01.010
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Extraction of Emotions from Multilingual Text posted on social media by different categories of users is one of the crucial tasks in the field of opining mining and sentiment analysis. Every major event in the world has an online presence and social media. Users use social media platforms to express their sentiments and opinions towards it. In this paper, an advanced framework for detection of emotions of users in Multilanguage text data using emotion theories has been presented, which deals with linguistics and psychology. The emotion extraction system is developed based on multiple features groups for the better understanding of emotion lexicons. Empirical studies of three real-time events in domains like a Political election, healthcare, and sports are performed using proposed framework. The technique used for dynamic keywords collection is based on RSS (Rich Site Summary) feeds of headlines of news articles and trending hashtags from Twitter. An intelligent data collection model has been developed using dynamic keywords. Every word of emotion contained in a tweet is important in decision making and hence to retain the importance of multilingual emotional words, effective pre-processing technique has been used. Naive Bayes algorithm and Support Vector Machine (SVM) are used for fine-grained emotions classification of tweets. Experiments conducted on collected data sets, show that the proposed method performs better in comparison to corpus-driven approach which assign affective orientation or scores to words. The proposed emotion extraction framework performs better on the collected dataset by combining feature sets consisting of words from publicly available lexical resources. Furthermore, the presented work for extraction of emotion from tweets performs better in comparisons of other popular sentiment analysis techniques which are dependent of specific existing affect lexicons. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:316 / 326
页数:11
相关论文
共 89 条
[1]  
Alm C. O., 2005, P HUM LANG TECHN C, P579
[2]  
Aman S., 2007, THESIS
[3]  
Aman S., 2008, Proceedings of the Third International Joint Conference on Natural Language Processing, P296
[4]  
Aman S, 2007, LECT NOTES ARTIF INT, V4629, P196
[5]  
Anjaria M., 2014, P 6 INT C COMM SYST, P1
[6]  
[Anonymous], 2011, Proceedings of the conference on empirical methods in natural language processing
[7]  
[Anonymous], 2014, KDD WORKSH LARG SCAL
[8]  
[Anonymous], 2013, SMA BCS SGAI
[9]  
[Anonymous], 2005, Proceedings of the ACL student research workshop
[10]  
[Anonymous], P 19 TRIENN C INT ER