Classifying streaming of Twitter data based on sentiment analysis using hybridization

Cited by: 50
Authors
Nagarajan, Senthil Murugan [1]
Gandhi, Usha Devi [1]
Affiliations
[1] VIT Univ, Sch Informat Technol & Engn, Vellore, Tamil Nadu, India
Funding
National Institutes of Health (NIH), USA;
Keywords
Sentiment analysis; Preprocessing; Machine learning; Particle swarm optimization (PSO); Genetic algorithm (GA); Decision tree (DT); Classification;
DOI
10.1007/s00521-018-3476-3
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Twitter is a social media platform that has grown rapidly. With millions of tweets sent every day, new techniques for detecting spammers have become increasingly important, since legitimate users are affected by spam in the form of unwanted URLs, irrelevant messages, and similar content. Another active research topic is sentiment analysis, which is based on the individual tweets sent by users, together with opinion mining of customer reviews. Sentiment analysis most commonly relies on natural language processing. Text is collected from users' tweets, and opinion mining and automatic sentiment analysis are applied with a ternary classification: positive, neutral, or negative. The limited length, unstructured nature, misspellings, slang, and abbreviations of tweets make it challenging to determine sentiment in Twitter data. In this paper, we collected 600 million public tweets using a URL-based security tool and applied feature generation for sentiment analysis. The ternary classification is performed after a preprocessing step, yielding sentiment results for the tweets sent by users. We use a hybridization technique that combines two optimization algorithms, particle swarm optimization and a genetic algorithm, with one machine learning classifier, a decision tree, to improve the classification accuracy of sentiment analysis. The results are compared with previous work, and the proposed method gives better results than the other classifiers.
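The abstract does not say how the three components are wired together, so the following is only a minimal sketch of one plausible arrangement, not the authors' method: a binary PSO searches over feature subsets, a GA-style crossover and mutation step refreshes the weakest particle each iteration, and a small decision tree scores each subset by held-out accuracy. The vocabulary, toy tweets, function names, and all hyperparameters (swarm size, inertia 0.7, acceleration 1.5, tree depth) are hypothetical choices made for illustration.

```python
import math
import random

VOCAB = ["good", "love", "bad", "hate", "ok", "the"]  # hypothetical toy vocabulary

def vectorize(text):
    """Binary bag-of-words vector over VOCAB."""
    words = set(text.lower().split())
    return [int(w in words) for w in VOCAB]

# Hypothetical toy tweets with ternary labels (pos / neu / neg).
TRAIN = [(vectorize(t), y) for t, y in [
    ("love the good weather", "pos"), ("good day", "pos"),
    ("hate the bad traffic", "neg"), ("bad service", "neg"),
    ("the bus is ok", "neu"), ("ok then", "neu")]]
TEST = [(vectorize(t), y) for t, y in [
    ("love it", "pos"), ("hate the rain", "neg"), ("ok the end", "neu")]]

def gini(labels):
    n = len(labels)
    return 1.0 - sum((labels.count(c) / n) ** 2 for c in set(labels))

def majority(labels):
    return max(sorted(set(labels)), key=labels.count)

def build_tree(rows, labels, features, depth=3):
    """Tiny decision tree on binary features; a leaf is a class label string."""
    if depth == 0 or len(set(labels)) == 1 or not features:
        return majority(labels)
    def split_cost(f):
        left = [y for x, y in zip(rows, labels) if x[f] == 0]
        right = [y for x, y in zip(rows, labels) if x[f] == 1]
        if not left or not right:
            return float("inf")  # feature does not separate anything
        return (len(left) * gini(left) + len(right) * gini(right)) / len(labels)
    best = min(features, key=split_cost)
    if split_cost(best) == float("inf"):
        return majority(labels)
    rest = [f for f in features if f != best]
    children = []
    for v in (0, 1):
        idx = [i for i, x in enumerate(rows) if x[best] == v]
        children.append(build_tree([rows[i] for i in idx],
                                   [labels[i] for i in idx], rest, depth - 1))
    return (best, children[0], children[1])

def predict(tree, x):
    while isinstance(tree, tuple):
        f, absent, present = tree
        tree = present if x[f] else absent
    return tree

def fitness(mask, train, test):
    """Held-out accuracy of a tree restricted to the features selected by mask."""
    feats = [i for i, bit in enumerate(mask) if bit]
    if not feats:
        return 0.0
    tree = build_tree([x for x, _ in train], [y for _, y in train], feats)
    return sum(predict(tree, x) == y for x, y in test) / len(test)

def hybrid_pso_ga(train, test, n_feats, swarm=8, iters=15, seed=0):
    rng = random.Random(seed)
    pos = [[rng.randint(0, 1) for _ in range(n_feats)] for _ in range(swarm)]
    vel = [[0.0] * n_feats for _ in range(swarm)]
    pbest = [p[:] for p in pos]
    pfit = [fitness(p, train, test) for p in pos]
    g = max(range(swarm), key=pfit.__getitem__)
    gbest, gfit = pbest[g][:], pfit[g]
    for _ in range(iters):
        # PSO step: velocity update, then sigmoid sampling of each bit.
        for i in range(swarm):
            for d in range(n_feats):
                vel[i][d] = (0.7 * vel[i][d]
                             + 1.5 * rng.random() * (pbest[i][d] - pos[i][d])
                             + 1.5 * rng.random() * (gbest[d] - pos[i][d]))
                pos[i][d] = int(rng.random() < 1 / (1 + math.exp(-vel[i][d])))
        scores = [fitness(p, train, test) for p in pos]
        # GA step: replace the worst particle with a mutated one-point crossover.
        worst = min(range(swarm), key=scores.__getitem__)
        a, b = rng.sample(range(swarm), 2)
        cut = rng.randrange(1, n_feats)
        child = pbest[a][:cut] + pbest[b][cut:]
        child[rng.randrange(n_feats)] ^= 1
        pos[worst], scores[worst] = child, fitness(child, train, test)
        for i in range(swarm):
            if scores[i] > pfit[i]:
                pbest[i], pfit[i] = pos[i][:], scores[i]
            if scores[i] > gfit:
                gbest, gfit = pos[i][:], scores[i]
    return gbest, gfit
```

Calling `hybrid_pso_ga(TRAIN, TEST, len(VOCAB))` returns the best feature mask found and its held-out accuracy. On real tweet data the hand-rolled tree would normally be replaced by a library classifier and the single train/test split by cross-validation.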
Pages: 1425-1433
Page count: 9