Classifying streaming of Twitter data based on sentiment analysis using hybridization

Cited by: 50
Authors
Nagarajan, Senthil Murugan [1]
Gandhi, Usha Devi [1]
Affiliations
[1] VIT Univ, Sch Informat Technol & Engn, Vellore, Tamil Nadu, India
Funding
National Institutes of Health (USA);
Keywords
Sentiment analysis; Preprocessing; Machine learning; Particle swarm optimization (PSO); Genetic algorithm (GA); Decision tree (DT); Classification
DOI
10.1007/s00521-018-3476-3
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Twitter is a social media platform that has grown rapidly. Because millions of Twitter messages are sent every day, developing new techniques for detecting spammers has become increasingly important. Legitimate users are also affected by spam in the form of unwanted URLs, irrelevant messages, etc. Another active research topic is sentiment analysis, which is based on each tweet sent by a user, together with opinion mining of customer reviews. Natural language processing is the most common approach to sentiment analysis. Text is collected from users' tweets through opinion mining and automatic sentiment analysis oriented toward ternary classification: positive, neutral, and negative. Because tweets are short and unstructured and contain misspellings, slang, and abbreviations, identifying sentiment in Twitter data is especially challenging for researchers. In this paper, we collected 600 million public tweets using a URL-based security tool and applied feature generation for sentiment analysis. The ternary classification is performed after preprocessing, and results are obtained for the tweets sent by users. We use a hybridization technique that combines two optimization algorithms, particle swarm optimization and a genetic algorithm, with one machine learning classifier, a decision tree, to improve the classification accuracy of sentiment analysis. The results are compared with previous work, and the proposed method gives better results than the other classifiers considered.
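The abstract names short length, misspellings, slang, and abbreviations as the main obstacles, but does not describe the preprocessing step itself. As a minimal sketch of what such a step might look like, and not the authors' pipeline, the Python below lowercases a tweet, strips URLs and mentions, collapses elongated words, and expands a few abbreviations. The `SLANG` table, the regular expressions, and the `preprocess` function are all hypothetical names introduced for illustration.

```python
import re
from typing import List

# Hypothetical abbreviation table; the paper's actual lexicon is not given.
SLANG = {"u": "you", "gr8": "great", "thx": "thanks"}

URL_RE = re.compile(r"https?://\S+|www\.\S+")   # links, e.g. spam URLs
MENTION_RE = re.compile(r"@\w+")                # user mentions
ELONGATED_RE = re.compile(r"(.)\1{2,}")         # 3+ repeats of one character
TOKEN_RE = re.compile(r"[a-z0-9']+")            # simple word tokens


def preprocess(tweet: str) -> List[str]:
    """Return a cleaned token list for one tweet (illustrative only)."""
    text = tweet.lower()
    text = URL_RE.sub(" ", text)            # drop URLs
    text = MENTION_RE.sub(" ", text)        # drop @user mentions
    text = text.replace("#", " ")           # keep hashtag words, drop the sign
    text = ELONGATED_RE.sub(r"\1\1", text)  # "soooo" -> "soo"
    tokens = TOKEN_RE.findall(text)
    return [SLANG.get(tok, tok) for tok in tokens]  # expand abbreviations


if __name__ == "__main__":
    print(preprocess("@bob this phone is gr8!!! soooo happy, thx http://t.co/abc #win"))
```

The resulting tokens could then feed a feature generation step and a ternary (positive, neutral, negative) classifier; how the paper couples particle swarm optimization and the genetic algorithm with the decision tree is not specified in the abstract, so that part is left out here.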
Pages: 1425-1433
Number of pages: 9