A reliable sentiment analysis for classification of tweets in social networks

被引：0

作者：

Masoud AminiMotlagh

HadiShahriar Shahhoseini

Nina Fatehi

机构：

[1] Iran University of Science and Technology,School of Electrical Engineering

[2] Wayne State University,Department of Electrical and Computer Engineering

来源：

Social Network Analysis and Mining | / 13卷

关键词：

Social networks analysis; Sentiment analysis; Data mining; Text mining;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

In modern society, the use of social networks is more than ever and they have become the most popular medium for daily communications. Twitter is a social network where users are able to share their daily emotions and opinions with tweets. Sentiment analysis is a method to identify these emotions and determine whether a text is positive, negative, or neutral. In this article, we apply four widely used data mining classifiers, namely K-nearest neighbor, decision tree, support vector machine, and naive Bayes, to analyze the sentiment of the tweets. The analysis is performed on two datasets: first, a dataset with two classes (positive and negative) and then a three-class dataset (positive, negative and neutral). Furthermore, we utilize two ensemble methods to decrease variance and bias of the learning algorithms and subsequently increase the reliability. Also, we have divided the dataset into two parts: training set and testing set with different percentages of data to show the best train–test split ratio. Our results show that support vector machine demonstrates better outcomes compared to other algorithms, showing an improvement of 3.53% on dataset with two-class data and 7.41% on dataset with three-class data in accuracy rate compared to other algorithms. The experiments show that the accuracy of single classifiers slightly outperforms that of ensemble methods; however, they propose more reliable learning models. Results also demonstrate that using 50% of the dataset as training data has almost the same results as 70%, while using tenfold cross-validation can reach better results.

引用

共 78 条

[1]

Al-Laith A(2021)Arasencorpus: a semi-supervised approach for sentiment annotation of a large arabic text corpus Appl Sci 11 2434-946

[2]

Shahbaz M(2018)An ensemble classification system for Twitter sentiment analysis Procedia Comput Sci 132 937-294

[3]

Alaskar HF(2021)ABCDM: an attention-based bidirectional CNN-RNN deep model for sentiment analysis Futur Gener Comput Syst 115 279-1829

[4]

Rehmat A(2021)Making sense of tweets using sentiment analysis on closely related topics Soc Netw Anal Min 11 44-705

[5]

Ankit N(2020)A comprehensive analysis of adverb types for mining user sentiments on amazon product reviews World Wide Web 23 1811-104

[6]

Saleena ME(2020)Analyzing the sentiment correlation between regular tweets and retweets Soc Netw Anal Min 10 13-310

[7]

Basiri S(2020)Tweets can tell: activity recognition using hybrid gated recurrent neural networks Soc Netw Anal Min 10 16-undefined

[8]

Nemati M(2021)Unsupervised Sentiment Analysis by Transferring Multi-source Knowledge Cogn Comput 118 108475-undefined

[9]

Abdar E(2022)An automata algorithm for generating trusted graphs in online social networks Appl Soft Comput 10 61-undefined

[10]

Cambria AU(2020)Feature selection methods for event detection in Twitter: a text mining approach Soc Netw Anal Min 10 82-undefined

← 1 2 3 4 5 6 7 8 →