Sentiment Analysis using Optimised Feature Sets in Different Facebook/Twitter Dataset Domains with Big Data

被引:0
作者
Al-Mashhadani M.I. [1 ]
Hussein K.M. [2 ]
Khudir E.T. [3 ]
Ilyas M. [4 ]
机构
[1] Department of Electrical and Electronics Engineering, Altinbas University, Istanbul
[2] Diyala University, Electronic Computer Center
[3] Electrical and Electronics Engineering, Altinbas University
来源
Iraqi Journal for Computer Science and Mathematics | 2022年 / 3卷 / 01期
关键词
Big data; Classification; Data Mining; Features; Online Social Networks; Opinion Mining; Sentiment Analysis;
D O I
10.52866/ijcsm.2022.01.01.007
中图分类号
学科分类号
摘要
Nowadays, in many real-life applications, sentiment analysis plays a vital role in the automatic prediction of human activities, especially on online social networks (OSNs). Since the last decade, the research on opinion mining and sentiment analysis has been growing with the increase in volume of online reviews available over the social media networks, such as Facebook OSNs. Sentiment analysis falls under the data mining domain research problem. Sentiment analysis is a type of text mining process to determine subjective attitude, such as sentiments from written texts, and hence has become a main research interest in domains of natural language processing and data mining. The main task of sentiment analysis is to classify human sentiments with the objective of classifying the sentiment or emotion of end users based on their specific text on the OSNs. Several research approaches have been designed for sentiment analysis, in which the factors of accuracy, efficiency and speed have been used to evaluate the effectiveness of sentiment analysis methods. The Map-Reduce framework under the domain of big data is used to minimise the speed of execution and efficiency recently with many data mining methods. The sentiment analysis of messages on Facebook OSNs is more challenging compared with those of other sites because of the misspellings and slang words in their Twitter dataset. In this study, the different solutions that have been recently presented are discussed in detail. Then, a new approach for sentiment analysis based on hybrid feature extraction and multi-class support vector machine methods is proposed. The algorithms have been designed using big data techniques in view of optimising the performance of sentiment analysis. © 2022 Iraqi Journal for Computer Science and Mathematics. All rights reserved.
引用
收藏
页码:64 / 70
页数:6
相关论文
共 14 条
[1]  
Turney P. D., Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews, Proceedings of the 40th annual meeting on association for computational linguistics, (2002)
[2]  
Kamps J., Marx M., Mokken R. J., De Rijke M., Using wordnet to measure semantic orientations of adjectives, (2004)
[3]  
Pucci D., Baroni M., Cutugno F., Lenci A., Unsupervised lexical substitution with a word space model, Proceedings of EVALITA workshop, 11th Congress of Italian Association for Artificial Intelligence, (2009)
[4]  
Balahur A., Hermida J. M., Montoyo A., Building and exploiting emotinet, a knowledge base for emotion detection based on the appraisal theory model, Affective Computing, IEEE Transactions on, 3, 1, (2012)
[5]  
Barbosa L., Feng J., Robust sentiment detection on twitter from biased and noisy data, Proceedings of the 23rd International Conference on Computational Linguistics: Posters, (2010)
[6]  
Celikyilmaz A., Hakkani-Tur D., Feng J., Probabilistic model-based sentiment analysis of twitter messages, Spoken Language Technology Workshop (SLT), 2010 IEEE, (2010)
[7]  
Wu Y., Ren F., Learning sentimental influence in twitter, Future Computer Sciences and Application (ICFCSA), 2011 International Conference on, (2011)
[8]  
Pak A., Paroubek P., Twitter as a corpus for sentiment analysis and opinion mining, Proceedings of LREC, 2010, (2010)
[9]  
Xia R., Zong C., Li S., Ensemble of feature sets and classification algorithms for sentiment classification, Information Sciences: and International Journal, 181, 6, (2011)
[10]  
Neethu M.S., Rajasree R., Sentiment Analysis in Twitter using Machine Learning Techniques, 4th ICCCNT 2013, (2013)