Sentiment analysis of financial Twitter posts on Twitter with the machine learning classifiers

Cited by: 10
Authors
Cam, Handan [1]
Cam, Alper Veli [2]
Demirel, Ugur [3]
Ahmed, Sana [4]
Affiliations
[1] Gumushane Univ, Fac Econ & Adm Sci, Dept Management Informat Syst, TR-29000 Gumushane, Turkiye
[2] Gumushane Univ, Fac Hlth Sci, Dept Hlth Care Management, TR-29000 Gumushane, Turkiye
[3] Gumushane Univ, Irfan Can Kose Vocat Sch, TR-29000 Gumushane, Turkiye
[4] Univ Reading, Henley Business Sch, Reading RG6 6AH, England
Keywords
Sentiment analysis; Natural language processing; Machine learning; Stock market; Twitter; Classification; Opinions
DOI
10.1016/j.heliyon.2023.e23784
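Chinese Library Classification (CLC) codes
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification codes
07; 0710; 09
Abstract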
This paper presents a sentiment analysis that combines lexicon-based and machine learning (ML)-based approaches in Turkish to investigate public mood for predicting stock market behavior in BIST30, Borsa Istanbul. The main motivation of this study is to apply sentiment analysis to finance-related tweets in Turkish. We import 17,189 tweets posted with the hashtags "#Borsaistanbul, #Bist, #Bist30, #Bist100" on Twitter between November 7, 2022, and November 15, 2022, via MAXQDA 2020, a qualitative data analysis program. For the lexicon-based side, we use the multilingual sentiment method offered by the Orange program to label the polarity of the 17,189 samples as positive, negative, or neutral. Neutral samples are discarded for the machine learning experiments. For the machine learning side, we select 9,076 samples labeled positive or negative and treat the task as a classification problem, using six different supervised machine learning classifiers implemented in Python 3.6 with the sklearn library. In the experiments, 80% of the selected data is used for the training phase and the rest is used for the testing and validation phase. The results show that the Support Vector Machine and Multilayer Perceptron classifiers perform better than the other classifiers, with accuracies of 0.89 and 0.88 and AUC values of 0.8729 and 0.8647, respectively. The other classifiers obtain an accuracy of approximately 78.5%. It may be possible to increase sentiment analysis accuracy through parameter optimization on a larger, cleaner, and more balanced dataset and by changing the pre-processing steps. This work can be expanded in the future to develop better sentiment analysis using deep learning approaches.
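The machine learning side of the abstract (an 80/20 split, sklearn classifiers, accuracy and AUC) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and not taken from the paper: the toy corpus, the TF-IDF features, the linear SVM kernel, the MLP layer size, and the helper names `build_toy_corpus`, `evaluate`, and `run_experiment` are all hypothetical. Only two of the six classifiers are shown.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics import accuracy_score, roc_auc_score
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPClassifier
from sklearn.svm import SVC


def build_toy_corpus():
    """Hypothetical stand-in for the labelled tweets (1 = positive, 0 = negative)."""
    positive_words = ["yükseliş", "kazanç", "rekor", "güçlü", "alım"]
    negative_words = ["düşüş", "kayıp", "zarar", "zayıf", "satış"]
    positive = [f"bist30 {a} {b}" for a in positive_words
                for b in positive_words if a != b]
    negative = [f"bist30 {a} {b}" for a in negative_words
                for b in negative_words if a != b]
    texts = positive + negative
    labels = [1] * len(positive) + [0] * len(negative)
    return texts, labels


def evaluate(model, X_train, X_test, y_train, y_test):
    """Fit one classifier and report accuracy and AUC on the held-out split."""
    model.fit(X_train, y_train)
    predictions = model.predict(X_test)
    # AUC needs a continuous score: use the SVM margin when available,
    # otherwise the predicted probability of the positive class.
    if hasattr(model, "decision_function"):
        scores = model.decision_function(X_test)
    else:
        scores = model.predict_proba(X_test)[:, 1]
    return {
        "accuracy": accuracy_score(y_test, predictions),
        "auc": roc_auc_score(y_test, scores),
    }


def run_experiment(seed=42):
    texts, labels = build_toy_corpus()
    # 80% of the data for training, the remaining 20% held out.
    train_texts, test_texts, y_train, y_test = train_test_split(
        texts, labels, test_size=0.2, stratify=labels, random_state=seed
    )
    # TF-IDF features are an assumption; the abstract does not name the features.
    vectorizer = TfidfVectorizer()
    X_train = vectorizer.fit_transform(train_texts)
    X_test = vectorizer.transform(test_texts)
    models = {
        "SVM": SVC(kernel="linear"),
        "MLP": MLPClassifier(hidden_layer_sizes=(16,), max_iter=500,
                             random_state=seed),
    }
    return {name: evaluate(model, X_train, X_test, y_train, y_test)
            for name, model in models.items()}


if __name__ == "__main__":
    for name, scores in run_experiment().items():
        print(name, scores)
```

Scores from this toy corpus say nothing about the reported results; the sketch only shows how the split and the two metrics fit together.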
Pages: 15