FinSoSent: Advancing Financial Market Sentiment Analysis through Pretrained Large Language Models

被引:5
作者
Delgadillo, Josiel [1 ]
Kinyua, Johnson [2 ]
Mutigwe, Charles [3 ]
机构
[1] Univ Penn, Sch Engn & Appl Sci, Philadelphia, PA 19104 USA
[2] Penn State Univ, Coll Informat Sci & Technol, Philadelphia, PA 19104 USA
[3] Western New England Univ, Coll Business, Springfield, MA 01119 USA
关键词
BERT; financial markets; Twitter/X; StockTwits; sentiment analysis; LLM; social media;
D O I
10.3390/bdcc8080087
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Predicting the directions of financial markets has been performed using a variety of approaches, and the large volume of unstructured data generated by traders and other stakeholders on social media microblog platforms provides unique opportunities for analyzing financial markets using additional perspectives. Pretrained large language models (LLMs) have demonstrated very good performance on a variety of sentiment analysis tasks in different domains. However, it is known that sentiment analysis is a very domain-dependent NLP task that requires knowledge of the domain ontology, and this is particularly the case with the financial domain, which uses its own unique vocabulary. Recent developments in NLP and deep learning including LLMs have made it possible to generate actionable financial sentiments using multiple sources including financial news, company fundamentals, technical indicators, as well social media microblogs posted on platforms such as StockTwits and X (formerly Twitter). We developed a financial social media sentiment analyzer (FinSoSent), which is a domain-specific large language model for the financial domain that was pretrained on financial news articles and fine-tuned and tested using several financial social media corpora. We conducted a large number of experiments using different learning rates, epochs, and batch sizes to yield the best performing model. Our model outperforms current state-of-the-art FSA models based on over 860 experiments, demonstrating the efficacy and effectiveness of FinSoSent. We also conducted experiments using ensemble models comprising FinSoSent and the other current state-of-the-art FSA models used in this research, and a slight performance improvement was obtained based on majority voting. Based on the results obtained across all models in these experiments, the significance of this study is that it highlights the fact that, despite the recent advances of LLMs, sentiment analysis even in domain-specific contexts remains a difficult research problem.
引用
收藏
页数:21
相关论文
共 71 条
[1]  
Agaian S., 2017, Int. J. Invest. Manag. Financ. Innov, V3, P1
[2]  
Amazon Web Services, Amazon comprehend-determine sentiment
[3]  
Amazon Web Services, Amazon Comprehend: Features
[4]  
[Anonymous], 1 WORKSH FIN TECHN N
[5]  
[Anonymous], 2023, Financial terms dictionary
[6]  
Araci D.T., 2020, FinBERT: Financial Sentiment Analysis with BERT Prosus AI Tech Blog
[7]  
Araci D, 2019, Arxiv, DOI arXiv:1908.10063
[8]  
Balaji P, 2017, PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS AND COMPUTATIONAL INTELLIGENCE (ICBDAC), P436, DOI 10.1109/ICBDACI.2017.8070879
[9]  
Bartunov O., 2007, Technical Report
[10]  
Beltagy I, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P3615