Named Entity Recognition Based Neural Network Framework for Stock Trend Prediction Using Latent Dirichlet Allocation

被引：1

作者：

Prusty, Manas Ranjan ^{[1
]}

Sinha, Apoorv Kumar ^{[2
]}

Singh, Sanskriti Sanjay Kumar ^{[2
]}

Sai, Shreyas ^{[2
]}

Poornachary, Vijayakumar Kedalu ^{[2
]}

Patra, Subhra Rani ^{[3
]}

机构：

[1] Vellore Inst Technol, Ctr Cyber Phys Syst, Chennai, Tamil Nadu, India

[2] Vellore Inst Technol, Sch Comp Sci & Engn, Chennai, Tamil Nadu, India

[3] Univ Texas Arlington, Informat Syst & Operat Management, Arlington, TX 76019 USA

来源：

ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING | 2025年

关键词：

Stock market prediction; Natural language processing; Named entity recognition; Latent Dirichlet allocation; Recurrent neural network; Pruning;

D O I：

10.1007/s13369-025-10090-4

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Stock price prediction is an extensively researched topic as the precise prophecy of stock trends is decisive in the investment marketing sphere. With increasing opinions by many market giants on the internet about given stocks, it surges the necessity to study these sentiments in detail for forthcoming predictions. From these articles on the internet, natural text is generated by examining factors that affect the values of stocks and therefore these texts are reliable features to go ahead with this study. The idea behind tackling such work is that conglomerates and businesses are able to tangibly understand the aftermath of articles that usually mobilize public opinion and gear them in a certain direction. The aim of this study is to utilize named entity recognition (NER) on a neural network framework for stock trend prediction through latent Dirichlet allocation using these natural texts generated from internet articles. This method is used to understand the words that occur at the highest frequency and add the most information to the corpus depending on the topic's importance. With this, the model adopts K x K words that have the most decisive impact on the target class that has been created with which it alters the sparse density matrix that has been generated. The proposed model of the NER-based neural network was fitted on a real-world dataset, and its performance was good in comparison with state-of-the-art models developed by fellow researchers. However, since the model does not use the BERT tokenizers, it cannot be adjudged on the FinBERT model, and therefore, the preprocessed data is fed to a pruned recurrent neural network which is robustly stopped with a simple callback function. The final result was a strong 0.81 tetrachoric correlation between the testing target class and the predicted target class. With this, the model provides a different approach to natural language processing, especially those with high sparse density for stock prediction.

引用

页数：14

共 33 条

[1] Financial sentiment analysis model utilizing knowledge-base and domain-specific representation [J].

Agarwal, Basant .

MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (06) :8899-8920

[2]

Baccianella S, 2010, LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION

[3]

Beysolow T., 2018, Applied Natural Language Processing with Python: Implementing Machine Learning and Deep Learning Algorithms for Natural Language Processing

[4]

Boyd A., 2022, explosion/spaCy: v2.3.9: Compatibility with NumPy v1.24+

[5]

Cambria E, 2018, AAAI CONF ARTIF INTE, P1795

[6]

Chen K.-J., 2005, P ONTOLEX 2005 ONT L

[7] Sentiment analysis on stock social media for stock price movement prediction [J].

Derakhshan, Ali ;

Beigy, Hamid .

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2019, 85 :569-578

[8] Systematic analysis and review of stock market prediction techniques [J].

Gandhmal, Dattatray P. ;

Kumar, K. .

COMPUTER SCIENCE REVIEW, 2019, 34

[9]

Guida Tony, 2019, BIG DATA MACHINE LEA

[10]

Gupta I, 2022, Arxiv, DOI [arXiv:2203.08143, 10.48550/arXiv.2203.08143, DOI 10.48550/ARXIV.2203.08143]

← 1 2 3 4 →