Detection of temporality at discourse level on financial news by combining Natural Language Processing and Machine Learning

被引:10
作者
Garcia-Mendez, Silvia [1 ]
de Arriba-Perez, Francisco [1 ]
Barros-Vila, Ana [1 ]
Gonzalez-Castano, Francisco J. [1 ]
机构
[1] Univ Vigo, Informat Technol Grp, atlanTTic, Vigo 36310, Spain
关键词
Computational Linguistics; Financial news; Knowledge extraction; Machine Learning; Natural Language Processing; Temporal analysis; STOCK; PREPOSITIONS;
D O I
10.1016/j.eswa.2022.116648
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Finance-related news such as Bloomberg News, CNN Business and Forbes are valuable sources of real data for market screening systems. In news, an expert shares opinions beyond plain technical analyses that include context such as political, sociological and cultural factors. In the same text, the expert often discusses the performance of different assets. Some key statements are mere descriptions of past events while others are predictions. Therefore, understanding the temporality of the key statements in a text is essential to separate context information from valuable predictions. We propose a novel system to detect the temporality of financerelated news at discourse level that combines Natural Language Processing and Machine Learning techniques, and exploits sophisticated features such as syntactic and semantic dependencies. More specifically, we seek to extract the dominant tenses of the main statements, which may be either explicit or implicit. We have tested our system on a labelled dataset of finance-related news annotated by researchers with knowledge in the field. Experimental results reveal a high detection precision compared to an alternative rule-based baseline approach. Ultimately, this research contributes to the state-of-the-art of market screening by identifying predictive knowledge for financial decision making.
引用
收藏
页数:9
相关论文
empty
未找到相关数据