Performance Evaluation of Word and Sentence Embeddings for Finance Headlines Sentiment Analysis

被引:3
作者
Mishev, Kostadin [1 ]
Gjorgjevikj, Ana [1 ]
Stojanov, Riste [1 ]
Mishkovski, Igor [1 ]
Vodenska, Irena [2 ]
Chitkushev, Ljubomir [2 ]
Trajanov, Dimitar [1 ]
机构
[1] Ss Cyril & Methodius Univ Skopje, FCSE, Skopje, North Macedonia
[2] Boston Univ, Boston, MA 02215 USA
来源
ICT INNOVATIONS 2019: BIG DATA PROCESSING AND MINING | 2019年 / 1110卷
关键词
Sentiment analysis; Finance; Deep learning; Word embedding; Sentence embedding;
D O I
10.1007/978-3-030-33110-8_14
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Nowadays, tremendous number of financial online articles are published every day. Numerous natural language processing (NLP) algorithms and methodologies have arose, not only for correct, but also for fast financial sentiment extraction. Currently, word and sentence encoders are popular topic in NLP field, due to their ability to represent them as dense vectors in a continuous real numbers space, referred to as embeddings. These low dimensional embedding vectors are appropriate for deep neural networks (DNN) inputs, and their invention boosted the performance of multiple of NLP tasks. In this paper, we evaluate different word and sentence embeddings in combination with standard machine learning and deep-learning classifiers for financial texts sentiment extraction. Our evaluation shows the BiGRU+Attention architecture with word embedding as features, give the best score in overall evaluation.
引用
收藏
页码:161 / 172
页数:12
相关论文
共 21 条
[1]   Is all that talk just noise? The information content of Internet stock message boards [J].
Antweiler, W ;
Frank, MZ .
JOURNAL OF FINANCE, 2004, 59 (03) :1259-1294
[2]  
Ba J., 2014, Multiple object recognition with visual attention
[3]   Investor sentiment in the stock market [J].
Baker, Malcolm ;
Wurgler, Jeffrey .
JOURNAL OF ECONOMIC PERSPECTIVES, 2007, 21 (02) :129-151
[4]  
Bojanowski Piotr, 2017, Trans. Assoc. Comput. Linguist., V5, P135, DOI DOI 10.1162/TACL_A_00051
[5]   Sentiment Analysis Is a Big Suitcase [J].
Cambria, Erik ;
Poria, Soujanya ;
Gelbukh, Alexander ;
Thelwall, Mike .
IEEE INTELLIGENT SYSTEMS, 2017, 32 (06) :74-80
[6]  
Cer D, 2018, CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018): PROCEEDINGS OF SYSTEM DEMONSTRATIONS, P169
[7]  
Chung J, 2014, ARXIV
[8]  
Cortis A., 2017, P 11 INT WORKSHOP SE, P519
[9]   FINDING STRUCTURE IN TIME [J].
ELMAN, JL .
COGNITIVE SCIENCE, 1990, 14 (02) :179-211
[10]  
Hochreiter S, 1997, ADV NEUR IN, V9, P473