Semi-supervised learning with generative model for sentiment classification of stock messages

被引:17
作者
Duan, Jiangjiao [1 ]
Luo, Banghui [2 ]
Zeng, Jianping [2 ,3 ]
机构
[1] Univ Shanghai Sci & Technol, Business Sch, Shanghai 200093, Peoples R China
[2] Fudan Univ, Sch Comp Sci, Shanghai 200433, Peoples R China
[3] Minist Educ, Engn Res Ctr Cyber Secur Auditing & Monitoring, Shanghai 200433, Peoples R China
关键词
Sentiment analysis; Generative model; Semi-supervised learning; Stock message board; MICROBLOGGING DATA;
D O I
10.1016/j.eswa.2020.113540
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Classification of investors' sentiments in stock message boards has attracted a great deal of attention. Since the messages are usually short, we propose a semi-supervised learning method to make full use of the features in both train and test messages. The generative emotion model takes message, emotion and words into consideration simultaneously. Based on the facts that words are of different ability in discriminating sentiments, they are categorized into three classes in the model with different emotion strength. Training the generative model can transform the messages into emotion vectors which finally feeds to a sentiment classifier. The experiment results show that the proposed model and learning method are efficient for modeling sentiment in short text, and by properly selecting the amount of train data and the percent of test samples, we can achieve higher classification accuracy than traditional ones. The results indicate that the generative model is effective for short message sentiment classification, and provides a significant approach for the implementation of semi-supervised learning which is a typical expert and intelligent information processing method. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:9
相关论文
共 28 条
[1]  
[Anonymous], 2011, P ANN ACM SIGIR INT
[2]   Can Twitter Help Predict Firm-Level Earnings and Stock Returns? [J].
Bartov, Eli ;
Faurel, Lucile ;
Mohanram, Partha S. .
ACCOUNTING REVIEW, 2018, 93 (03) :25-57
[3]  
Bianchini M., 2016, C IT ASS ART INT
[4]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[5]   PAN-CULTURAL ELEMENTS IN FACIAL DISPLAYS OF EMOTION [J].
EKMAN, P ;
SORENSON, ER ;
FRIESEN, WV .
SCIENCE, 1969, 164 (3875) :86-&
[6]  
Hasan AA, 2018, INT CONF ELECTRO INF, P23, DOI 10.1109/EIT.2018.8500292
[7]  
Iwata T, 2009, 21ST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-09), PROCEEDINGS, P1427
[8]  
Li F., 2010, P AAAI
[9]  
Li Q., 2017, P 21 C COMPUTATIONAL
[10]  
Lin C, 2009, Proceedings of the 18th ACM conference on information and knowledge management, DOI [10.1145/1645953.1646003, DOI 10.1145/1645953.1646003]