Combining User-based and Global Lexicon Features for Sentiment Analysis in Twitter

被引:0
作者
Jin, Zhou [1 ]
Yang, Yujiu [1 ]
Bao, Xianyu [2 ]
Huang, Biqing [3 ]
机构
[1] Tsinghua Univ, Grad Sch Shenzhen, Key Lab Broadband Network & Multimedia, Shenzhen 518055, Peoples R China
[2] Shenzhen Acad inspect & Quarantine, Shenzhen, Guangdong, Peoples R China
[3] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
来源
2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2016年
关键词
sentiment analysis; feature construction; user-based features; global features; rule-based fusing;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Generally speaking, sentiment lexicons employed in the majority of current sentiment analysis systems are trained globally from public data stream source or other large independent corpus. However, sentiments are rather subjective and personal states of mind that the individuality and diversity of characteristics, particular writing habit and idiolect could play a crucial role in the judgment of sentiment expressed by a specific user. In this paper, we present a novel feature construction method to combine user-based and global lexicon features in sentiment analysis for short social media text. After the creation of user-based sentiment lexicons from user-timeline corpus, a rule-based fusing approach is adopted subsequently to generate user-based lexicon features in combination with general lexicon features. Experiments show that user-based features may capture potential user preferences hence adjusting the bias caused by representing an individual's sentiment with an averaged lexicon score, and our proposed method yield better results in comparison with some of the state-of-the-art sentiment analysis systems in twitter.
引用
收藏
页码:4525 / 4532
页数:8
相关论文
共 27 条
[1]  
Agarwal Apoorv., 2009, Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics, EACL '09, P24
[2]  
[Anonymous], P 7 INT WORKSH SEM E
[3]  
[Anonymous], 2015, P 9 INT WORKSH SEM E
[4]  
[Anonymous], 2010, P NAACL HLT 2010 WOR, DOI DOI 10.5555/1860631.1860635
[5]  
[Anonymous], WEBIS ENSEM IN PRESS
[6]  
[Anonymous], 1966, The general inquirer: A computer approach to content analysis
[7]  
[Anonymous], UNSUPERVISE IN PRESS
[8]  
Baccianella S., 2010, LREC 10, V10, P2200
[9]  
BIRD S, 2006, P COLING ACL INT PRE, P69, DOI DOI 10.3115/1225403.1225421
[10]  
Bo Pang, 2008, Foundations and Trends in Information Retrieval, V2, P1, DOI 10.1561/1500000001