Hybrid Approach Framework for Sentiment Classification on Microblogging

Cited by: 0
Authors
Orkphol, Korawit [1]
Yang, Wu [1]
Wang, Wei [1]
Zhu, Wenlong [1]
Affiliations
[1] Harbin Engn Univ, Dept Comp Sci & Technol, Informat Secur Res Ctr, Harbin 150001, Heilongjiang, Peoples R China
Source
2017 COMPUTING CONFERENCE | 2017
Keywords
Sentiment Classification; Opinion Mining; Machine Learning; Microblogging; SentiWordnet; REVIEWS;
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202 ;
Abstract
Microblogging is widely used to express opinions about an entity, and knowing the sentiment polarity of those opinions benefits decision making, planning, visualization, and similar tasks. Outdated training data, together with the short and noisy nature of microblog posts, leads to low classification accuracy. Existing approaches require human effort to manually label large amounts of training data. To tackle these problems, we propose a framework that combines a lexicon-based approach with a machine learning approach. SentiWordnet is used to label the training data automatically, and a Support Vector Machine is then used for sentiment classification. We study two scoring mechanisms for labeling the training data: Word Sense Disambiguation and Non Word Sense Disambiguation. The framework also uses MapReduce to process large datasets. The results show that Non Word Sense Disambiguation is optimal for this framework. The framework is functional, more automated, and requires less human effort.
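The automatic labeling step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the scoring formula, so this is an assumption rather than the authors' method: each word is scored by averaging (positive minus negative) over all of its senses, which is one plausible reading of scoring without Word Sense Disambiguation. The lexicon `TOY_LEXICON`, its values, and the names `word_score` and `label_post` are hypothetical and are not taken from SentiWordnet or the paper.

```python
from statistics import mean

# Hypothetical toy lexicon: word -> list of (positive, negative) scores,
# one pair per word sense. Values are invented for illustration only.
TOY_LEXICON = {
    "good": [(0.75, 0.0), (0.5, 0.0), (0.5, 0.0)],
    "bad": [(0.0, 0.625), (0.0, 0.75)],
    "cold": [(0.0, 0.25), (0.125, 0.5), (0.0, 0.0)],
    "love": [(0.5, 0.0), (0.625, 0.0)],
}


def word_score(word):
    """Average (positive - negative) over every sense of the word.

    No sense is selected from context, so this is a non-WSD style score
    (an assumed formula). Unknown words score 0.0.
    """
    senses = TOY_LEXICON.get(word.lower())
    if not senses:
        return 0.0
    return mean(pos - neg for pos, neg in senses)


def label_post(text, threshold=0.0):
    """Label a short post by summing its word scores."""
    # Naive whitespace tokenization with punctuation stripped from the edges.
    tokens = [t.strip(".,!?#@\"'") for t in text.split()]
    total = sum(word_score(t) for t in tokens if t)
    if total > threshold:
        return "positive"
    if total < -threshold:
        return "negative"
    return "neutral"


if __name__ == "__main__":
    for post in ["I love this phone, so good!",
                 "bad and cold service",
                 "the meeting is at noon"]:
        print(post, "->", label_post(post))
```

Labels produced this way would then serve as training data for the Support Vector Machine classifier; that step, and the MapReduce distribution of the labeling, are not shown here.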
Pages: 893 / 898
Number of pages: 6