An Improved Term Weighting Scheme for Sentiment Classification

被引:0
作者
Zhang, Pu [1 ,2 ]
Wang, Yinghao [1 ,2 ]
Wang, Junxia [1 ,2 ]
Zeng, Xianhua [1 ]
Wang, Yong [3 ]
机构
[1] Chongqing Univ Posts & Telecommun, Coll Comp Sci & Technol, Chongqing, Peoples R China
[2] Chongqing Univ Posts & Telecommun, Chongqing Lab Engn Res Ctr Mobile Internet Data A, Chongqing, Peoples R China
[3] Chongqing Univ Posts & Telecommun, Key Lab Elect Commerce & Logist, Chongqing, Peoples R China
来源
2017 IEEE 2ND ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC) | 2017年
基金
中国国家自然科学基金;
关键词
class contribution; term weighting; supervised learning; sentiment classification;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Supervised learning methods are widely used in text sentiment classification. To acquire high classification performance, the effective and precise term weighting scheme plays a prime and necessary role for classification system. The traditional term weighting schemes often ignore the use of the available labeling information as the prior knowledge, which results the expressed relationships between features and class labels are not accurate and adequate enough. Hence, this paper proposes a term weighting scheme which makes use of class contribution of feature terms, and obtains a scheme which takes into account the class contribution, local distribution and global distribution of feature terms. The scheme expresses the relationships between features and class labels as a class contribution degree factor which is based on posterior probability, then combines it with the factors of frequency and inverse document frequency. Compared with a series of term weighting schemes, experimental results on a widely used dataset show that the proposed scheme can significantly improve the performance of sentiment classification.
引用
收藏
页码:462 / 466
页数:5
相关论文
共 17 条
  • [1] [Anonymous], STANFORD WORD SEGMEN
  • [2] Debole F., 2003, P SAC 2003, P81
  • [3] A study of supervised term weighting scheme for sentiment analysis
    Deng, Zhi-Hong
    Luo, Kun-Hu
    Yu, Hong-Liang
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (07) : 3506 - 3513
  • [4] Supervised and Traditional Term Weighting Methods for Automatic Text Categorization
    Lan, Man
    Tan, Chew Lim
    Su, Jian
    Lu, Yue
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (04) : 721 - 735
  • [5] Analysis of motivation, travel risk, and travel satisfaction of Taiwan undergraduates on work and travel overseas programmes: Developing measurement scales
    Lin, Yi-Hsin
    Lee, Yi-Chun
    Wang, Shih-Chuan
    [J]. TOURISM MANAGEMENT PERSPECTIVES, 2012, 2-3 : 35 - 46
  • [6] [马雯雯 Ma Wenwen], 2013, [计算机应用, Journal of Computer Applications], V33, P2280
  • [7] Maas A., 2011, P 49 ANN M ASS COMP, P142
  • [8] Martineau J., 2009, P ICWSM 2009
  • [9] Paltoglou G, 2010, ACL 2010: 48TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, P1386
  • [10] Thumbs up? Sentiment classification using machine learning techniques
    Pang, B
    Lee, L
    Vaithyanathan, S
    [J]. PROCEEDINGS OF THE 2002 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, 2002, : 79 - 86