Hierarchical Classification in Text Mining for Sentiment Analysis

被引:4
作者
Li, Jinyan [1 ]
Fong, Simon [1 ]
Zhuang, Yan [1 ]
Khoury, Richard [2 ]
机构
[1] Univ Macau, Dept Comp & Informat Sci, Taipa, Macau, Peoples R China
[2] Lakehead Univ, Dept Software Engn, Thunder Bay, ON, Canada
来源
2014 INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE ISCMI 2014 | 2014年
关键词
sentiment analysis; text mining; classification;
D O I
10.1109/ISCMI.2014.37
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment analysis in text mining is known to be a challenging task. Sentiment is subtly reflected by the tone, affective state or emotion of a writer's expression in words. Conventional text mining techniques which are based on keyword frequency counting usually run short of accurately detecting such subjective information implied in the text. In this paper we evaluated several popular classification algorithms, along with three filtering schemes. The filtering schemes progressively shrink the original dataset, with respect to the contextual polarity and frequent terms of a document. In general the proposed approach is coined hierarchical classification. The effects of the approach in different combination of classification algorithms and filtering schemes are discussed over three sets of controversial online news articles where binary and multi-class classifications are applied.
引用
收藏
页码:46 / 51
页数:6
相关论文
共 10 条
  • [1] [Anonymous], 2005, P 14 ACM INT C INF
  • [2] [Anonymous], 2007, Hlt-naacl
  • [3] [Anonymous], 2008, THESIS
  • [4] Argamon S., 2007, 3rd Language and Technology Conference, P369, DOI [10.1007/978-3-642-04235-5_19, DOI 10.1007/978-3-642-04235-5_19]
  • [5] Banchs RafaelE., 2012, Text mining with MATLAB
  • [6] Sentiment Anlaysis of Online News using MALLET
    Fong, Simon
    Zhuang, Yan
    Li, Jinyan
    Khoury, Richard
    [J]. 2013 INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL AND BUSINESS INTELLIGENCE (ISCBI), 2013, : 301 - 304
  • [7] Hernandez Laritza, 2009, 14 PORT C ART INT EP, P525
  • [8] Leskovec J, 2014, MINING OF MASSIVE DATASETS, 2ND EDITION, P1
  • [9] Thumbs up? Sentiment classification using machine learning techniques
    Pang, B
    Lee, L
    Vaithyanathan, S
    [J]. PROCEEDINGS OF THE 2002 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, 2002, : 79 - 86
  • [10] Turney PD, 2002, 40TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, P417