Assessment of Sentiment Analysis Using Information Gain Based Feature Selection Approach

被引:0
作者
Madhumathi, R. [1 ]
Kowshalya, A. Meena [2 ]
Shruthi, R. [1 ]
机构
[1] Sri Ramakrishna Engn Coll, Dept Comp Sci & Engn, Coimbatore, Tamil Nadu, India
[2] Govt Coll Technol, Coimbatore, Tamil Nadu, India
来源
COMPUTER SYSTEMS SCIENCE AND ENGINEERING | 2022年 / 43卷 / 02期
关键词
Sentiment analysis; sentence level; document level; feature level; information gain;
D O I
10.32604/csse.2022.023568
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Sentiment analysis is the process of determining the intention or emotion behind an article. The subjective information from the context is analyzed by the sentimental analysis of the people's opinion. The data that is analyzed quantifies the reactions or sentiments and reveals the information's contextual polarity. In social behavior, sentiment can be thought of as a latent variable. Measuring and comprehending this behavior could help us to better understand the social issues. Because sentiments are domain specific, sentimental analysis in a specific context is critical in any real-world scenario. Textual sentiment analysis is done in sentence, document level and feature levels. This work introduces a new Information Gain based Feature Selection (IGbFS) algorithm for selecting highly correlated features eliminating irrelevant and redundant ones. Extensive textual sentiment analysis on sentence, document and feature levels are performed by exploiting the proposed Information Gain based Feature Selection algorithm. The analysis is done based on the datasets from Cornell and Kaggle repositories. When compared to existing baseline classifiers, the suggested Information Gain based classifier resulted in an increased accuracy of 96% for document, 97.4% for sentence and 98.5% for feature levels respectively. Also, the proposed method is tested with IMDB, Yelp 2013 and Yelp 2014 datasets. Experimental results for these high dimensional datasets give increased accuracy of 95%, 96% and 98% for the proposed Information Gain based classifier for document, sentence and feature levels respectively compared to existing baseline classifiers.
引用
收藏
页码:849 / 860
页数:12
相关论文
共 25 条
  • [1] Agarwal Apoorv., 2009, Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics, EACL '09, P24
  • [2] [Anonymous], 2009, P 14 AUSTRALASIAN DO
  • [3] Behdenna S., 2018, EAI endorsed transactions on context-aware systems and applications, V4, P2, DOI [10.4108/eai.14-3-2018.154339, DOI 10.4108/EAI.14-3-2018.154339]
  • [4] Bhatia P., 2015, P 2015 C EMPIRICAL M, P2212, DOI [10.18653/v1/D15-1263, DOI 10.18653/V1/D15-1263]
  • [5] de Marneffe MC., 2008, COLING 2008 P WORKSH, P1
  • [6] Dos Santos C., 2014, P COLING 2014 25 INT, P69
  • [7] Sentiment Analysis in Social Media and Its Application: Systematic Literature Review
    Drus, Zulfadzli
    Khalid, Haliyana
    [J]. FIFTH INFORMATION SYSTEMS INTERNATIONAL CONFERENCE, 2019, 161 : 707 - 714
  • [8] El Rahman Sahar A., 2019, 2019 INT C COMP INF, P1
  • [9] Detecting Clusters/Communities in Social Networks
    Hoffman, Michaela
    Steinley, Douglas
    Gates, Kathleen M.
    Prinstein, Mitchell J.
    Brusco, Michael J.
    [J]. MULTIVARIATE BEHAVIORAL RESEARCH, 2018, 53 (01) : 57 - 73
  • [10] Ikonomakis M., 2005, WSEAS Transactions on Computers, V4, P966