Opinion Mining based complex polarity shift pattern handling for improved sentiment classification

被引:1
作者
Japhne, A. [1 ]
Murugeswari, R. [2 ]
机构
[1] Kalasalingam Acad Res & Educ, Dept Comp Applicat, Krishnankoil, Tamil Nadu, India
[2] Kalasalingam Acad Res & Educ, Dept Comp Sci & Engn, Krishnankoil, Tamil Nadu, India
来源
PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT-2020) | 2020年
关键词
Sentiment classification; Polarity shift; Sentiment analysis; Opinion mining;
D O I
10.1109/icict48043.2020.9112565
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The availability of rich volume of information on the web makes sentiment analysis an interesting, informative and impressive task. There is now an increasing desire to mine this information to gain new insights. But how well we extract value from this wealth of information is much important because identifying, accurately analyzing and extracting subjective information from the available text is not an easy task. Hence the purpose of this research work is to effectively handle complex polarity shift patterns such as negations, intensifiers, diminishers and contrast transitions that affect (sometimes, reverse) the polarity of the text for better classification of sentiments and nine classification techniques in supervised learning were used for binary sentiment classification. The most commonly used standard labelled dataset with movie reviews was used. The implementation was done in Python using the libraries of Natural Language Toolkit (NLTK) and Scikit-Learn (sklearn). Grid Search cross validation technique was also used to choose best parameters for each model while training so that the classifier could accurately predict the testing data. The accuracy of sentiment classification task of the original dataset was compared with the one after complex polarity shift pattern handling. The results of this study show an increase in the accuracy up to 3.5% for various classifiers after polarity shift pattern handling. Random Forest classifier which is an ensemble of classifiers tops the list with high accuracy among the other classifiers used.
引用
收藏
页码:323 / 329
页数:7
相关论文
共 32 条
  • [1] [Anonymous], 2005, INT C SYST SCI 2005
  • [2] [Anonymous], 2010, P 23 INT C COMP LING
  • [3] Arya Premnarayan, 2017, IJESC, V7, P4796
  • [4] Bo Pang, 2008, Foundations and Trends in Information Retrieval, V2, P1, DOI 10.1561/1500000001
  • [5] Chauhan P., 2017, INT J RES APPL SCI E, Vv, P80, DOI [10.22214/ijraset.2017.11011., DOI 10.22214/IJRASET.2017.11011, 10.22214/ijraset.2017.11011]
  • [6] Devishree T., 2018, IJEDR, V6, P314
  • [7] Techniques and Applications for Sentiment Analysis
    Feldman, Ronen
    [J]. COMMUNICATIONS OF THE ACM, 2013, 56 (04) : 82 - 89
  • [8] Govindarajan M., 2013, International Journal of Advanced Computer Research, V3, P139
  • [9] hew Daya Mary Mat, 2017, IJCA, V158
  • [10] ini Claudia Diamant, 2018, FUTURE GENERATION CO