A decision tree using ID3 algorithm for English semantic analysis

被引:30
作者
Phu V.N. [1 ]
Tran V.T.N. [2 ]
Chau V.T.N. [3 ]
Dat N.D. [4 ]
Duy K.L.D. [5 ]
机构
[1] Institute of Research and Development, Duy Tan University - DTU, Da Nang
[2] School of Industrial Management (SIM), Ho Chi Minh City University of Technology - HCMUT, Vietnam National University, Ho Chi Minh City
[3] Computer Science & Engineering (CSE), Ho Chi Minh City University of Technology - HCMUT, Vietnam National University, Ho Chi Minh City
[4] Faculty of Information Technology, Ly Tu Trong Technical College, Ho Chi Minh City
[5] Faculty of Information Technology, Ho Chi Minh City University of Foreign Languages, Ho Chi Minh City
关键词
Decision tree; English document opinion mining; English sentiment classification; ID3; algorithm; id3; Sentiment classification;
D O I
10.1007/s10772-017-9429-x
中图分类号
学科分类号
摘要
Natural language processing has been studied for many years, and it has been applied to many researches and commercial applications. A new model is proposed in this paper, and is used in the English document-level emotional classification. In this survey, we proposed a new model by using an ID3 algorithm of a decision tree to classify semantics (positive, negative, and neutral) for the English documents. The semantic classification of our model is based on many rules which are generated by applying the ID3 algorithm to 115,000 English sentences of our English training data set. We test our new model on the English testing data set including 25,000 English documents, and achieve 63.6% accuracy of sentiment classification results. © 2017, Springer Science+Business Media, LLC.
引用
收藏
页码:593 / 613
页数:20
相关论文
共 53 条
  • [1] Agarwal B., Mittal N., Semantic orientation-based approach for sentiment analysis, Prominent Feature Extraction for Sentiment Analysis, (2016)
  • [2] Agarwal B., Mittal N., Machine learning approach for sentiment analysis, Prominent Feature Extraction for Sentiment Analysis, (2016)
  • [3] Ahmed S., Danti A., Effective sentimental analysis and opinion mining of web reviews using rule based classifiers, Computational Intelligence in Data Mining, (2016)
  • [4] Baldwin J.F., Lawry J., Martin T.P., A mass assignment based ID3 algorithm for decision tree induction, International Journal of Intelligent Systems, (1997)
  • [5] Canuto S., Goncalves M.A., Benevenuto F., Exploiting new sentiment-based meta-level features for effective sentiment analysis, Proceedings of the ninth ACM International conference on web search and data mining (WSDM ‘16, pp. 53-62, (2016)
  • [6] Cendrowska J., PRISM: An algorithm for inducing modular rules, International Journal of Man-Machine Studies, 27, 4, pp. 349-370, (1987)
  • [7] Chaovalit P., Zhou L., Movie review mining: a comparison between supervised and unsupervised classification approaches, (2005)
  • [8] Cheng J., Fayyad U.M., Irani K.B., Qian Z., Improved decision trees: A generalized version of ID3. In Proceedings of the fifth international conference on machine learning, (1988)
  • [9] Cios K.J., Liu N., A machine learning method for generation of a neural network architecture: A continuous ID3 algorithm, IEEE Transactions on Neural Networks, 3, 2, pp. 280-291, (2002)
  • [10] Cios K.J., Sztandera L.M., Continuous ID3 algorithm with fuzzy entropy measures, IEEE international conference on fuzzy systems, pp. 469-476, (1992)