Enhancing Machine-Learning Methods for Sentiment Classification of Web Data

被引:0
作者
Wang, Zhaoxia [1 ]
Tong, Victor Joo Chuan [1 ]
Chin, Hoong Chor [2 ]
机构
[1] ASTAR, Social & Cognit Comp Dept, Singapore 138632, Singapore
[2] Natl Univ Singapore, Dept Civil & Environm Engn, Singapore 117578, Singapore
来源
INFORMATION RETRIEVAL TECHNOLOGY, AIRS 2014 | 2014年 / 8870卷
关键词
Emoticon handling; negation dealing; feature selection; hybrid method; machine learning; sentiment classification; Twitter; Web data; FEATURE-SELECTION;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With advances in Web technologies, more and more people are turning to popular social media platforms such as Twitter to express their feelings and opinions on a variety of topics and current issues online. Sentiment analysis of Web data is becoming a fast and effective way of evaluating public opinion and sentiment for use in marketing and social behavioral studies. This research investigates the enhancement techniques in machine-learning methods for sentiment classification of Web data. Feature selection, negation dealing, and emoticon handling are studied in this paper for their ability to improve the performance of machine-learning methods. The range of enhancement techniques is tested using different text data sets, such as tweets and movie reviews. The results show that different enhancement methods can improve classification efficacy and accuracy differently.
引用
收藏
页码:394 / 405
页数:12
相关论文
共 27 条
[1]   Experiments on the Use of Feature Selection and Machine Learning Methods in Automatic Malay Text Categorization [J].
Alshalabi, Hamood ;
Tiun, Sabrina ;
Omar, Nazlia ;
Albared, Mohammed .
4TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATICS (ICEEI 2013), 2013, 11 :748-754
[2]  
[Anonymous], 2010, Proceedings of the 23rdInternational Conference on Computational Linguistics: Posters
[3]   Sentiment analysis of twitter audiences: Measuring the positive or negative influence of popular twitterers [J].
Bae, Younggue ;
Lee, Hongchul .
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2012, 63 (12) :2521-2535
[4]   A machine learning approach to sentiment analysis in multilingual Web texts [J].
Boiy, Erik ;
Moens, Marie-Francine .
INFORMATION RETRIEVAL, 2009, 12 (05) :526-558
[5]   Comparison of support vector machine and artificial neural network systems for drug/nondrug classification [J].
Byvatov, E ;
Fechner, U ;
Sadowski, J ;
Schneider, G .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2003, 43 (06) :1882-1889
[6]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[7]   Discovering business intelligence from online product reviews: A rule-induction framework [J].
Chung, Wingyan ;
Tseng, Tzu-Liang .
EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (15) :11870-11879
[8]   Feature selection for sentiment analysis based on content and syntax models [J].
Duric, Adnan ;
Song, Fei .
DECISION SUPPORT SYSTEMS, 2012, 53 (04) :704-711
[9]   Twitter brand sentiment analysis: A hybrid system using n-gram analysis and dynamic artificial neural network [J].
Ghiassi, M. ;
Skinner, J. ;
Zimbra, D. .
EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (16) :6266-6282
[10]  
Glorot X., 2011, INT C MACH LEARN, P513