Sentiment Analysis of Twitter Data Using Machine Learning Techniques and Scikit-learn

被引:8
作者
Elbagir, Shihab [1 ]
Yang, Jing [1 ]
机构
[1] Harbin Engn Univ, Coll Comp Sci & Technol, Harbin, Heilongjiang, Peoples R China
来源
2018 INTERNATIONAL CONFERENCE ON ALGORITHMS, COMPUTING AND ARTIFICIAL INTELLIGENCE (ACAI 2018) | 2018年
基金
中国国家自然科学基金;
关键词
Twitter; sentiment analysis; Machine Learning Techniques; Scikit-learn;
D O I
10.1145/3302425.3302492
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment analysis of Twitter data is an area that has experienced significant growth in recent years. The ability to identify sentiment from tweets using machine learning techniques has attracted researchers because of the simple efficiency of machine learning techniques. This paper tackles the use of machine learning algorithms and Scikit-learn in sentiment analysis of Twitter data. To do this, we perform analyses on Twitter datasets made publicly available by NLTK Corpora and create an efficient feature by using a feature extraction technique. We train and test various machine learning classifiers such as MultinomialNB, BernoulliNB, LogisticRegression, SGD classifier, SVC, LinearSVC, and NuSVC. Experimental results demonstrate that BernoulliNB, LogisticRegression, and SGD classifier reached accuracy as high as 75%.
引用
收藏
页数:5
相关论文
共 9 条
[1]  
[Anonymous], 2002, P 8 ACM SIGKDD INT C
[2]  
Cabanlit MA, 2014, 5TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS AND APPLICATIONS, IISA 2014, P94, DOI 10.1109/IISA.2014.6878767
[3]  
Manning ChristopherD., 2008, Introduction to Information Retrieval, P234
[4]  
OConnor B., 2010, Icwsm, V11, P1, DOI [https://doi.org/10.1609/icwsm.v4i1.14031, DOI 10.1609/ICWSM.V4I1.14031]
[5]  
Pedregosa F, 2011, J MACH LEARN RES, V12, P2825
[6]  
Schmidt M, 2017, MATH PROGRAM, V162, P83, DOI 10.1007/s10107-016-1030-6
[7]   STATISTICAL INTERPRETATION OF TERM SPECIFICITY AND ITS APPLICATION IN RETRIEVAL [J].
SPARCKJONES, K .
JOURNAL OF DOCUMENTATION, 1972, 28 (01) :11-+
[8]  
Tao K, 2014, IEEE INT CONF BIG DA, P421, DOI 10.1109/BigData.2014.7004259
[9]  
Wu TF, 2004, J MACH LEARN RES, V5, P975