Training Data Optimization Strategy for Multiclass Text Classification

Cited by: 0
Authors
Arusada, Muhammad Diaphan Nizam [1]
Putri, Novi Amalia Santika [1]
Alamsyah, Andry [1]
Affiliations
[1] Telkom Univ, Sch Business & Econ, Bandung, Indonesia
Source
2017 5TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICOIC7) | 2017
Keywords
Big Data; Training Data; Twitter; Multiclass Classification; Customer Complaint; Text Classification; Support Vector Machine; Naive Bayes
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification code
081104; 0812; 0835; 1405
Abstract
Big data is now widespread across social media, which gives businesses a good opportunity to obtain information in real time. Because social media data is unstructured, it must be processed beforehand. Machine learning needs proper training data for a classification model to perform accurately. Achieving this requires adequate domain knowledge and the right strategy for building optimal training data. This paper presents a strategy for building optimal training data, using customer complaint data from Twitter. We use both Naive Bayes and Support Vector Machine as classifiers. The experimental results show that our training data optimization strategy gives good performance for a multiclass text classification model.
Pages: 5