Using VADER sentiment and SVM for predicting customer response sentiment

被引:130
作者
Borg, Anton [1 ]
Boldt, Martin [1 ]
机构
[1] Blekinge Inst Technol, S-37179 Karlskrona, Sweden
关键词
Sentiment prediction; VADER sentiment; Supervised classification; Customer support; E-mail sentiment analysis; SVM; CLASSIFICATION;
D O I
10.1016/j.eswa.2020.113746
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Customer support is important to corporate operations, which involves dealing with disgruntled customer and content customers that can have different requirements. As such, it is important to quickly extract the sentiment of support errands. In this study we investigate sentiment analysis in customer support for a large Swedish Telecom corporation. The data set consists of 168,010 e-mails divided into 69,900 conversation threads without any sentiment information available. Therefore, VADER sentiment is used together with a Swedish sentiment lexicon in order to provide initial labeling of the e-mails. The e-mail content and sentiment labels are then used to train two Support Vector Machine models in extracting/classifying the sentiment of e-mails. Further, the ability to predict sentiment of not-yet-seen e-mail responses is investigated. Experimental results show that the LinearSVM model was able to extract sentiment with a mean F-1-score of 0.834 and mean AUC of 0.896. Moreover, the LinearSVM algorithm was also able to predict the sentiment of an e-mail one step ahead in the thread (based on the text in the an already sent e-mail) with a mean F-1-score of 0.688 and the mean AUC of 0.805. The results indicate a predictable pattern in e-mail conversation that enables predicting the sentiment of a not-yet-seen e-mail. This can be used e.g. to prepare particular actions for customers that are likely to have a negative response. It can also provide feedback on possible sentiment reactions to customer support e-mails. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:11
相关论文
共 29 条
[1]   Profit optimization in multi-service cognitive mesh network using machine learning [J].
Alsarhan, Ayoub ;
Agarwal, Anjali .
EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2011,
[2]  
[Anonymous], 2012, Machine Learning, DOI DOI 10.1017/CBO9780511973000
[3]   Letters by phone or speech by other means: the linguistics of email [J].
Baron, NS .
LANGUAGE & COMMUNICATION, 1998, 18 (02) :133-170
[4]   Opinion Mining of Movie Review using Hybrid Method of Support Vector Machine and Particle Swarm Optimization [J].
Basari, Abd Samad Hasan ;
Hussin, Burairah ;
Ananta, I. Gede Pramudya ;
Zeniarja, Junta .
MALAYSIAN TECHNICAL UNIVERSITIES CONFERENCE ON ENGINEERING & TECHNOLOGY 2012 (MUCET 2012), 2013, 53 :453-462
[5]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[6]   A Lexicon-Enhanced Method for Sentiment Classification: An Experiment on Online Product Reviews [J].
Dang, Yan ;
Zhang, Yulei ;
Chen, Hsinchun .
IEEE INTELLIGENT SYSTEMS, 2010, 25 (04) :46-53
[7]  
Demsar J, 2006, J MACH LEARN RES, V7, P1
[8]  
Fan RE, 2008, J MACH LEARN RES, V9, P1871
[9]  
Fawcett T., 2004, Machine Learning, V31, P1
[10]  
Feldman R., 2006, TEXT MINING HDB ADV