The Impact of Big Data Quality on Sentiment Analysis Approaches

被引:8
作者
El Alaoui, Imane [1 ]
Gahi, Youssef [2 ]
机构
[1] Univ Ibn Tofail, Lab Syst Telecommun & Ingn Decis, LASTID, Kenitra, Morocco
[2] Univ Ibn Tofail, Ecole Natl Sci Appl, LGS, Kenitra, Morocco
来源
10TH INT CONF ON EMERGING UBIQUITOUS SYST AND PERVAS NETWORKS (EUSPN-2019) / THE 9TH INT CONF ON CURRENT AND FUTURE TRENDS OF INFORMAT AND COMMUN TECHNOLOGIES IN HEALTHCARE (ICTH-2019) / AFFILIATED WORKOPS | 2019年 / 160卷
关键词
Big Data Quality Metrics; Big Data Value Chain; Big Data; Big Social Data; Sentiment Analysis; Opinion Mining;
D O I
10.1016/j.procs.2019.11.007
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Human beings share their good or bad opinions about subjects, products, and services through internet and social networks. The ability to effectively analyze this kind of information is now seen as a key competitive advantage to better inform decisions. In order to do so, organizations employ Sentiment Analysis (SA) techniques on these data. However, the usage of social media around the world is ever-increasing, which considerably accelerates massive data generation and makes traditional SA systems unable to deliver useful insights. Such volume of data can be efficiently analyzed using the combination of SA techniques and Big Data technologies. In fact, big data is not a luxury but an essential necessary to make valuable predictions. However, there are some challenges associated with big data such as quality that could highly affect the SA systems' accuracy that use huge volume of data. Thus, the quality aspect should be addressed in order to build reliable and credible systems. For this, the goal of our research work is to consider Big Data Quality Metrics (BDQM) in SA that rely of big data. In this paper, we first highlight the most eloquent BDQM that should be considered throughout the Big Data Value Chain (BDVC) in any big data project. Then, we measure the impact of BDQM on a novel SA method accuracy in a real case study by giving simulation results. (C) 2019 The Authors. Published by Elsevier B.V.
引用
收藏
页码:803 / 810
页数:8
相关论文
共 28 条
[1]  
Alaoui I. E., 2019, 2019 INT C BIG DAT E
[2]  
Alrubaian M., 2016, CONCURRENCY COMPUTAT, P29
[3]   Credibility in Online Social Networks: A Survey [J].
Alrubaian, Majed ;
Al-Qurishi, Muhammad ;
Alamri, Atif ;
Al-Rakhami, Mabrook ;
Hassan, Mohammad Mehedi ;
Fortino, Giancarlo .
IEEE ACCESS, 2019, 7 :2828-2855
[4]   A Credibility Analysis System for Assessing Information on Twitter [J].
Alrubaian, Majed ;
Al-Qurishi, Muhammad ;
Hassan, Mohammad Mehedi ;
Alamri, Atif .
IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2018, 15 (04) :661-674
[5]  
[Anonymous], P PAC AS C INF SYST
[6]  
[Anonymous], 2012, P 2012 ACM INT C INT, DOI DOI 10.1145/2166966.2166998
[7]  
[Anonymous], IEEE ACCESS
[8]  
[Anonymous], 2015, DATA SCI J
[9]  
BIFFIGNANDI S, 2018, INT TOT SURV ERR WOR
[10]  
Canini K. R., 2011, Proceedings of the 2011 IEEE Third International Conference on Privacy, Security, Risk and Trust and IEEE Third International Conference on Social Computing (PASSAT/SocialCom 2011), P1, DOI 10.1109/PASSAT/SocialCom.2011.91