Data Mining for Managing and Using Online Information on Facebook

被引:1
作者
Al Said, Nidal [1 ]
机构
[1] Ajman Univ, Coll Mass Commun, Ajman, U Arab Emirates
关键词
social networks; data mining; classification; accuracy; multinomial naive Bayes classifier;
D O I
10.12720/jait.14.4.769-776
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The problem under the study of this work is investigating data mining algorithms for intelligent analysis of data written in Arabic. The study comprised instead involves several stages, including Data Collection and Pre-Processing; Data Mining Algorithms (Multinomial Naive Bayes Classifier, Naive Bayes Classifier, Support Vector Machine and Modified K-Means); Study Results Processing and Software Implementation. A total of 16,732 Facebook posts written exclusively in Arabic were downloaded. Almost two-thirds of them (namely 11,155 items) were used to train algorithms, while the rest (5577 items) were subject to research. The training data were categorized into five groups based on subjects (politics, entertainment, medicine, science, and religion) with five keywords used for testing in each group. Most posts (4736 items) were related to politics. The most accurate algorithm proved to be the multinomial Naive Bayesian classifier for the maximum number of test data, while the minimum values of this feature were recorded for the Support vector machine. The effectiveness of the multinomial Naive Bayesian classifier algorithm was most remarkable for the maximum amount of data, while the Support Vector Machine was most effective for the minimum amount. The argument's fit score is maximum at 5577 data points for the multinomial Naive Bayesian classifier and 1394 data points for K-means. To improve and refine the results of data mining, the sample must be expanded, adding more data classes and keywords. Other machine learning models, such as deep learning algorithms, could also be used. The significance of investigation is very important because it expands our knowledge about the use of Machine Learning Algorithms to mine Arabic texts on social media platforms.
引用
收藏
页码:769 / 776
页数:8
相关论文
共 44 条
  • [1] Agarkar V. V., 2020, WEBMINING APPL DATA
  • [2] Al-Hagery M. A., INT J INF TECHNOL CO, V5, P1
  • [3] Data Mining and Privacy of Social Network Sites' Users: Implications of the Data Mining Problem
    Al-Saggaf, Yeslam
    Islam, Md Zahidul
    [J]. SCIENCE AND ENGINEERING ETHICS, 2015, 21 (04) : 941 - 966
  • [4] Identifying Fake Facebook Profiles Using Data Mining Techniques
    Albayati, Mohammed Basil
    Altamimi, Ahmad Mousa
    [J]. JOURNAL OF ICT RESEARCH AND APPLICATIONS, 2019, 13 (02) : 107 - 117
  • [5] Applying multi-label techniques in emotion identification of short texts
    Almeida, Alex M. G.
    Cerri, Ricardo
    Paraiso, Emerson Cabrera
    Mantovani, Rafael Gomes
    Barbon Junior, Sylvio
    [J]. NEUROCOMPUTING, 2018, 320 : 35 - 46
  • [6] Awad M., 2015, Efficlearn mach, P39, DOI [http://dx.doi.org/10.1007/978-1-4302-5990-9, DOI 10.1007/978-1-4302-5990-9, 10.1007/978-1-4302-5990-93, DOI 10.1007/978-1-4302-5990-9_3]
  • [7] Bayhaqy A, 2018, INT CONF ORANGE TECH
  • [8] Social big data: Recent achievements and new challenges
    Bello-Orgaz, Gema
    Jung, Jason J.
    Camacho, David
    [J]. INFORMATION FUSION, 2016, 28 : 45 - 59
  • [9] Bozkir AS, 2010, COMM COM INF SC, V96, P145
  • [10] Hierarchical Clustering: Objective Functions and Algorithms
    Cohen-Addad, Vincent
    Kanade, Varun
    Mallmann-Trenn, Frederik
    Mathieu, Claire
    [J]. JOURNAL OF THE ACM, 2019, 66 (04)