Performance Analysis of Machine Learning Algorithms for Big Data Classification: ML and Al-Based Algorithms for Big Data Analysis

被引:38
作者
Punia, Sanjeev Kumar [1 ]
Kumar, Manoj [2 ]
Stephan, Thompson [3 ]
Deverajan, Ganesh Gopal [4 ]
Patan, Rizwan [5 ]
机构
[1] JIMS Engn Management Tech Campus, Greater Noida, India
[2] Univ Petr & Energy Studies UPES, Sch Comp Sci, Dehra Dun, Uttarakhand, India
[3] MS Ramaiah Univ Appl Sci, Fac Engn & Technol, Dept Comp Sci & Engn, Bangalore, Noida, India
[4] Galgotias Univ, Greater Noida, India
[5] Velagapudi Ramakrishna Siddhartha Engn Coll, Vijayawada, India
关键词
Big Data; Decision Trees (DT); Floating Centroid Method (FCM); Neural Networks (NN); Semantic Web Service Classification; Spark Framework; Support Vector Machines (SVM); Twitter Streaming API;
D O I
10.4018/IJEHMC.20210701.oa4
中图分类号
R-058 [];
学科分类号
摘要
In broad, three machine learning classification algorithms are used to discover correlations, hidden patterns, and other useful information from different data sets known as big data. Today, Twitter, Facebook, Instagram, and many other social media networks are used to collect the unstructured data. The conversion of unstructured data into structured data or meaningful information is a very tedious task. The different machine learning classification algorithms are used to convert unstructured data into structured data. In this paper, the authors first collect the unstructured research data from a frequently used social media network (i.e., Twitter) by using a Twitter application program interface (API) stream. Secondly, they implement different machine classification algorithms (supervised, unsupervised, and reinforcement) like decision trees (DT), neural networks (NN), support vector machines (SVM), naive Bayes (NB), linear regression (LR), and k-nearest neighbor (K-NN) from the collected research data set. The comparison of different machine learning classification algorithms is concluded.
引用
收藏
页码:60 / 75
页数:16
相关论文
共 17 条
[1]   The application of machine learning techniques for driving behavior analysis: A conceptual framework and a systematic literature review [J].
Abou Elassad, Zouhair Elamrani ;
Mousannif, Hajar ;
Al Moatassime, Hassan ;
Karkouch, Aimad .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 87
[2]  
Al-Dlaeen D, 2014, INT CONF COMP SCI, P122, DOI 10.1109/CSIT.2014.6805989
[3]  
Alty SR, 2003, Proceedings of the 46th IEEE International Midwest Symposium on Circuits & Systems, Vols 1-3, P376
[4]  
[Anonymous], 2010, CANCER ETIOL DIAGN T
[5]  
Benito S., 2008, ENCY HEALTHCARE INFO, P1277
[6]  
Goyal A, 2015, 2015 INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION & AUTOMATION (ICCCA), P16, DOI 10.1109/CCAA.2015.7148364
[7]  
Grover P, 2015, 2015 GLOBAL CONFERENCE ON COMMUNICATION TECHNOLOGIES (GCCT), P764
[8]  
Hashem H, 2016, 2016 INTERNATIONAL CONFERENCE ON MILITARY COMMUNICATIONS AND INFORMATION SYSTEMS (ICMCIS)
[9]  
Hassan M, 2014, IEEE INT SYMP CIRC S, P2579, DOI 10.1109/ISCAS.2014.6865700
[10]  
Jiang QW, 2016, INT CONF CLOUD COMPU, P160, DOI 10.1109/CCIS.2016.7790245