FCNB: Fuzzy Correlative Naive Bayes Classifier with MapReduce Framework for Big Data Classification

被引:10
作者
Banchhor, Chitrakant [1 ]
Srinivasu, N. [1 ]
机构
[1] Koneru Lakshmaiah Educ Fdn, Comp Sci & Engn Dept, Guntur, Andhra Pradesh, India
关键词
Big data; classification; correlative naive Bayes classifier; fuzzy theory; MapReduce; MAP REDUCE SOLUTION; ALGORITHM; MACHINE;
D O I
10.1515/jisys-2018-0020
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The term "big data" means a large amount of data, and big data management refers to the efficient handling, organization, or use of large volumes of structured and unstructured data belonging to an organization. Due to the gradual availability of plenty of raw data, the knowledge extraction process from big data is a very difficult task for most of the classical data mining and machine learning tools. In a previous paper, the correlative naive Bayes (CNB) classifier was developed for big data classification. This work incorporates the fuzzy theory along with the CNB classifier to develop the fuzzy CNB (FCNB) classifier. The proposed FCNB classifier solves the big data classification problem by using the MapReduce framework and thus achieves improved classification results. Initially, the database is converted to the probabilistic index table, in which data and attributes are presented in rows and columns, respectively. Then, the membership degree of the unique symbols present in each attribute of data is found. Finally, the proposed FCNB classifier finds the class of data based on training information. The simulation of the proposed FCNB classifier uses the localization and skin segmentation datasets for the purpose of experimentation. The results of the proposed FCNB classifier are analyzed based on the metrics, such as sensitivity, specificity, and accuracy, and compared with the various existing works.
引用
收藏
页码:994 / 1006
页数:13
相关论文
共 27 条
[11]   Extreme learning machine: Theory and applications [J].
Huang, Guang-Bin ;
Zhu, Qin-Yu ;
Siew, Chee-Kheong .
NEUROCOMPUTING, 2006, 70 (1-3) :489-501
[12]   Support Vector Machine Classifier with Pinball Loss [J].
Huang, Xiaolin ;
Shi, Lei ;
Suykens, Johan A. K. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (05) :984-997
[13]   De-Bruijn graph with MapReduce framework towards metagenomic data classification [J].
Kamal M.S. ;
Parvin S. ;
Ashour A.S. ;
Shi F. ;
Dey N. .
International Journal of Information Technology, 2017, 9 (1) :59-75
[14]   An Ensemble Random Forest Algorithm for Insurance Big Data Analysis [J].
Lin, Weiwei ;
Wu, Ziming ;
Lin, Longxin ;
Wen, Angzhan ;
Li, Jin .
IEEE ACCESS, 2017, 5 :16568-16575
[15]   Cost-sensitive linguistic fuzzy rule based classification systems under the MapReduce framework for imbalanced big data [J].
Lopez, Victoria ;
del Rio, Sara ;
Manuel Benitez, Jose ;
Herrera, Francisco .
FUZZY SETS AND SYSTEMS, 2015, 258 :5-38
[16]  
López V, 2014, IEEE INT FUZZY SYST, P1905, DOI 10.1109/FUZZ-IEEE.2014.6891753
[17]   MapReduce-based fuzzy c-means clustering algorithm: implementation and scalability [J].
Ludwig, Simone A. .
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2015, 6 (06) :923-934
[18]   A MapReduce-based k-Nearest Neighbor Approach for Big Data Classification [J].
Maillo, Jesus ;
Triguero, Isaac ;
Herrera, Francisco .
2015 IEEE TRUSTCOM/BIGDATASE/ISPA, VOL 2, 2015, :167-172
[19]   Grey Wolf Optimizer [J].
Mirjalili, Seyedali ;
Mirjalili, Seyed Mohammad ;
Lewis, Andrew .
ADVANCES IN ENGINEERING SOFTWARE, 2014, 69 :46-61
[20]   Enriched Over-Sampling Techniques for Improving Classification of Imbalanced Big Data [J].
Patil, Sachin Subhash ;
Sonavane, Shefali Pratap .
2017 THIRD IEEE INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (IEEE BIGDATASERVICE 2017), 2017, :1-10