FCNB: Fuzzy Correlative Naive Bayes Classifier with MapReduce Framework for Big Data Classification

被引:10
作者
Banchhor, Chitrakant [1 ]
Srinivasu, N. [1 ]
机构
[1] Koneru Lakshmaiah Educ Fdn, Comp Sci & Engn Dept, Guntur, Andhra Pradesh, India
关键词
Big data; classification; correlative naive Bayes classifier; fuzzy theory; MapReduce; MAP REDUCE SOLUTION; ALGORITHM; MACHINE;
D O I
10.1515/jisys-2018-0020
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The term "big data" means a large amount of data, and big data management refers to the efficient handling, organization, or use of large volumes of structured and unstructured data belonging to an organization. Due to the gradual availability of plenty of raw data, the knowledge extraction process from big data is a very difficult task for most of the classical data mining and machine learning tools. In a previous paper, the correlative naive Bayes (CNB) classifier was developed for big data classification. This work incorporates the fuzzy theory along with the CNB classifier to develop the fuzzy CNB (FCNB) classifier. The proposed FCNB classifier solves the big data classification problem by using the MapReduce framework and thus achieves improved classification results. Initially, the database is converted to the probabilistic index table, in which data and attributes are presented in rows and columns, respectively. Then, the membership degree of the unique symbols present in each attribute of data is found. Finally, the proposed FCNB classifier finds the class of data based on training information. The simulation of the proposed FCNB classifier uses the localization and skin segmentation datasets for the purpose of experimentation. The results of the proposed FCNB classifier are analyzed based on the metrics, such as sensitivity, specificity, and accuracy, and compared with the various existing works.
引用
收藏
页码:994 / 1006
页数:13
相关论文
共 27 条
[1]  
[Anonymous], 2017, IEEE T NEURAL NETW L
[2]  
[Anonymous], 2001, Encyclopedia of Mathematics
[3]   MR-DIS: democratic instance selection for big data by MapReduce [J].
Arnaiz-González Á. ;
González-Rogel A. ;
Díez-Pastor J.-F. ;
López-Nozal C. .
Progress in Artificial Intelligence, 2017, 6 (3) :211-219
[4]   A Map Reduce solution for associative classification of big data [J].
Bechini, Alessio ;
Marcelloni, Francesco ;
Segatori, Armando .
INFORMATION SCIENCES, 2016, 332 :33-55
[5]  
Bhukya R, 2015, PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON APPLIED AND THEORETICAL COMPUTING AND COMMUNICATION TECHNOLOGY (ICATCCT), P357, DOI 10.1109/ICATCCT.2015.7456909
[6]   MR-ELM: a MapReduce-based framework for large-scale ELM training in big data era [J].
Chen, Jiaoyan ;
Chen, Huajun ;
Wan, Xiangyi ;
Zheng, Guozhou .
NEURAL COMPUTING & APPLICATIONS, 2016, 27 (01) :101-110
[7]  
El Bakry M, 2016, PROCEEDINGS OF THE 2016 SAI COMPUTING CONFERENCE (SAI), P118, DOI 10.1109/SAI.2016.7555971
[8]   CHI-BD: A fuzzy rule-based classification system for Big Data classification problems [J].
Elkano, Mikel ;
Galar, Mikel ;
Sanz, Jose ;
Bustince, Humberto .
FUZZY SETS AND SYSTEMS, 2018, 348 :75-101
[9]   Fuzzy rule based classification systems for big data with MapReduce: granularity analysis [J].
Fernandez, Alberto ;
del Rio, Sara ;
Bawakid, Abdullah ;
Herrera, Francisco .
ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2017, 11 (04) :711-730
[10]   Evolving Big Data Stream Classification with MapReduce [J].
Haque, Ahsanul ;
Parker, Brandon ;
Khan, Latifur ;
Thuraisingham, Bhavani .
2014 IEEE 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2014, :570-577