Soft Computing Based Evolutionary Multi-Label Classification

被引:0
作者
Aslam, Rubina [1 ]
Tamimy, Manzoor Illahi [1 ]
Aslam, Waqar [2 ]
机构
[1] COMSATS Univ Islamabad, Dept Comp Sci, Islamabad 4550, Pakistan
[2] Islamia Univ Bahawalpur, Dept Comp Sci & IT, Bahawalpur 63100, Pakistan
关键词
Multi-label classification; genetic algorithm; ensemble; noisy datasets; Credal C4.5; DECISION TREES; ENSEMBLES; CHALLENGES; KNN;
D O I
10.32604/iasc.2020.013086
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Machine Learning (ML) has revolutionized intelligent systems that range from self-driving automobiles, search engines, business/market analysis, fraud detection, network intrusion investigation, and medical diagnosis. Classification lies at the core of Machine Learning and Multi-label Classification (MLC) is the closest to real-life problems related to heuristics. It is a type of classification problem where multiple labels or classes can be assigned to more than one instance simultaneously. The level of complexity in MLC is increased by factors such as data imbalance, high dimensionality, label correlations, and noise. Conventional MLC techniques such as ensembles- based approaches, Multi-label Stacking, Random k-label sets, and Hierarchy of Multi-label Classifiers struggle to handle these issues and suffer from the increased complexity introduced by these factors. The application of Soft Computing (SC) techniques in intelligent systems has provided a new paradigm for complex real-life problems. These techniques are more tolerant of the inherent imprecision and ambiguity in human thinking. Based on SC techniques such as evolutionary computing and genetic algorithms, intelligent classification systems can be developed that can recognize complex patterns even in noisy datasets otherwise invisible to conventional systems. This study uses an evolutionary approach to handle the MLC noise issue by proposing the Evolutionary Ensemble of Credal C4.5 (EECC). It uses the Credal C4.5 classifier which is based on imprecise probability theory for handling noisy datasets. It can perform effectively in diverse areas of multi-label classification. Experiments on different datasets show that EECC outperforms other techniques in the presence of noise and is noise-robust. Statistical tests show the significance of EECC as compared to other techniques.
引用
收藏
页码:1233 / 1249
页数:17
相关论文
共 47 条
[1]  
Alabdullatif A., 2016, MOBILE INFORM SYSTEM, V2016, P1
[2]  
Alazaidah R, 2016, INT J ADV COMPUT SC, V7, P127
[3]  
[Anonymous], 2010, P ACM SIGKDD
[4]  
[Anonymous], 2001, EUR C PRINC DAT MIN
[5]  
[Anonymous], 2008, P ECML PKDD 2008 WOR
[6]  
[Anonymous], 2013, An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods
[7]   Improving kNN multi-label classification in Prototype Selection scenarios using class proposals [J].
Calvo-Zaragoza, Jorge ;
Valero-Mas, Jose J. ;
Rico-Juan, Juan R. .
PATTERN RECOGNITION, 2015, 48 (05) :1608-1622
[8]  
Cappozzo A., 2019, ADV DATA ANAL CLASSI, V14, P1
[9]   Addressing imbalance in multilabel classification: Measures and random resampling algorithms [J].
Charte, Francisco ;
Rivera, Antonio J. ;
del Jesus, Maria J. ;
Herrera, Francisco .
NEUROCOMPUTING, 2015, 163 :3-16
[10]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)