Tackling the problem of classification with noisy data using Multiple Classifier Systems: Analysis of the performance and robustness

被引:70
|
作者
Saez, Jose A. [1 ]
Galar, Mikel [2 ]
Luengo, Julian [3 ]
Herrera, Francisco [1 ]
机构
[1] Univ Granada, Dept Comp Sci & Artificial Intelligence, CITIC UGR, E-18071 Granada, Spain
[2] Univ Publ Navarra, Dept Automat & Computac, Pamplona 31006, Spain
[3] Univ Burgos, LSI, Dept Civil Engn, Burgos 09006, Spain
关键词
Noisy data; Class noise; Attribute noise; Multiple Classifier System; Classification; RECOGNITION; COMBINATION;
D O I
10.1016/j.ins.2013.06.002
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Traditional classifier learning algorithms build a unique classifier from the training data. Noisy data may deteriorate the performance of this classifier depending on the degree of sensitiveness to data corruptions of the learning method. In the literature, it is widely claimed that building several classifiers from noisy training data and combining their predictions is an interesting method of overcoming the individual problems produced by noise in each classifier. This statement is usually not supported by thorough empirical studies considering problems with different types and levels of noise. Furthermore, in noisy environments, the noise robustness of the methods can be more important than the performance results themselves and, therefore, it must be carefully studied. This paper aims to reach conclusions on such aspects focusing on the analysis of the behavior, in terms of performance and robustness, of several Multiple Classifier Systems against their individual classifiers when these are trained with noisy data. In order to accomplish this study, several classification algorithms, of varying noise robustness, will be chosen and compared with respect to their combination on a large collection of noisy datasets. The results obtained show that the success of the Multiple Classifier Systems trained with noisy data depends on the individual classifiers chosen, the decisions combination method and the type and level of noise present in the dataset, but also on the way of creating diversity to build the final system. In most of the cases, they are able to outperform all their single classification algorithms in terms of global performance, even though their robustness results will depend on the way of introducing diversity into the Multiple Classifier System. (C) 2013 Elsevier Inc. All rights reserved.
引用
收藏
页码:1 / 20
页数:20
相关论文
共 50 条
  • [1] Evaluating the classifier behavior with noisy data considering performance and robustness: The Equalized Loss of Accuracy measure
    Saez, Jose A.
    Luengo, Julian
    Herrera, Francisco
    NEUROCOMPUTING, 2016, 176 : 26 - 35
  • [2] Defect classification of highly noisy NDE data using classifier ensembles
    Goebel, Kai F.
    Yan, Weizhong
    Eklund, Neil H. W.
    Hu, Xiao
    Avasarala, Vishwanath
    Celaya, Jose
    SMART STRUCTURES AND MATERIALS 2006: SMART SENSOR MONITORING SYSTEMS AND APPLICATIONS, 2006, 6167
  • [3] Multiple Classifier Systems for Hyperspectral Remote Sensing Data Classification
    Iman Khosravi
    Majid Mohammad-Beigi
    Journal of the Indian Society of Remote Sensing, 2014, 42 : 423 - 428
  • [4] Multiple Classifier Systems for Hyperspectral Remote Sensing Data Classification
    Khosravi, Iman
    Mohammad-Beigi, Majid
    JOURNAL OF THE INDIAN SOCIETY OF REMOTE SENSING, 2014, 42 (02) : 423 - 428
  • [5] Enhancing classification in correlative microscopy using multiple classifier systems with dynamic selection
    Bitrus, Samuel
    Fitzek, Harald
    Rigger, Eugen
    Rattenberger, Johannes
    Entner, Doris
    ULTRAMICROSCOPY, 2022, 240
  • [6] Effective classification of noisy data streams with attribute-oriented dynamic classifier selection
    Xingquan Zhu
    Xindong Wu
    Ying Yang
    Knowledge and Information Systems, 2006, 9 : 339 - 363
  • [7] Effective classification of noisy data streams with attribute-oriented dynamic classifier selection
    Zhu, XQ
    Wu, XD
    Yang, Y
    KNOWLEDGE AND INFORMATION SYSTEMS, 2006, 9 (03) : 339 - 363
  • [8] A Data Mining Tool for Water Uses Classification Based on Multiple Classifier Systems
    Dario Lopez, Ivan
    Heidelberg Valencia, Cristian
    Carlos Corrales, Juan
    MACHINE LEARNING, OPTIMIZATION, AND BIG DATA, MOD 2017, 2018, 10710 : 362 - 375
  • [9] A New Approach of Boosting using Decision Tree Classifier for Classifying Noisy Data
    Farid, Dewan Md.
    Maruf, Golam Morshed
    Rahman, Chowdhury Mofizur
    2013 INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS & VISION (ICIEV), 2013,
  • [10] The effectiveness of using diversity to select multiple classifier systems with varying classification thresholds
    Butler, Harris K.
    Friend, Mark A.
    Bauer, Kenneth W., Jr.
    Bihl, Trevor J.
    JOURNAL OF ALGORITHMS & COMPUTATIONAL TECHNOLOGY, 2018, 12 (03) : 187 - 199