A fuzzy random forest

Cited: 129
Authors
Bonissone, Piero [2 ]
Cadenas, Jose M. [1 ]
Carmen Garrido, M. [1 ]
Andres Diaz-Valladares, R. [3 ]
Affiliations
[1] Univ Murcia, Dept Ingn Informac & Comun, E-30001 Murcia, Spain
[2] GE Global Res, Moscow 123098, Russia
[3] Univ Montemorelos, Dept Ciencias Computac, Mexico City, DF, Mexico
Keywords
Random forest; Fuzzy decision tree; Combination methods; Fuzzy sets; DECISION TREE; CLASSIFICATION; CLASSIFIERS; ENSEMBLES;
DOI
10.1016/j.ijar.2010.02.003
CLC number
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104; 0812; 0835; 1405
Abstract
When individual classifiers are combined appropriately, a statistically significant increase in classification accuracy is usually obtained. Multiple classifier systems are the result of combining several individual classifiers. Following Breiman's methodology, in this paper a multiple classifier system based on a "forest" of fuzzy decision trees, i.e., a fuzzy random forest, is proposed. This approach combines the robustness of multiple classifier systems, the power of randomness to increase the diversity of the trees, and the flexibility of fuzzy logic and fuzzy sets for managing imperfect data. Various combination methods to obtain the final decision of the multiple classifier system are proposed and compared. Some of them are weighted combination methods, which weight the decisions of the different elements of the multiple classifier system (leaves or trees). A comparative study with several datasets is made to show the efficiency of the proposed multiple classifier system and the various combination methods. The proposed multiple classifier system exhibits good classification accuracy, comparable to that of the best classifiers when tested with conventional datasets. However, unlike other classifiers, the proposed classifier provides similar accuracy when tested with imperfect datasets (with missing and fuzzy values) and with noisy datasets. (C) 2010 Elsevier Inc. All rights reserved.
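The weighted combination of tree decisions described in the abstract can be sketched as follows. This is a minimal illustration, not the paper's actual algorithm: the per-tree weights and the per-class fuzzy membership degrees reported by each tree are assumed inputs, and only one of the several combination methods the paper compares (tree-level weighted aggregation) is shown.

```python
# Minimal sketch of a weighted combination step for a fuzzy forest:
# each fuzzy tree reports a membership degree per class for an example,
# a non-negative weight scales each tree's contribution, and the class
# with the largest aggregated support is chosen.
# (Illustrative only; the cited paper defines several such methods.)

def weighted_forest_decision(tree_memberships, tree_weights):
    """tree_memberships: list of dicts {class_label: membership in [0, 1]},
    one dict per tree. tree_weights: one non-negative weight per tree."""
    support = {}
    for memberships, w in zip(tree_memberships, tree_weights):
        for label, mu in memberships.items():
            support[label] = support.get(label, 0.0) + w * mu
    return max(support, key=support.get)

# Example: three fuzzy trees, the second considered more reliable.
trees = [{"A": 0.6, "B": 0.4}, {"A": 0.2, "B": 0.8}, {"A": 0.5, "B": 0.5}]
weights = [1.0, 2.0, 1.0]
# Aggregated support: A = 0.6 + 0.4 + 0.5 = 1.5, B = 0.4 + 1.6 + 0.5 = 2.5
print(weighted_forest_decision(trees, weights))  # prints "B"
```

Because membership degrees are graded rather than crisp votes, a tree that is uncertain between classes contributes proportionally to both, which is what lets this kind of ensemble degrade gracefully on imperfect data.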
Pages: 729-747
Page count: 19
References
32 items
[1] Ahn, Hongshik; Moon, Hojin; Fazzari, Melissa J.; Lim, Noha; Chen, James J.; Kodell, Ralph L. Classification by ensembles from random partitions of high-dimensional data [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 51 (12): 6166-6179
[2] Asuncion A., UCI MACHINE LEARNING
[3] Banfield, Robert E.; Hall, Lawrence O.; Bowyer, Kevin W.; Kegelmeyer, W. P. A comparison of decision tree ensemble creation techniques [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2007, 29 (01): 173-180
[4] Benjamini, Y.; Hochberg, Y. Controlling the false discovery rate - a practical and powerful approach to multiple testing [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01): 289-300
[5] BONISSONE PP, 2008, APPL SOFT COMPUT, DOI: 10.1016/J.ASOC.2008.03.06
[6] BONISSONE PP, 2008, INF PROC MAN UNC KNO, P1231
[7] Breiman, L. Random forests [J]. MACHINE LEARNING, 2001, 45 (01): 5-32
[9] CADENAS JM, P 2 S LOG FUZZ SOFT, P41
[10] CADENAS JM, 2009, P 13 C AS ESP INT AR, P379