Fuzzy Hoeffding Decision Tree for Data Stream Classification

被引:21
作者
Ducange, Pietro [1 ]
Marcelloni, Francesco [1 ]
Pecori, Riccardo [2 ]
机构
[1] Univ Pisa, Dept Informat Engn, Largo L Lazzerino 1, I-56122 Pisa, Italy
[2] Univ Sannio, Dept Engn, Via Traiano 9, I-82100 Benevento, Italy
关键词
Streaming data classification; Fuzzy decision tree; Hoeffding decision tree; Model interpretability; EVOLVING FUZZY; IDENTIFICATION;
D O I
10.2991/ijcis.d.210212.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data stream mining has recently grown in popularity, thanks to an increasing number of applications which need continuous and fast analysis of streaming data. Such data are generally produced in application domains that require immediate reactions with strict temporal constraints. These particular characteristics make problematic the use of classical machine learning algorithms for mining knowledge from these fast data streams and call for appropriate techniques. In this paper, based on the well-known Hoeffding Decision Tree (HDT) for streaming data classification, we introduce FHDT, a fuzzy HDT that extends HDT with fuzziness, thus making HDT more robust to noisy and vague data. We tested FHDT on three synthetic datasets, usually adopted for analyzing concept drifts in data stream classification, and two real-world datasets, already exploited in some recent researches on fuzzy systems for streaming data. We show that FHDT outperforms HDT, especially in presence of concept drift. Furthermore, FHDT is characterized by a high level of interpretability, thanks to the linguistic rules that can be extracted from it. (C) 2021 The Authors. Published by Atlantis Press B.V.
引用
收藏
页码:946 / 964
页数:19
相关论文
共 41 条
[1]  
Amoretti M., 2020, IEEE T IND INFORM, P1
[2]   Evolving fuzzy systems from data streams in real-time [J].
Angelov, Plamen ;
Zhou, Xiaowei .
2006 INTERNATIONAL SYMPOSIUM ON EVOLVING FUZZY SYSTEMS, PROCEEDINGS, 2006, :29-+
[3]   An approach to Online identification of Takagi-Suigeno fuzzy models [J].
Angelov, PP ;
Filev, DP .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2004, 34 (01) :484-498
[4]  
Ayres A. O. C., 2019, EVOL SYST-GER, P1
[5]   An analysis of boosted ensembles of binary fuzzy decision trees [J].
Barsacchi, Marco ;
Bechini, Alessio ;
Marcelloni, Francesco .
EXPERT SYSTEMS WITH APPLICATIONS, 2020, 154
[6]  
Bhandari B., 2018, Int. J. Educ. Manag. Eng, V8, P40, DOI 10.5815/ijeme.2018.01.05
[7]  
Bifet A, 2013, INFORM-J COMPUT INFO, V37, P15
[8]  
Breiman L., 1984, Classification and Regression Trees, DOI DOI 10.1201/9781315139470
[9]   Accurate occupancy detection of an office room from light, temperature, humidity and CO2 measurements using statistical learning models [J].
Candanedo, Luis M. ;
Feldheim, Veronique .
ENERGY AND BUILDINGS, 2016, 112 :28-39
[10]  
Casalino G., 2018, INT WORKSH FUZZ LOG, P109, DOI DOI 10.1007/978-3-030-12544-8_9