Boosted incremental tree-based imputation of missing data

被引:0
作者
Siciliano, Roberta [1 ]
Aria, Massimo [1 ]
D'Ambrosio, Antonio [1 ]
机构
[1] Univ Naples Federico II, Dipartimento Matemat & Stat, Naples, Italy
来源
DATA ANALYSIS, CLASSIFICATION AND THE FORWARD SEARCH | 2006年
关键词
D O I
10.1007/3-540-35978-8_31
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Tree-based procedures have been recently considered as non parametric tools for missing data imputation when dealing with large data structures and no probability assumption. A previous work used an incremental algorithm based on cross-validated decision trees and a lexicographic ordering of the single data to be imputed. This paper considers ail ensemble method where tree-based model is used as learner. Furthermore, the incremental imputation concerns missing data of each variable at turn. As a result, the proposed method allows more accurate imputations through a more efficient algorithm. A simulation case study shows the overall good performance of the proposed method against some competitors. A MatLab implementation enriches Tree Harvest Software for non-standard classification and regression trees.
引用
收藏
页码:271 / +
页数:2
相关论文
共 17 条