On the Robustness of Decision Tree Learning Under Label Noise

被引：19

作者：

Ghosh, Aritra ^{[1
]}

Manwani, Naresh ^{[2
]}

Sastry, P. S. ^{[3
]}

机构：

[1] Microsoft, Bangalore, Karnataka, India

[2] Int Inst Informat Technol, Hyderabad, India

[3] Indian Inst Sci, Bangalore, Karnataka, India

来源：

ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2017, PT I | 2017年 / 10234卷

关键词：

Robust learning; Decision trees; Label noise; CLASSIFICATION;

D O I：

10.1007/978-3-319-57454-7_53

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In most practical problems of classifier learning, the training data suffers from label noise. Most theoretical results on robustness to label noise involve either estimation of noise rates or non-convex optimization. Further, none of these results are applicable to standard decision tree learning algorithms. This paper presents some theoretical analysis to show that, under some assumptions, many popular decision tree learning algorithms are inherently robust to label noise. We also present some sample complexity results which provide some bounds on the sample size for the robustness to hold with a high probability. Through extensive simulations we illustrate this robustness.

引用

页码：685 / 697

页数：13

共 19 条

[1]

[Anonymous], 2013, C LEARN THEOR

[2] Random forests [J].