Properly Learning Decision Trees in almost Polynomial Time

被引:4
作者
Blanc, Guy [1 ]
Lange, Jane [2 ]
Qiao, Mingda [1 ]
Tan, Li-Yang [1 ]
机构
[1] Gates Comp Sci Bldg,353 Serra Mall, Stanford, CA 94305 USA
[2] Stata Ctr, 32 Vassar St, Cambridge, MA 02139 USA
基金
美国国家科学基金会;
关键词
Decision trees; proper learning; LISTS;
D O I
10.1145/3561047
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We give an n(O(log log n))-time membership query algorithm for properly and agnostically learning decision trees under the uniform distribution over {+/- 1}(n). Even in the realizable setting, the previous fastest runtime was n(O(log n)) a consequence of a classic algorithm of Ehrenfeucht and Haussler. Our algorithm shares similarities with practical heuristics for learning decision trees, which we augment with additional ideas to circumvent known lower bounds against these heuristics. To analyze our algorithm, we prove a new structural result for decision trees that strengthens a theorem of O'Donnell, Saks, Schramm, and Servedio. While the OSSS theorem says that every decision tree has an influential variable, we show how every decision tree can be "pruned" so that every variable in the resulting tree is influential.
引用
收藏
页数:19
相关论文
共 30 条
[1]   Approximating Optimal Binary Decision Trees [J].
Adler, Micah ;
Heeringa, Brent .
ALGORITHMICA, 2012, 62 (3-4) :1112-1121
[2]  
[Anonymous], 2006, Theory of Computing, DOI DOI 10.4086/T0C.2006.V002A008
[3]  
Blanc Guy, 2020, P 11 INNOVATIONS THE, V151, P1
[4]  
Blanc Guy, 2020, P 34 C NEURAL INFORM
[5]   RANK-R DECISION TREES ARE A SUBCLASS OF R-DECISION LISTS [J].
BLUM, A .
INFORMATION PROCESSING LETTERS, 1992, 42 (04) :183-185
[6]  
Blum A., 1994, Proceedings of the Twenty-Sixth Annual ACM Symposium on the Theory of Computing, P253, DOI 10.1145/195058.195147
[7]   Selection of relevant features and examples in machine learning [J].
Blum, AL ;
Langley, P .
ARTIFICIAL INTELLIGENCE, 1997, 97 (1-2) :245-271
[8]  
BSHOUTY NH, 1993, AN S FDN CO, P302
[9]   Beyond the Low-Degree Algorithm: Mixtures of Subcubes and Their Applications [J].
Chen, Sitan ;
Moitra, Ankur .
PROCEEDINGS OF THE 51ST ANNUAL ACM SIGACT SYMPOSIUM ON THEORY OF COMPUTING (STOC '19), 2019, :869-880
[10]   Sharp phase transition for the random-cluster and Potts models via decision trees [J].
Duminil-Copin, Hugo ;
Raoufi, Aran ;
Tassion, Vincent .
ANNALS OF MATHEMATICS, 2019, 189 (01) :75-99