Approximating Optimal Binary Decision Trees

被引:20
作者
Adler, Micah [2 ]
Heeringa, Brent [1 ]
机构
[1] Williams Coll, Dept Comp Sci, Williamstown, MA 01267 USA
[2] Fiksu Inc, Boston, MA 02110 USA
关键词
Approximation algorithms; Decision trees; Greedy algorithms; Accounting schemes;
D O I
10.1007/s00453-011-9510-9
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We give a (ln n+1)-approximation for the decision tree (DT) problem. An instance of DT is a set of m binary tests T=(T (1),aEuro broken vertical bar,T (m) ) and a set of n items X=(X (1),aEuro broken vertical bar,X (n) ). The goal is to output a binary tree where each internal node is a test, each leaf is an item and the total external path length of the tree is minimized. Total external path length is the sum of the depths of all the leaves in the tree. DT has a long history in computer science with applications ranging from medical diagnosis to experiment design. It also generalizes the problem of finding optimal average-case search strategies in partially ordered sets which includes several alphabetic tree problems. Our work decreases the previous best upper bound on the approximation ratio by a constant factor. We provide a new analysis of the greedy algorithm that uses a simple accounting scheme to spread the cost of a tree among pairs of items split at a particular node. We conclude by showing that our upper bound also holds for the DT problem with weighted tests.
引用
收藏
页码:1112 / 1121
页数:10
相关论文
共 17 条
[11]  
Hyafil L., 1976, Information Processing Letters, V5, P15, DOI 10.1016/0020-0190(76)90095-8
[12]  
Kosaraju SR, 1999, LECT NOTES COMPUT SC, V1663, P157
[13]   On the hardness of the minimum height decision tree problem [J].
Laber, ES ;
Nogueira, LT .
DISCRETE APPLIED MATHEMATICS, 2004, 144 (1-2) :209-212
[14]  
MORET BME, 1982, COMPUT SURV, V14, P593, DOI 10.1145/356893.356898
[15]  
Moshkov MJ, 2004, LECT NOTES COMPUT SC, V3100, P161
[16]  
Munagala K, 2005, LECT NOTES COMPUT SC, V3363, P83
[17]  
Murthy K.V.S., 1996, THESIS JH U