PALO: A probabilistic hill-climbing algorithm

被引:28
作者
Greiner, R
机构
[1] Siemens Corporate Research, Princeton, NJ 08540-6632
关键词
computational learning theory; hill-climbing; speed-up learning; utility problem; knowledge compilation; theory revision; prioritized default theories;
D O I
10.1016/0004-3702(95)00040-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many learning systems search through a space of possible performance elements, seeking an element whose expected utility, over the distribution of problems, is high. As the task of finding the globally optimal element is often intractable, many practical learning systems instead hill-climb to a local optimum. Unfortunately, even this is problematic as the learner typically does not know the underlying distribution of problems, which it needs to determine an element's expected utility. This paper addresses the task of approximating this hill-climbing search when the utility function can only be estimated by sampling. We present a general algorithm, PALO, that returns an element that is, with provably high probability, essentially a local optimum. We then demonstrate the generality of this algorithm by presenting three distinct applications that respectively find an element whose efficiency, accuracy or completeness is nearly optimal. These results suggest approaches to solving the utility problem from explanation-based learning, the multiple extension problem from nonmonotonic reasoning and the tractability/completeness tradeoff problem from knowledge representation.
引用
收藏
页码:177 / 208
页数:32
相关论文
共 80 条
[1]  
[Anonymous], 1982, ESTIMATION DEPENDENC
[2]  
[Anonymous], 1979, Computers and Intractablity: A Guide to the Theoryof NP-Completeness
[3]  
ARORA S, 1992, AN S FDN CO, P14
[4]  
AUER P, 1995, P 12 INT C MACH LEAR
[5]  
Berry D. A., 1985, BANDIT PROBLEMS SEQU
[6]  
BODDY M, 1988, SOLVING TIME DEPENDE
[7]  
Bollobas B, 1985, RANDOM GRAPHS
[8]   CLASSIFIER SYSTEMS AND GENETIC ALGORITHMS [J].
BOOKER, LB ;
GOLDBERG, DE ;
HOLLAND, JH .
ARTIFICIAL INTELLIGENCE, 1989, 40 (1-3) :235-282
[9]  
BORGIDA A, 1989, S REPR REAS, P33
[10]  
Boros E., 1990, Annals of Mathematics and Artificial Intelligence, V1, P21