This paper introduces a new family of deterministic and stochastic on-line prediction algorithms that work with general loss functions, and analyzes their behavior in terms of expected loss bounds. The algorithms use parametric probabilistic models regardless of the kind of loss function used. The key idea is to iteratively estimate the probabilistic model by the maximum likelihood method and then to construct an optimal prediction function that minimizes the average loss taken with respect to the estimated probabilistic model; a future outcome is predicted using this optimal prediction function. We analyze the algorithms in the cases where the target distribution is (1) k-dimensional parametric with k known, (2) k-dimensional parametric with k unknown, and (3) nonparametric. In all three cases, we derive upper bounds on the expected instantaneous or cumulative losses of the algorithms for a large family of loss functions satisfying the constraint introduced by Merhav and Feder. These loss bounds reveal new universal relations among the expected prediction accuracy, the indices of the loss function, the complexity of the target rule, and the number of training examples. © 1997 Academic Press.
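To make the estimate-then-minimize idea concrete, the following is a minimal sketch of the prediction loop, assuming a one-dimensional Bernoulli model, squared loss, and a discretized decision space. The function names, the Laplace correction to the ML estimate, and the grid of candidate decisions are all illustrative assumptions, not the paper's construction.

```python
import numpy as np

def mle_bernoulli(outcomes):
    """Maximum likelihood estimate of the Bernoulli parameter.
    (The +1/+2 Laplace correction, an assumption added here, avoids
    degenerate 0/1 estimates on an empty or one-sided history.)"""
    return (sum(outcomes) + 1.0) / (len(outcomes) + 2.0)

def optimal_prediction(theta_hat, loss, candidates):
    """Decision minimizing the average loss under the estimated model:
    argmin_d  theta_hat * loss(1, d) + (1 - theta_hat) * loss(0, d)."""
    expected = [theta_hat * loss(1, d) + (1 - theta_hat) * loss(0, d)
                for d in candidates]
    return candidates[int(np.argmin(expected))]

def online_predict(stream, loss):
    """Iterate: estimate the model by ML, predict with the
    loss-minimizing decision, observe the outcome, record the loss."""
    candidates = np.linspace(0.0, 1.0, 101)  # discretized decision space
    history, losses = [], []
    for y in stream:
        theta_hat = mle_bernoulli(history)
        d = optimal_prediction(theta_hat, loss, candidates)
        losses.append(loss(y, d))
        history.append(y)
    return losses

# Example: squared loss, for which the optimal decision is the
# estimated mean itself; the cumulative loss is what the paper bounds.
rng = np.random.default_rng(0)
stream = rng.binomial(1, 0.7, size=200)
losses = online_predict(stream, lambda y, d: (y - d) ** 2)
print(f"cumulative loss: {sum(losses):.3f}")
```

Swapping in a different loss (e.g. absolute loss) changes only the decision rule that `optimal_prediction` recovers, while the ML estimation step is unchanged, which mirrors the paper's point that the same parametric model serves arbitrary loss functions in the Merhav-Feder family.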