ONLINE LEARNING WITH MARKOV SAMPLING

Cited by: 124
Authors
Smale, Steve [1 ]
Zhou, Ding-Xuan [2 ]
Affiliations
[1] Toyota Technol Inst Chicago, Chicago, IL 60637 USA
[2] City Univ Hong Kong, Dept Math, Kowloon, Hong Kong, Peoples R China
Funding
National Science Foundation (USA);
Keywords
Learning theory; online learning; Markov sampling; reproducing kernel Hilbert space; CLASSIFICATION; ERROR;
DOI
10.1142/S0219530509001293
Chinese Library Classification
O29 [Applied Mathematics];
Discipline Code
070104;
Abstract
This paper attempts to give an extension of learning theory to a setting where the assumption of i.i.d. data is weakened by keeping the independence but abandoning the identical restriction. We hypothesize that a sequence of examples (x_t, y_t) in X × Y for t = 1, 2, 3, ... is drawn from a probability distribution ρ^(t) on X × Y. The marginal probabilities on X are supposed to converge to a limit probability on X. Two main examples for this time process are discussed. The first is a stochastic one which in the special case of a finite space X is defined by a stochastic matrix and more generally by a stochastic kernel. The second is determined by an underlying discrete dynamical system on the space X. Our theoretical treatment requires that this dynamics be hyperbolic (or "Axiom A"), which still permits a class of chaotic systems (with Sinai-Ruelle-Bowen attractors). Even in the case of a limit Dirac point probability, one needs the measure theory to be defined using Hölder spaces. Many implications of our work remain unexplored. These include, for example, the relation to Hidden Markov Models, as well as Markov Chain Monte Carlo methods. It seems reasonable that further work should consider the push forward of the process from X × Y by some kind of observable function to a data space.
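To make the first example concrete, the following is a minimal Python sketch (not the paper's algorithm verbatim) of Markov sampling driving an online learner: states x_t are generated by a row-stochastic transition matrix P on a finite space X, so the marginals converge to the chain's stationary distribution, and a kernel least-squares learner in an RKHS is updated one example at a time. All specifics (the matrix P, the target f_star, the step size eta_t, the kernel width, the noise level) are illustrative assumptions, not taken from the record above.

    import numpy as np

    rng = np.random.default_rng(0)

    # Finite state space X embedded in [0, 1]; a row-stochastic transition matrix.
    X = np.linspace(0.0, 1.0, 5)
    P = np.array([[0.50, 0.50, 0.00, 0.00, 0.00],
                  [0.25, 0.50, 0.25, 0.00, 0.00],
                  [0.00, 0.25, 0.50, 0.25, 0.00],
                  [0.00, 0.00, 0.25, 0.50, 0.25],
                  [0.00, 0.00, 0.00, 0.50, 0.50]])

    def f_star(x):
        """Hypothetical regression target generating the labels."""
        return np.sin(2.0 * np.pi * x)

    def gaussian_kernel(x, xp, width=0.2):
        return np.exp(-((x - xp) ** 2) / (2.0 * width ** 2))

    # Online learner f_t = sum_s a_s K(x_s, .), updated by stochastic gradient
    # descent on the regularized least-squares loss.
    centers, coeffs = [], []
    lam = 0.01          # regularization parameter (illustrative)
    state = 0           # initial state of the Markov chain
    T = 200

    for t in range(1, T + 1):
        x_t = X[state]
        y_t = f_star(x_t) + 0.1 * rng.standard_normal()  # noisy label
        # Current prediction f_{t-1}(x_t).
        pred = sum(a * gaussian_kernel(c, x_t) for c, a in zip(centers, coeffs))
        eta_t = 1.0 / (lam * t + 1.0)  # decaying step size
        # Gradient step: shrink old coefficients, append a new kernel term, i.e.
        # f_t = (1 - eta_t * lam) f_{t-1} - eta_t (f_{t-1}(x_t) - y_t) K(x_t, .).
        coeffs = [(1.0 - eta_t * lam) * a for a in coeffs]
        centers.append(x_t)
        coeffs.append(-eta_t * (pred - y_t))
        # Markov transition: next state drawn from row `state` of P.
        state = rng.choice(len(X), p=P[state])

    print("final prediction at x = 0.5:",
          sum(a * gaussian_kernel(c, 0.5) for c, a in zip(centers, coeffs)))

Because the samples are a Markov chain rather than i.i.d. draws, consecutive examples here are dependent; only the limit distribution of the marginals is fixed, which is exactly the relaxation the abstract describes.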
Pages: 87-113
Page count: 27