Interacting reinforced stochastic processes: Statistical inference based on the weighted empirical means

被引：11

作者：

Aletti, Giacomo ^{[1
]}

Crimaldi, Irene ^{[2
]}

Ghiglietti, Andrea ^{[3
]}

机构：

[1] Univ Milan, ADAMSS Ctr, Milan, Italy

[2] IMT Sch Adv Studies, Lucca, Italy

[3] LivaNova, Milan, Italy

来源：

BERNOULLI | 2020年 / 26卷 / 02期

关键词：

asymptotic normality; complex networks; interacting random systems; reinforced learning; reinforced stochastic processes; synchronization; urn models; weighted empirical means; CENTRAL LIMIT-THEOREMS; SYNCHRONIZATION; ASYMPTOTICS;

D O I：

10.3150/19-BEJ1143

中图分类号：

O21 [概率论与数理统计]; C8 [统计学];

学科分类号：

020208 ; 070103 ; 0714 ;

摘要：

This work deals with a system of interacting reinforced stochastic processes, where each process X-j = (X-n,X-j )(n), is located at a vertex j of a finite weighted directed graph, and it can be interpreted as the sequence of "actions" adopted by an agent j of the network. The interaction among the dynamics of these processes depends on the weighted adjacency matrix W associated to the underlying graph: indeed, the probability that an agent j chooses a certain action depends on its personal "inclination" Z(n,j) and on the inclinations Z(n,h), with h not equal j, of the other agents according to the entries of W. The best known example of reinforced stochastic process is the Polya urn. The present paper focuses on the weighted empirical means N-n,N-j = Sigma(n)(k)(=1) q(n,k) X-k,X- j, since, for example, the current experience is more important than the past one in reinforced learning. Their almost sure synchronization and some central limit theorems in the sense of stable convergence are proven. The new approach with weighted means highlights the key points in proving some recent results for the personal inclinations Z(j) = (Z(n,j))(n) and for the empirical means (X) over bar (j) = (Sigma(n)(k)(=1) X-k,X- j/n)(n) given in recent papers (e.g. Aletti, Crimaldi and Ghiglietti (2019), Ann. Appl. Probab. 27 (2017) 3787-3844, Crimaldi et al. Stochastic Process. Appl. 129 (2019) 70-101). In fact, with a more sophisticated decomposition of the considered processes, we can understand how the different convergence rates of the involved stochastic processes combine. From an application point of view, we provide confidence intervals for the common limit inclination of the agents and a test statistics to make inference on the matrix W, based on the weighted empirical means. In particular, we answer a research question posed in Aletti, Crimaldi and Ghiglietti (2019).

引用

页码：1098 / 1138

页数：41

共 34 条

[1]

Aletti G., 2020, INTERACTING REINFO S, DOI [10.3150/19-BEJ1143SUPP, DOI 10.3150/19-BEJ1143SUPP]

[2] Networks of reinforced stochastic processes: Asymptotics for the empirical means [J].