Interacting reinforced stochastic processes: Statistical inference based on the weighted empirical means

被引:11
作者
Aletti, Giacomo [1 ]
Crimaldi, Irene [2 ]
Ghiglietti, Andrea [3 ]
机构
[1] Univ Milan, ADAMSS Ctr, Milan, Italy
[2] IMT Sch Adv Studies, Lucca, Italy
[3] LivaNova, Milan, Italy
关键词
asymptotic normality; complex networks; interacting random systems; reinforced learning; reinforced stochastic processes; synchronization; urn models; weighted empirical means; CENTRAL LIMIT-THEOREMS; SYNCHRONIZATION; ASYMPTOTICS;
D O I
10.3150/19-BEJ1143
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
This work deals with a system of interacting reinforced stochastic processes, where each process X-j = (X-n,X-j )(n), is located at a vertex j of a finite weighted directed graph, and it can be interpreted as the sequence of "actions" adopted by an agent j of the network. The interaction among the dynamics of these processes depends on the weighted adjacency matrix W associated to the underlying graph: indeed, the probability that an agent j chooses a certain action depends on its personal "inclination" Z(n,j) and on the inclinations Z(n,h), with h not equal j, of the other agents according to the entries of W. The best known example of reinforced stochastic process is the Polya urn. The present paper focuses on the weighted empirical means N-n,N-j = Sigma(n)(k)(=1) q(n,k) X-k,X- j, since, for example, the current experience is more important than the past one in reinforced learning. Their almost sure synchronization and some central limit theorems in the sense of stable convergence are proven. The new approach with weighted means highlights the key points in proving some recent results for the personal inclinations Z(j) = (Z(n,j))(n) and for the empirical means (X) over bar (j) = (Sigma(n)(k)(=1) X-k,X- j/n)(n) given in recent papers (e.g. Aletti, Crimaldi and Ghiglietti (2019), Ann. Appl. Probab. 27 (2017) 3787-3844, Crimaldi et al. Stochastic Process. Appl. 129 (2019) 70-101). In fact, with a more sophisticated decomposition of the considered processes, we can understand how the different convergence rates of the involved stochastic processes combine. From an application point of view, we provide confidence intervals for the common limit inclination of the agents and a test statistics to make inference on the matrix W, based on the weighted empirical means. In particular, we answer a research question posed in Aletti, Crimaldi and Ghiglietti (2019).
引用
收藏
页码:1098 / 1138
页数:41
相关论文
共 34 条
[1]  
Aletti G., 2020, INTERACTING REINFO S, DOI [10.3150/19-BEJ1143SUPP, DOI 10.3150/19-BEJ1143SUPP]
[2]   Networks of reinforced stochastic processes: Asymptotics for the empirical means [J].
Aletti, Giacomo ;
Crimaldi, Irene ;
Ghiglietti, Andrea .
BERNOULLI, 2019, 25 (4B) :3339-3378
[3]   NONPARAMETRIC COVARIATE-ADJUSTED RESPONSE-ADAPTIVE DESIGN BASED ON A FUNCTIONAL URN MODEL [J].
Aletti, Giacomo ;
Ghiglietti, Andrea ;
Rosenberger, William F. .
ANNALS OF STATISTICS, 2018, 46 (6B) :3838-3866
[4]   Dynamics of an adaptive randomly reinforced urn [J].
Aletti, Giacomo ;
Ghiglietti, Andrea ;
Vidyashankar, Anand N. .
BERNOULLI, 2018, 24 (03) :2204-2255
[5]   SYNCHRONIZATION OF REINFORCED STOCHASTIC PROCESSES WITH A NETWORK-BASED INTERACTION [J].
Aletti, Giacomo ;
Crimaldi, Irene ;
Ghiglietti, Andrea .
ANNALS OF APPLIED PROBABILITY, 2017, 27 (06) :3787-3844
[6]   Interacting generalized Friedman's urn systems [J].
Aletti, Giacomo ;
Ghiglietti, Andrea .
STOCHASTIC PROCESSES AND THEIR APPLICATIONS, 2017, 127 (08) :2650-2678
[7]   A generalized Polya's urn with graph based interactions [J].
Benaim, Michel ;
Benjamini, Itai ;
Chen, Jun ;
Lima, Yuri .
RANDOM STRUCTURES & ALGORITHMS, 2015, 46 (04) :614-634
[8]   ASYMPTOTICS FOR RANDOMLY REINFORCED URNS WITH RANDOM BARRIERS [J].
Berti, Patrizia ;
Crimaldi, Irene ;
Pratelli, Luca ;
Rigo, Pietro .
JOURNAL OF APPLIED PROBABILITY, 2016, 53 (04) :1206-1220
[9]  
Berti P, 2011, J APPL PROBAB, V48, P527
[10]   A generalized Polya's urn with graph based interactions: convergence at linearity [J].
Chen, Jun ;
Lucas, Cyrille .
ELECTRONIC COMMUNICATIONS IN PROBABILITY, 2014, 19 :1-13