The distributed computation of Nash equilibria is assuming growing relevance in engineering where such problems emerge in the context of distributed control. Accordingly, we present schemes for computing equilibria of two classes of static stochastic convex games complicated by a parametric misspecification, a natural concern in the control of large-scale networked engineered system. In both schemes, players learn the equilibrium strategy while resolving the misspecification: 1) Monotone stochastic Nash games: We present a set of coupled stochastic approximation schemes distributed across agents in which the first scheme updates each agent's strategy via a projected (stochastic) gradient step, whereas the second scheme updates every agent's belief regarding its misspecified parameter using an independently specified learning problem. We proceed to show that the produced sequences converge in an almost sure sense to the true equilibrium strategy and the true parameter, respectively. Surprisingly, convergence in the equilibrium strategy achieves the optimal rate of convergence in a mean-squared sense with a quantifiable degradation in the rate constant; 2) Stochastic Nash-Cournot games with unobservable aggregate output: We refine 1) to a Cournot setting where we assume that the tuple of strategies is unobservable while payoff functions and strategy sets are public knowledge through a common knowledge assumption. By utilizing observations of noise-corrupted prices, iterative fixed-point schemes are developed, allowing for simultaneously learning the equilibrium strategies and the misspecified parameter in an almost sure sense.
机构:
Chinese Acad Sci, Acad Math & Syst Sci, Key Lab Syst & Control, Beijing 100190, Peoples R ChinaChinese Acad Sci, Acad Math & Syst Sci, Key Lab Syst & Control, Beijing 100190, Peoples R China
Huang, Shijie
Lei, Jinlong
论文数: 0引用数: 0
h-index: 0
机构:
Tongji Univ, Dept Control Sci & Engn, Shanghai 201804, Peoples R China
Shanghai Res Inst Intelligent Autonomous Syst, Shanghai 201210, Peoples R ChinaChinese Acad Sci, Acad Math & Syst Sci, Key Lab Syst & Control, Beijing 100190, Peoples R China
Lei, Jinlong
Hong, Yiguang
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Acad Sci, Acad Math & Syst Sci, Key Lab Syst & Control, Beijing 100190, Peoples R China
Tongji Univ, Dept Control Sci & Engn, Shanghai 201804, Peoples R China
Shanghai Res Inst Intelligent Autonomous Syst, Shanghai 201210, Peoples R ChinaChinese Acad Sci, Acad Math & Syst Sci, Key Lab Syst & Control, Beijing 100190, Peoples R China
机构:
KTH Royal Inst Technol, Sch Elect Engn & Comp Sci, S-10044 Stockholm, SwedenKTH Royal Inst Technol, Sch Elect Engn & Comp Sci, S-10044 Stockholm, Sweden
Chen, Guanpu
Xu, Gehui
论文数: 0引用数: 0
h-index: 0
机构:
Imperial Coll London, Dept Elect & Elect Engn, London SW7 2AZ, EnglandKTH Royal Inst Technol, Sch Elect Engn & Comp Sci, S-10044 Stockholm, Sweden
Xu, Gehui
He, Fengxiang
论文数: 0引用数: 0
h-index: 0
机构:
Univ Edinburgh, Artificial Intelligence & Applicat Inst, Sch Informat, Edinburgh EH8 9AB, Midlothian, ScotlandKTH Royal Inst Technol, Sch Elect Engn & Comp Sci, S-10044 Stockholm, Sweden
He, Fengxiang
Hong, Yiguang
论文数: 0引用数: 0
h-index: 0
机构:
Tongji Univ, Res Inst Intelligent Autonomous Syst, Dept Control Sci & Engn, Shanghai 210201, Peoples R ChinaKTH Royal Inst Technol, Sch Elect Engn & Comp Sci, S-10044 Stockholm, Sweden
Hong, Yiguang
Rutkowski, Leszek
论文数: 0引用数: 0
h-index: 0
机构:
AGH Univ Sci & Technol, PL-30059 Krakow, Poland
Polish Acad Sci, Syst Res Inst, PL-01447 Warsaw, PolandKTH Royal Inst Technol, Sch Elect Engn & Comp Sci, S-10044 Stockholm, Sweden
Rutkowski, Leszek
Tao, Dacheng
论文数: 0引用数: 0
h-index: 0
机构:
Nanyang Technol Univ, Coll Comp & Data Sci, Singapore 639798, SingaporeKTH Royal Inst Technol, Sch Elect Engn & Comp Sci, S-10044 Stockholm, Sweden