A Distributed Policy Gradient Algorithm for Optimal Coordination of Mobile Sensor Networks

被引:2
作者
Wang, Jing [1 ]
Khanh Pham [2 ]
机构
[1] Bradley Univ, Dept Elect & Comp Engn, Peoria, IL 61625 USA
[2] Air Force Res Lab, Space Vehicles Directorate, Kirtland AFB, NM 87117 USA
来源
2022 IEEE SENSORS | 2022年
关键词
COOPERATIVE CONTROL; SYSTEMS; AGENTS;
D O I
10.1109/SENSORS52175.2022.9967129
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we study the optimal deployment problem in mobile sensor networks. By recasting it as an optimal coordination problem of multiagents, a new distributed policy gradient algorithm is proposed based on the minimization of the overall cost for all agents. The proposed algorithm relies on local information exchanges among neighboring agents without the requirement of known system dynamics. The policy gradient is computed based on sampling the trajectory under the perturbed control policy. The control policy is parameterized and the adaptive parameter update is carried out following the negative gradient of the overall cost. The rigorous analysis of the proposed algorithm is provided.
引用
收藏
页数:4
相关论文
共 26 条
[1]  
Bertsekas D.P., 1996, NEURO DYNAMIC PROGRA
[2]  
Busoniu L, 2010, AUTOM CONTROL ENG SE, P1, DOI 10.1201/9781439821091-f
[3]   Adaptive consensus output regulation of a class of nonlinear systems with unknown high-frequency gain [J].
Ding, Zhengtao .
AUTOMATICA, 2015, 51 :348-355
[4]   Information flow and cooperative control of vehicle formations [J].
Fax, JA ;
Murray, RM .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2004, 49 (09) :1465-1476
[5]   Coordination of groups of mobile autonomous agents using nearest neighbor rules [J].
Jadbabaie, A ;
Lin, J ;
Morse, AS .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2003, 48 (06) :988-1001
[6]   Cooperative and Active Sensing in Mobile Sensor Networks for Scalar Field Mapping [J].
La, Hung M. ;
Sheng, Weihua ;
Chen, Jiming .
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2015, 45 (01) :1-12
[7]   Reinforcement Learning and Feedback Control USING NATURAL DECISION METHODS TO DESIGN OPTIMAL ADAPTIVE CONTROLLERS [J].
Lewis, Frank L. ;
Vrabie, Draguna ;
Vamvoudakis, Kyriakos G. .
IEEE CONTROL SYSTEMS MAGAZINE, 2012, 32 (06) :76-105
[8]   Local control strategies for groups of mobile autonomous agents [J].
Lin, ZY ;
Broucke, M ;
Francis, B .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2004, 49 (04) :622-629
[9]   Distributed Subgradient Methods for Multi-Agent Optimization [J].
Nedic, Angelia ;
Ozdaglar, Asurrian .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2009, 54 (01) :48-61
[10]   Consensus problems in networks of agents with switching topology and time-delays [J].
Olfati-Saber, R ;
Murray, RM .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2004, 49 (09) :1520-1533