Data Collection Versus Data Estimation: A Fundamental Trade-Off in Dynamic Networks

被引：5

作者：

Arabneydi, Jalal ^{[1
]}

Aghdam, Amir G. ^{[1
]}

机构：

[1] Concordia Univ, Dept Elect & Comp Engn, Montreal, PQ H3G 1M8, Canada

来源：

IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING | 2020年 / 7卷 / 03期

基金：

加拿大自然科学与工程研究理事会;

关键词：

Estimation; Planning; Data collection; Data models; Markov processes; Computational complexity; Networked systems; Partially observable Markov decision process; reinforcement learning; separation principle;

D O I：

10.1109/TNSE.2020.2966504

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

An important question that often arises in the operation of networked systems is whether to collect the real-time data or to estimate them based on the previously collected data. Various factors should be taken into account such as how informative the data are at each time instant for state estimation, how costly and credible the collected data are, and how rapidly the data vary with time. The above question can be formulated as a dynamic decision making problem with imperfect information structure, where a decision maker wishes to find an efficient way to switch between data collection and data estimation while the quality of the estimation depends on the previously collected data (i.e., duality effect). In this paper, the evolution of the state of each node is modeled as an exchangeable Markov process for discrete features and equivariant linear system for continuous features, where the data of interest are defined in the former case as the empirical distribution of the states, and in the latter case as the weighted average of the states. When the data are collected, they may or may not be credible, according to a Bernoulli distribution. Based on a novel planning space, a Bellman equation is proposed to identify a near-optimal strategy whose computational complexity is logarithmic with respect to the inverse of the desired maximum distance from the optimal solution, and polynomial with respect to the number of nodes. A reinforcement learning algorithm is developed for the case when the model is not known exactly, and its convergence to the near-optimal solution is shown subsequently. In addition, a certainty threshold is introduced that determines when data estimation is more desirable than data collection, as the number of nodes increases. For the special case of linear dynamics, a separation principle is constructed wherein the optimal estimate is computed by a Kalman-like filter, irrespective of the probability distribution of random variables. It is shown that the complexity of finding the proposed sampling strategy, in this special case, is independent of the size of the state space and the number of nodes. Examples of a sensor network, a communication network and a social network are provided.

引用

页码：2000 / 2015

页数：16

共 42 条

[1]

Alberts D. S., 1997, Tech. Rep.

[2]

[Anonymous], GAMES EC BEHAV

[3]

[Anonymous], 2012, DYNAMIC PROGRAMMING

[4]

Arabneydi Jalal, 2017, 2017 IEEE 56th Annual Conference on Decision and Control (CDC), P3125, DOI 10.1109/CDC.2017.8264116

[5]

Arabneydi J., 2016, THESIS

[6]

Arabneydi J., 2020, DEEP STRUCTURED TEAM

[7] Deep Teams: Decentralized Decision Making With Finite and Infinite Number of Agents [J].

Arabneydi, Jalal ;

Aghdam, Amir G. .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2020, 65 (10) :4230-4245

[8] A Mean-Field Team Approach to Minimize the Spread of Infection in a Network [J].

Arabneydi, Jalal ;

Aghdam, Amir G. .

2019 AMERICAN CONTROL CONFERENCE (ACC), 2019, :2747-2752

[9]

Arabneydi J, 2018, MIDWEST SYMP CIRCUIT, P809, DOI 10.1109/MWSCAS.2018.8624103

[10]

Arabneydi J, 2018, P AMER CONTR CONF, P5368, DOI 10.23919/ACC.2018.8431905

← 1 2 3 4 5 →