Anomaly Detection Under Controlled Sensing Using Actor-Critic Reinforcement Learning

被引:9
作者
Joseph, Geethu [1 ]
Gursoy, M. Cenk [1 ]
Varshney, Pramod K. [1 ]
机构
[1] Syracuse Univ, Dept Elect Engn & Comp Sci, Syracuse, NY 13244 USA
来源
PROCEEDINGS OF THE 21ST IEEE INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING ADVANCES IN WIRELESS COMMUNICATIONS (IEEE SPAWC2020) | 2020年
基金
美国国家科学基金会;
关键词
Active hypothesis testing; reinforcement learning; optimal sequential selection; quickest state estimation;
D O I
10.1109/spawc48557.2020.9154275
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We consider the problem of detecting anomalies among a given set of processes using their noisy binary sensor measurements. The noiseless sensor measurement corresponding to a normal process is 0, and the measurement is 1 if the process is anomalous. The decision-making algorithm is assumed to have no knowledge of the number of anomalous processes. The algorithm is allowed to choose a subset of the sensors at each time instant until the confidence level on the decision exceeds the desired value. Our objective is to design a sequential sensor selection policy that dynamically determines which processes to observe at each time and when to terminate the detection algorithm. The selection policy is designed such that the anomalous processes are detected with the desired confidence level while incurring minimum cost which comprises the delay in detection and the cost of sensing. We cast this problem as a sequential hypothesis testing problem within the framework of Markov decision processes, and solve it using the actor-critic deep reinforcement learning algorithm. This deep neural network-based algorithm offers a low complexity solution with good detection accuracy. We also study the effect of statistical dependence between the processes on the algorithm performance. Through numerical experiments, we show that our algorithm is able to adapt to any unknown statistical dependence pattern of the processes.
引用
收藏
页数:5
相关论文
共 11 条
  • [1] Bessler S. A., 1960, Tech. Rep.
  • [2] Enhanced Remote Control Providing Medical Functionalities
    Bujnowski, Adam
    Ruminski, Jacek
    Palinski, Arkadiusz
    Wtrorek, Jerzy
    [J]. PROCEEDINGS OF THE 2013 7TH INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING TECHNOLOGIES FOR HEALTHCARE AND WORKSHOPS (PERVASIVEHEALTH 2013), 2013, : 290 - 293
  • [3] SEQUENTIAL DESIGN OF EXPERIMENTS
    CHERNOFF, H
    [J]. ANNALS OF MATHEMATICAL STATISTICS, 1959, 30 (03): : 755 - 770
  • [4] Remote monitoring system with wireless sensors module for room environment
    Chung, WY
    Oh, SJ
    [J]. SENSORS AND ACTUATORS B-CHEMICAL, 2006, 113 (01) : 64 - 70
  • [5] Active Anomaly Detection in Heterogeneous Processes
    Huang, Boshuang
    Cohen, Kobi
    Zhao, Qing
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2019, 65 (04) : 2284 - 2301
  • [6] Kartik D, 2018, ANN ALLERTON CONF, P741, DOI 10.1109/ALLERTON.2018.8636086
  • [7] Krishnamurthy V., 2016, Partially Observed Markov Decision Processes, DOI DOI 10.1017/CBO9781316471104
  • [8] ACTIVE SEQUENTIAL HYPOTHESIS TESTING
    Naghshvar, Mohammad
    Javidi, Tara
    [J]. ANNALS OF STATISTICS, 2013, 41 (06) : 2703 - 2738
  • [9] Controlled Sensing for Multihypothesis Testing
    Nitinawarat, Sirin
    Atia, George K.
    Veeravalli, Venugopal V.
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2013, 58 (10) : 2451 - 2464
  • [10] Sutton RS, 2018, ADAPT COMPUT MACH LE, P1