Offense and defence against adversarial sample: A reinforcement learning method in energy trading market

Cited by: 2
Authors
Li, Donghe [1]
Yang, Qingyu [1, 2]
Ma, Linyue [3]
Peng, Zhenhua [1]
Liao, Xiao [3]
Affiliations
[1] Xi An Jiao Tong Univ, Sch Automat Sci & Engn, Xian, Peoples R China
[2] Xi An Jiao Tong Univ, State Key Lab Mfg Syst Engn, Xian, Peoples R China
[3] State Grid Informat & Telecommun Grp Co LTD, Beijing, Peoples R China
Funding
China Postdoctoral Science Foundation; National Science Foundation (USA);
Keywords
double auction; Markov decision process; reinforcement learning; adversarial example; fast gradient sign method; adversarial example detection; DEMAND RESPONSE; SMART; MANAGEMENT; MODEL; PRIVACY;
DOI
10.3389/fenrg.2022.1071973
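Chinese Library Classification number
TE [Petroleum and Natural Gas Industry]; TK [Energy and Power Engineering];
Discipline classification code
0807; 0820;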
Abstract
An energy trading market that supports free bidding among electricity users is currently a key method for smart grid demand response. Reinforcement learning is used to formulate optimal bidding strategies for these users. Nonetheless, the security problems raised by artificial intelligence technology are receiving more and more attention. For example, neural networks have been shown to be vulnerable to adversarial example attacks, which affect their training results. Because reinforcement learning is widely trained with neural networks, this security problem cannot be ignored, especially in scenarios with high security requirements such as smart grids. To this end, we study the security issues that adversarial examples pose for reinforcement learning-based bidding strategies. First, for the electric vehicle (EV) double auction market, we formalize the bidding decision problem of EVs as a Markov Decision Process so that reinforcement learning can be used to solve it. Second, from the attacker's perspective, we design a local Fast Gradient Sign Method that affects the environment and the results of reinforcement learning by changing the attacker's own bids. Third, from the defender's perspective, we design a reinforcement learning training network containing an attack identifier based on a deep neural network, which identifies malicious injection attacks in order to resist adversarial attacks. Finally, comprehensive simulations are conducted to verify the proposed methods. The results show that the proposed attack method reduces the auction profit by influencing the reinforcement learning algorithm, and that the proposed defense method is able to completely resist such attacks.
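
For readers unfamiliar with the Fast Gradient Sign Method named in the abstract, the following is a minimal sketch of the generic technique, x_adv = x + eps * sign(grad_x loss), and not the authors' implementation. The logistic stand-in model, the toy bid values, and the `mask` argument are all assumptions made for illustration. The mask is one hypothetical reading of "local": only the entry the attacker controls (its own bid) is perturbed.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def sign(v):
    # Returns -1, 0, or 1.
    return (v > 0) - (v < 0)

def loss_gradient(x, w, b, y):
    """Gradient of binary cross-entropy with respect to the input x
    for a logistic model p = sigmoid(w . x + b)."""
    p = sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)
    return [(p - y) * wi for wi in w]

def fgsm(x, w, b, y, eps, mask=None):
    """x_adv = x + eps * sign(grad_x loss), applied only where mask is 1.
    The mask is an illustrative assumption, not taken from the paper."""
    if mask is None:
        mask = [1] * len(x)
    grad = loss_gradient(x, w, b, y)
    return [xi + eps * sign(gi) * mi for xi, gi, mi in zip(x, grad, mask)]

# Hypothetical toy data: three bids seen by a stand-in model.
bids = [0.5, 0.8, 0.3]
weights = [1.0, -2.0, 0.5]

# The attacker controls only the first bid, so only that entry moves.
x_adv = fgsm(bids, weights, b=0.0, y=1, eps=0.1, mask=[1, 0, 0])
```

In this sketch the perturbation pushes the attacker's entry in the direction that increases the stand-in model's loss, while the other participants' bids are left unchanged.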
Pages: 14