Scalable coordinated management of peer-to-peer energy trading: A multi-cluster deep reinforcement learning approach

被引:105
作者
Qiu, Dawei [1 ]
Ye, Yujian [2 ]
Papadaskalopoulos, Dimitrios [1 ]
Strbac, Goran [1 ]
机构
[1] Imperial Coll London, Dept Elect & Elect Engn, London SW7 2AZ, England
[2] Southeast Univ, Sch Elect Engn, Nanjing 210096, Peoples R China
基金
英国工程与自然科学研究理事会;
关键词
Energy management; Distributed energy resources; Multi-agent deep reinforcement learning; Peer-to-peer energy trading; Smart grid;
D O I
10.1016/j.apenergy.2021.116940
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
The increasing penetration of small-scale distributed energy resources (DER) has the potential to support cost-efficient energy balancing in emerging electricity systems, but is also fundamentally affecting the conventional operation paradigm of the latter. In this context, innovative market mechanisms need to be devised to better coordinate and provide incentives for DER to utilize their flexibility. Peer-to-Peer (P2P) energy trading has emerged as an alternative approach to facilitate direct trading between consumers and prosumers interacting in an energy collective and fosters more efficient local demand-supply balancing. While previous research has primarily focused on the technical and economic benefits of P2P trading, little effort has been made towards the incorporation of prosumers' heterogeneous characteristics in the P2P trading problem. Here, we address this research gap by classifying the participating prosumers into multiple clusters with regard to their portfolio of DER, and analyzing their trading decisions in a simulated P2P trading platform. The latter employs the mid-market rate (MMR) local pricing mechanism to enable energy trading among prosumers and penalizes the contribution to the system demand peak of each prosumer. We formulate the P2P trading problem as a multi-agent coordination problem and propose a novel multi-agent deep reinforcement learning (MADRL) method to address it. The proposed method is founded on the combination of the multi-agent deep deterministic policy gradient (MADDPG) algorithm and the technique of parameter sharing (PS), which not only enables accelerating the training speed by sharing experiences and learned policies between all agents in each cluster, but also sustains the policies' diversity between multiple clusters. To address the non-stationarity and computational complexity of MADRL as well as persevering the privacy of prosumers, the P2P trading platform acts as a trusted third party which augments the market collective trading information to help training of prosumer agents. Experiments with a large-scale real-world data-set involving 300 residential households demonstrate that the proposed MADRL method exhibits a strong generalization capability in the test data-set and outperforms the state-of-the-art MADRL methods with regard to the system operation cost, demand peak as well as computational time.
引用
收藏
页数:16
相关论文
共 50 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]   Peer-to-peer energy trading among smart homes [J].
Alam, Muhammad Raisul ;
St-Hilaire, Marc ;
Kunz, Thomas .
APPLIED ENERGY, 2019, 238 :1434-1443
[3]  
Alfonso C, 2020, ENERGY BUILD, V224, P243
[4]  
[Anonymous], 2017, Rainfall and Temperature Forecast and Observations-Verification 2016-05 to 2017-04
[5]   A multi-agent based energy management solution for integrated buildings and microgrid system [J].
Anvari-Moghaddam, Amjad ;
Rahimi-Kian, Ashkan ;
Mirian, Maryam S. ;
Guerrero, Josep M. .
APPLIED ENERGY, 2017, 203 :41-56
[6]  
Ausgrid, 2020, Time of Use Pricing
[7]   Automated peer-to-peer negotiation for energy contract settlements in residential cooperatives [J].
Chakraborty, Shantanu ;
Baarslag, Tim ;
Kaisers, Michael .
APPLIED ENERGY, 2020, 259
[8]   Local Energy Trending Behavior Modeling With Deep Reinforcement Learning [J].
Chen, Tao ;
Su, Wencong .
IEEE ACCESS, 2018, 6 :62806-62814
[9]   Energy Consumption Scheduling of HVAC Considering Weather Forecast Error Through the Distributionally Robust Approach [J].
Du, Y. F. ;
Jiang, L. ;
Duan, C. ;
Li, Y. Z. ;
Smith, J. S. .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2018, 14 (03) :846-857
[10]  
EnergyAustralia, 2020, Solar Rebates and Feed-in Tariffs