Prompting large language model for multi-location multi-step zero-shot wind power forecasting

被引:0
作者
Duan, Zhiyu [1 ]
Bian, Chong [2 ]
Yang, Shunkun [1 ]
Li, Chunping [3 ]
机构
[1] Beihang Univ, Sch Reliabil & Syst Engn, Beijing 100191, Peoples R China
[2] China Mobile Res Inst, Artificial Intelligence & Intelligent Operat Ctr, Beijing 100053, Peoples R China
[3] Tsinghua Univ, Sch Software, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
Large language model; Prompt tuning; Parameter efficient fine tuning; Multi-step multi-location wind power fore-; casting; Zero-shot learning; NEURAL-NETWORK; PREDICTION; REGRESSION;
D O I
10.1016/j.eswa.2025.127436
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Wind power forecasting (WPF) is essential for the safe integration of renewable energy into the power grid. Despite the emergence of various predictors, they still face two challenges: 1) the predetermined forecasting horizon lacks the capabilities to make predictions over varying time steps, hindering flexible short- and long-term applications; 2) the chaotic nature of wind speeds under dynamic geographic and atmospheric conditions introduces mutation at certain time that complicates prediction and leads to large errors. To address these issues, a hard-soft hybrid prompt learning method is proposed to harness the potential of a large language model (LLM) for WPF. A hard prompt generator is proposed to redefine WPF as a language modeling task. It can leverage the powerful representation learning ability of LLM to capture inherent temporal features and reveal mutations within the WP data for accurate forecasts. A soft-prompt adapter with gated attention is proposed to align LLM with the WPF context via parameter efficient tuning. It can grasp the intrinsic spatial-temporal characteristics in and across wind farms with low computational complexity. Moreover, a hard-soft prompt fusion mechanism is designed to incorporate WPF-specific task information into the LLM, further triggering the zero-shot learning ability of LLM to achieve flexible prediction over varying forecasting horizons. Extensive experiments on wind farms distributed in multiple regions demonstrated that our method can flexibly perform both short- and longterm WPF tasks and robustly predict mutation events. It also outperforms the state-of-the-art deep predictors regarding generalization and accuracy.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] A New Combinatory Approach for Wind Power Forecasting
    Abedinia, Oveis
    Bagheri, Mehdi
    Naderi, Mohammad Salay
    Ghadimi, Noradin
    [J]. IEEE SYSTEMS JOURNAL, 2020, 14 (03): : 4614 - 4625
  • [2] Babazadeh H., 2012, Power Electronics and Machines in Wind Applications (PEMWA), 2012 IEEE: 16-18 July, P1, DOI [10.1109/PEMWA.2012.6316394, DOI 10.1109/PEMWA.2012.6316394]
  • [3] Bo G., 2017, Hybrid PSO-BP Neural Network Approach for Wind Power Forecasting
  • [4] Chang C, 2023, Arxiv, DOI arXiv:2308.08469
  • [5] Learning Heterogeneous Features Jointly: A Deep End-to-End Framework for Multi-Step Short-Term Wind Power Prediction
    Chen, Jinfu
    Zhu, Qiaomu
    Li, Hongyi
    Zhu, Lin
    Shi, Dongyuan
    Li, Yinhong
    Duan, Xianzhong
    Liu, Yilu
    [J]. IEEE TRANSACTIONS ON SUSTAINABLE ENERGY, 2020, 11 (03) : 1761 - 1772
  • [6] Multifactor spatio-temporal correlation model based on a combination of convolutional neural network and long short-term memory neural network for wind speed forecasting
    Chen, Yong
    Zhang, Shuai
    Zhang, Wenyu
    Peng, Juanjuan
    Cai, Yishuai
    [J]. ENERGY CONVERSION AND MANAGEMENT, 2019, 185 : 783 - 799
  • [7] Wind Power Prediction Based on Multi-class Autoregressive Moving Average Model with Logistic Function
    Dong, Yunxuan
    Ma, Shaodan
    Zhang, Hongcai
    Yang, Guanghua
    [J]. JOURNAL OF MODERN POWER SYSTEMS AND CLEAN ENERGY, 2022, 10 (05) : 1184 - 1193
  • [8] M2GSNet: Multi-Modal Multi-Task Graph Spatiotemporal Network for Ultra-Short-Term Wind Farm Cluster Power Prediction
    Fan, Hang
    Zhang, Xuemin
    Mei, Shengwei
    Chen, Kunjin
    Chen, Xinyang
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (21): : 1 - 15
  • [9] Garza A, 2024, Arxiv, DOI [arXiv:2310.03589, 10.48550/arXiv.2310.03589, DOI 10.48550/ARXIV.2310.03589]
  • [10] LLM-Based Code Generation Method for Golang Compiler Testing
    Gu, Qiuhan
    [J]. PROCEEDINGS OF THE 31ST ACM JOINT MEETING EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, ESEC/FSE 2023, 2023, : 2201 - 2203