News and Load: A Quantitative Exploration of Natural Language Processing Applications for Forecasting Day-Ahead Electricity System Demand

被引:3
作者
Bai, Yun [1 ]
Camal, Simon [1 ]
Michiorri, Andrea [1 ]
机构
[1] PSL Univ, MINES Paris, Ctr Proc Renewable Energies & Energy Syst PERSEE, F-06904 Sophia Antipolis, France
关键词
Semantics; Predictive models; Demand forecasting; Sentiment analysis; Feature extraction; Benchmark testing; Numerical models; Electricity demand forecasting; natural language processing; population behavior; social events; TIME-SERIES; PREDICTION; TEXT;
D O I
10.1109/TPWRS.2024.3361074
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The relationship between electricity demand and weather is well established in power systems, along with the importance of behavioral and social aspects such as holidays and significant events. This study explores the link between electricity demand and more nuanced information about social events. This is done using mature Natural Language Processing (NLP) and demand forecasting techniques. The results indicate that day-ahead forecasts are improved by textual features such as word frequencies, public sentiments, topic distributions, and word embeddings. The social events contained in these features include global pandemics, politics, international conflicts, transportation, etc. Causality effects and correlations are discussed to propose explanations for the mechanisms behind the links highlighted. This study is believed to bring a new perspective to traditional electricity demand analysis. It confirms the feasibility of improving forecasts from unstructured text, with potential consequences for sociology and economics.
引用
收藏
页码:6222 / 6234
页数:13
相关论文
共 44 条
[1]   Crude oil price forecasting incorporating news text [J].
Bai, Yun ;
Li, Xixi ;
Yu, Hao ;
Jia, Suling .
INTERNATIONAL JOURNAL OF FORECASTING, 2022, 38 (01) :367-383
[2]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[3]  
Bravo-Marquez M., 2013, P 2 INT WORKSH ISS S, P1
[4]   Making words work: Using financial text as a predictor of financial events [J].
Cecchini, Mark ;
Aytug, Haldun ;
Koehler, Gary J. ;
Pathak, Praveen .
DECISION SUPPORT SYSTEMS, 2010, 50 (01) :164-175
[5]  
Chang L., 2021, News-crawler
[6]   Causal impact of masks, policies, behavior on early covid-19 pandemic in the US [J].
Chernozhukov, Victor ;
Kasahara, Hiroyuki ;
Schrimpf, Paul .
JOURNAL OF ECONOMETRICS, 2021, 220 (01) :23-62
[7]   DISTRIBUTION OF THE ESTIMATORS FOR AUTOREGRESSIVE TIME-SERIES WITH A UNIT ROOT [J].
DICKEY, DA ;
FULLER, WA .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1979, 74 (366) :427-431
[8]   Extremely randomized trees [J].
Geurts, P ;
Ernst, D ;
Wehenkel, L .
MACHINE LEARNING, 2006, 63 (01) :3-42
[9]   INVESTIGATING CAUSAL RELATIONS BY ECONOMETRIC MODELS AND CROSS-SPECTRAL METHODS [J].
GRANGER, CWJ .
ECONOMETRICA, 1969, 37 (03) :424-438
[10]   Text Preprocessing for Text Mining in Organizational Research: Review and Recommendations [J].
Hickman, Louis ;
Thapa, Stuti ;
Tay, Louis ;
Cao, Mengyang ;
Srinivasan, Padmini .
ORGANIZATIONAL RESEARCH METHODS, 2022, 25 (01) :114-146