A distributed approach to meteorological predictions: addressing data imbalance in precipitation prediction models through federated learning and GANs

被引:3
作者
Jafarigol, Elaheh [1 ]
Trafalis, Theodore B. [2 ]
机构
[1] Univ Oklahoma, Data Sci & Analyt Inst, 202 W Boyd St, Room 409, Norman, OK 73019 USA
[2] Univ Oklahoma, Ind & Syst Engn, 202 W Boyd St, Room 104, Norman, OK 73019 USA
关键词
Imbalanced learning; Federated learning; Deep learning; Generative Adversarial Networks; Weather prediction; NEURAL-NETWORK; CLASSIFICATION; PERFORMANCE; CHALLENGES; IMPACT; SMOTE; AREA;
D O I
10.1007/s10287-024-00504-3
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
The classification of weather data involves categorizing meteorological phenomena into classes, thereby facilitating nuanced analyses and precise predictions for various sectors such as agriculture, aviation, and disaster management. This involves utilizing machine learning models to analyze large, multidimensional weather datasets for patterns and trends. These datasets may include variables such as temperature, humidity, wind speed, and pressure, contributing to meteorological conditions. Furthermore, it's imperative that classification algorithms proficiently navigate challenges such as data imbalances, where certain weather events (e.g., storms or extreme temperatures) might be underrepresented. This empirical study explores data augmentation methods to address imbalanced classes in tabular weather data in centralized and federated settings. Employing data augmentation techniques such as the Synthetic Minority Over-sampling Technique or Generative Adversarial Networks can improve the model's accuracy in classifying rare but critical weather events. Moreover, with advancements in federated learning, machine learning models can be trained across decentralized databases, ensuring privacy and data integrity while mitigating the need for centralized data storage and processing. Thus, the classification of weather data stands as a critical bridge, linking raw meteorological data to actionable insights, enhancing our capacity to anticipate and prepare for diverse weather conditions.
引用
收藏
页数:23
相关论文
共 83 条
[1]   Deep Learning with Differential Privacy [J].
Abadi, Martin ;
Chu, Andy ;
Goodfellow, Ian ;
McMahan, H. Brendan ;
Mironov, Ilya ;
Talwar, Kunal ;
Zhang, Li .
CCS'16: PROCEEDINGS OF THE 2016 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2016, :308-318
[2]  
Abd Elrahman S.M., 2013, Journal of Network and Innovative Computing, V1, P332
[3]  
Arjovsky M, 2017, PR MACH LEARN RES, V70
[4]  
Aydin M A, 2021, 2021 29 SIGN PROC CO
[5]  
Bao F, 2019, IEEE Trans Neural Netw Learn Syst
[6]  
Bekkar M., 2013, J Inf Eng Appl, V3, P27, DOI DOI 10.5121/IJDKP.2013.3402
[7]   The use of the area under the roc curve in the evaluation of machine learning algorithms [J].
Bradley, AP .
PATTERN RECOGNITION, 1997, 30 (07) :1145-1159
[8]  
Chawla NV, 2010, DATA MINING AND KNOWLEDGE DISCOVERY HANDBOOK, SECOND EDITION, P875, DOI 10.1007/978-0-387-09823-4_45
[9]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[10]   SMOTEBoost: Improving prediction of the minority class in boosting [J].
Chawla, NV ;
Lazarevic, A ;
Hall, LO ;
Bowyer, KW .
KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2003, PROCEEDINGS, 2003, 2838 :107-119