Blending daily satellite precipitation product and rain gauges using stacking ensemble machine learning with the consideration of spatial heterogeneity

被引:0
作者
Chen, Chuanfa [1 ]
Hao, Jinda [1 ]
Yang, Shufan [1 ]
Li, Yanyan [1 ]
机构
[1] Shandong Univ Sci & Technol, Coll Geodesy & Geomat, Qingdao 266590, Peoples R China
基金
中国国家自然科学基金;
关键词
Precipitation; Spatial heterogeneity; Stacking ensemble learning; Merging; GEOGRAPHICALLY WEIGHTED REGRESSION; INTERPOLATION; IMERG;
D O I
10.1016/j.jhydrol.2025.133223
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Blending satellite precipitation products (SPPs) with rain gauge observations through machine learning (ML)based methods offers a proficient means of achieving high-accuracy precipitation data. However, traditional ML methods often neglect the spatial heterogeneity of precipitation across the study area, and the unique strengths of individual ML models remain underutilized. To address these challenges, this paper proposes a stacking ensemble learning approach that accounts for spatial heterogeneity for blending SPPs with rain gauge data to produce highly accurate precipitation estimates. Specifically, the study area is segmented into several homogeneous zones to mitigate spatial heterogeneity, with each grid cell within these zones assigned a uniform identifier (ID). Furthermore, a stacking ensemble ML framework which takes the ID as an input feature is developed to merge SPPs and rain gauge observations. To evaluate the performance of our proposed method, we blended daily IMERG data and rain gauge observations spanning from 2016 to 2020 across the Chinese mainland, benchmarking it against seven ML methods and the original IMERG data. The experimental results provide several key insights: (i) Data-driven adaptive clustering emerges as an efficient tool for addressing the challenge of spatial heterogeneity in high-quality precipitation estimation. (ii) Across multiple temporal scales, the proposed method outperforms the classical ML-based methods. Notably, at the daily scale, it improves upon the classical approaches by at least 2.4 % in Mean Absolute Error (MAE), 0.76 % in Root Mean Square Error (RMSE), 1.4 % in Correlation Coefficient (CC), and 1.4 % in Kling-Gupta Efficiency (KGE). Furthermore, at the monthly and seasonal scales, it reduces MAE by at least 2.3 % and 2.8 %, respectively, and enhances KGE by at least 0.9 % and 1.1 %. (iii) The spatial distribution of precipitation estimated by the proposed method aligns more closely with rain gauge observations compared to the classical methods. (iv) The ID feature plays a crucial role in precipitation estimation, ranking first and second in terms of feature importance for 39.6 % and 33.9 % of days, respectively, over the five-year period. (v) The proposed method generates positive incremental values at 69 % of rain gauge stations, demonstrating greater added value compared to the classical methods. Overall, the proposed method can be regarded as an effective tool for generating high-accuracy daily precipitation products.
引用
收藏
页数:16
相关论文
共 81 条
  • [1] Explainable artificial intelligence (XAI) for interpreting the contributing factors feed into the wildfire susceptibility prediction model
    Abdollahi, Abolfazl
    Pradhan, Biswajeet
    [J]. SCIENCE OF THE TOTAL ENVIRONMENT, 2023, 879
  • [2] PERSIANN-CDR Daily Precipitation Climate Data Record from Multisatellite Observations for Hydrological and Climate Studies
    Ashouri, Hamed
    Hsu, Kuo-Lin
    Sorooshian, Soroosh
    Braithwaite, Dan K.
    Knapp, Kenneth R.
    Cecil, L. Dewayne
    Nelson, Brian R.
    Prat, Olivier P.
    [J]. BULLETIN OF THE AMERICAN METEOROLOGICAL SOCIETY, 2015, 96 (01) : 69 - +
  • [3] RF-MEP: A novel Random Forest method for merging gridded precipitation products and ground-based measurements
    Baez-Villanueva, Oscar M.
    Zambrano-Bigiarini, Mauricio
    Beck, Hylke E.
    McNamara, Ian
    Ribbe, Lars
    Nauditt, Alexandra
    Birkel, Christian
    Verbist, Koen
    Giraldo-Osorio, Juan Diego
    Nguyen Xuan Thinh
    [J]. REMOTE SENSING OF ENVIRONMENT, 2020, 239
  • [4] Blending long-term satellite-based precipitation data with gauge observations for drought monitoring: Considering effects of different gauge densities
    Bai, Xiaoyan
    Wu, Xiaoqing
    Wang, Peng
    [J]. JOURNAL OF HYDROLOGY, 2019, 577
  • [5] Global-scale evaluation of 22 precipitation datasets using gauge observations and hydrological modeling
    Beck, Hylke E.
    Vergopolan, Noemi
    Pan, Ming
    Levizzani, Vincenzo
    van Dijk, Albert I. J. M.
    Weedon, Graham P.
    Brocca, Luca
    Pappenberger, Florian
    Huffman, George J.
    Wood, Eric F.
    [J]. HYDROLOGY AND EARTH SYSTEM SCIENCES, 2017, 21 (12) : 6201 - 6217
  • [6] Machine Learning-Based Blending of Satellite and Reanalysis Precipitation Datasets: A Multiregional Tropical Complex Terrain Evaluation
    Bhuiyan, Md Abul Ehsan
    Nikolopoulos, Efthymios, I
    Anagnostou, Emmanouil N.
    [J]. JOURNAL OF HYDROMETEOROLOGY, 2019, 20 (11) : 2147 - 2161
  • [7] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [8] Geographically weighted regression: A method for exploring spatial nonstationarity
    Brunsdon, C
    Fotheringham, AS
    Charlton, ME
    [J]. GEOGRAPHICAL ANALYSIS, 1996, 28 (04) : 281 - 298
  • [9] Evaluation of interpolation techniques for the creation of gridded daily precipitation ( 1 x 1 km2); Cyprus, 1980-2010
    Camera, Corrado
    Bruggeman, Adriana
    Hadjinicolaou, Panos
    Pashiardis, Stelios
    Lange, Manfred A.
    [J]. JOURNAL OF GEOPHYSICAL RESEARCH-ATMOSPHERES, 2014, 119 (02) : 693 - 712
  • [10] Fast computation of cluster validity measures for bregman divergences and benefits
    Capo, Marco
    Perez, Aritz
    Lozano, Jose A.
    [J]. PATTERN RECOGNITION LETTERS, 2023, 170 : 100 - 105