A 100 m gridded population dataset of China's seventh census using ensemble learning and big geospatial data

被引:13
作者
Chen, Yuehong [1 ]
Xu, Congcong [1 ]
Ge, Yong [2 ]
Zhang, Xiaoxiang [1 ]
Zhou, Ya'nan [1 ]
机构
[1] Hohai Univ, Coll Geog & Remote Sensing, Nanjing 211100, Peoples R China
[2] Chinese Acad Sci, Inst Geog Sci & Nat Resources Res, State Key Lab Resources & Environm Informat Syst, Beijing 100101, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
GLOBAL POPULATION; NIGHTTIME LIGHT; DENSITY; LEVEL;
D O I
10.5194/essd-16-3705-2024
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
China has undergone rapid urbanization and internal migration in the past few years, and its up-to-date gridded population datasets are essential for various applications. Existing datasets for China, however, suffer from either outdatedness or failure to incorporate data from the latest Seventh National Population Census of China, conducted in 2020. In this study, we develop a novel population downscaling approach that leverages stacking ensemble learning and big geospatial data to produce up-to-date population grids at a 100 m resolution for China using seventh census data at both county and town levels. The proposed approach employs stacking ensemble learning to integrate the strengths of random forest, XGBoost, and LightGBM through fusing their predictions in a training mechanism, and it delineates the inhabited areas from big geospatial data to enhance the gridded population estimation. Experimental results demonstrate that the proposed approach exhibits the best-fit performance compared to individual base models. Meanwhile, the out-of-sample town-level test set indicates that the estimated gridded population dataset (R-2=0.8936) is more accurate than existing WorldPop (R-2=0.7427) and LandScan (R-2=0.7165) products for China in 2020. Furthermore, with the inhabited area enhancement, the spatial distribution of population grids is intuitively more reasonable than the two existing products. Hence, the proposed population downscaling approach provides a valuable option for producing gridded population datasets. The estimated 100 m gridded population dataset of China holds great significance for future applications, and it is publicly available at https://doi.org/10.6084/m9.figshare.24916140.v1 (Chen et al., 2024b).
引用
收藏
页码:3705 / 3718
页数:14
相关论文
共 53 条
  • [21] Improving the accuracy of extant gridded population maps using multisource map fusion
    Gao, Peng
    Wu, Tianjun
    Ge, Yong
    Li, Zihan
    [J]. GISCIENCE & REMOTE SENSING, 2022, 59 (01) : 54 - 70
  • [22] Spatiotemporal dynamics of population density in China using nighttime light and geographic weighted regression method
    Guo, Wei
    Liu, Jinke
    Zhao, Xuesheng
    Hou, Wei
    Zhao, Yunxuan
    Li, Yongxing
    Sun, Wenbin
    Fan, Deqin
    [J]. INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2023, 16 (01) : 2704 - 2723
  • [23] Combining Luojia1-01 Nighttime Light and Points-of-Interest Data for Fine Mapping of Population Spatialization Based on the Zonal Classification Method
    Guo, Wei
    Zhang, Jinyu
    Zhao, Xuesheng
    Li, Yongxing
    Liu, Jinke
    Sun, Wenbin
    Fan, Deqin
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 1589 - 1600
  • [24] Ke GL, 2017, ADV NEUR IN, V30
  • [25] Population distribution modelling at fine spatio-temporal scale based on mobile phone data
    Kubicek, Petr
    Konecny, Milan
    Stachon, Zdenek
    Shen, Jie
    Herman, Lukas
    Reznik, Tomas
    Stanek, Karel
    Stampach, Radim
    Leitgeb, Simon
    [J]. INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2019, 12 (11) : 1319 - 1340
  • [26] The spatial allocation of population: a review of large-scale gridded population data products and their fitness for use
    Leyk, Stefan
    Gaughan, Andrea E.
    Adamo, Susana B.
    de Sherbinin, Alex
    Balk, Deborah
    Freire, Sergio
    Rose, Amy
    Stevens, Forrest R.
    Blankespoor, Brian
    Frye, Charlie
    Comenetz, Joshua
    Sorichetta, Alessandro
    MacManus, Kytt
    Pistolesi, Linda
    Levy, Marc
    Tatem, Andrew J.
    Pesaresi, Martino
    [J]. EARTH SYSTEM SCIENCE DATA, 2019, 11 (03) : 1385 - 1409
  • [27] A high resolution spatial population database of Somalia for disease risk mapping
    Linard C.
    Alegana V.A.
    Noor A.M.
    Snow R.W.
    Tatem A.J.
    [J]. International Journal of Health Geographics, 9 (1)
  • [28] Population spatialization in Zhengzhou city based on multi-source data and random forest model
    Liu, Lingling
    Cheng, Gang
    Yang, Jie
    Cheng, Yushu
    [J]. FRONTIERS IN EARTH SCIENCE, 2023, 11
  • [29] Estimating population and urban areas at risk of coastal hazards, 1990-2015: how data choices matter
    MacManus, Kytt
    Balk, Deborah
    Engin, Hasim
    McGranahan, Gordon
    Inman, Rya
    [J]. EARTH SYSTEM SCIENCE DATA, 2021, 13 (12) : 5747 - 5801
  • [30] Global landslide and avalanche hotspots
    Nadim, Farrokh
    Kjekstad, Oddvar
    Peduzzi, Pascal
    Herold, Christian
    Jaedicke, Christian
    [J]. LANDSLIDES, 2006, 3 (02) : 159 - 173