A 100 m gridded population dataset of China's seventh census using ensemble learning and big geospatial data

被引:13
作者
Chen, Yuehong [1 ]
Xu, Congcong [1 ]
Ge, Yong [2 ]
Zhang, Xiaoxiang [1 ]
Zhou, Ya'nan [1 ]
机构
[1] Hohai Univ, Coll Geog & Remote Sensing, Nanjing 211100, Peoples R China
[2] Chinese Acad Sci, Inst Geog Sci & Nat Resources Res, State Key Lab Resources & Environm Informat Syst, Beijing 100101, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
GLOBAL POPULATION; NIGHTTIME LIGHT; DENSITY; LEVEL;
D O I
10.5194/essd-16-3705-2024
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
China has undergone rapid urbanization and internal migration in the past few years, and its up-to-date gridded population datasets are essential for various applications. Existing datasets for China, however, suffer from either outdatedness or failure to incorporate data from the latest Seventh National Population Census of China, conducted in 2020. In this study, we develop a novel population downscaling approach that leverages stacking ensemble learning and big geospatial data to produce up-to-date population grids at a 100 m resolution for China using seventh census data at both county and town levels. The proposed approach employs stacking ensemble learning to integrate the strengths of random forest, XGBoost, and LightGBM through fusing their predictions in a training mechanism, and it delineates the inhabited areas from big geospatial data to enhance the gridded population estimation. Experimental results demonstrate that the proposed approach exhibits the best-fit performance compared to individual base models. Meanwhile, the out-of-sample town-level test set indicates that the estimated gridded population dataset (R-2=0.8936) is more accurate than existing WorldPop (R-2=0.7427) and LandScan (R-2=0.7165) products for China in 2020. Furthermore, with the inhabited area enhancement, the spatial distribution of population grids is intuitively more reasonable than the two existing products. Hence, the proposed population downscaling approach provides a valuable option for producing gridded population datasets. The estimated 100 m gridded population dataset of China holds great significance for future applications, and it is publicly available at https://doi.org/10.6084/m9.figshare.24916140.v1 (Chen et al., 2024b).
引用
收藏
页码:3705 / 3718
页数:14
相关论文
共 53 条
  • [1] Improving intelligent dasymetric mapping population density estimates at 30 m resolution for the conterminous United States by excluding uninhabited areas
    Baynes, Jeremy
    Neale, Anne
    Hultgren, Torrin
    [J]. EARTH SYSTEM SCIENCE DATA, 2022, 14 (06) : 2833 - 2849
  • [2] Fine-scale population spatialization data of China in 2018 based on real location-based big data
    Chen, Mingxing
    Xian, Yue
    Huang, Yaohuan
    Zhang, Xiaoping
    Hu, Maogui
    Guo, Shasha
    Chen, Liangkan
    Liang, Longwu
    [J]. SCIENTIFIC DATA, 2022, 9 (01)
  • [3] Mapping China's regional economic activity by integrating points-of-interest and remote sensing data with random forest
    Chen, Qian
    Ye, Tingting
    Zhao, Naizhuo
    Ding, Mingjun
    Ouyang, Zutao
    Jia, Peng
    Yue, Wenze
    Yang, Xuchao
    [J]. ENVIRONMENT AND PLANNING B-URBAN ANALYTICS AND CITY SCIENCE, 2021, 48 (07) : 1876 - 1894
  • [4] Improved GDP spatialization approach by combining land-use data and night-time light data: a case study in China's continental coastal area
    Chen, Qing
    Hou, Xiyong
    Zhang, Xiaochun
    Ma, Chun
    [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2016, 37 (19) : 4610 - 4622
  • [5] XGBoost: A Scalable Tree Boosting System
    Chen, Tianqi
    Guestrin, Carlos
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 785 - 794
  • [6] Chen Y., Remote Sens. Environ., V305, DOI [10.1016/j.rse.2024.114100,2024a, DOI 10.1016/J.RSE.2024.114100,2024A]
  • [7] Chen Y., Figshare data set, DOI [10.6084/m9.figshare.24916140.v1,2024b, DOI 10.6084/M9.FIGSHARE.24916140.V1,2024B]
  • [8] Provincial and gridded population projection for China under shared socioeconomic pathways from 2010 to 2100
    Chen, Yidan
    Guo, Fang
    Wang, Jiachen
    Cai, Wenjia
    Wang, Can
    Wang, Kaicun
    [J]. SCIENTIFIC DATA, 2020, 7 (01)
  • [9] High-Resolution Gridded Population Projections for China Under the Shared Socioeconomic Pathways
    Chen, Yimin
    Li, Xia
    Huang, Kangning
    Luo, Ming
    Gao, Minyi
    [J]. EARTHS FUTURE, 2020, 8 (06)
  • [10] Spatial Location Optimization of Fire Stations with Traffic Status and Urban Functional Areas
    Chen, Yuehong
    Wu, Guohao
    Chen, Yu
    Xia, Zelong
    [J]. APPLIED SPATIAL ANALYSIS AND POLICY, 2023, 16 (02) : 771 - 788