An ensemble method to generate high-resolution gridded population data for China from digital footprint and ancillary geospatial data

Times cited: 21
Authors
Tu, Wenna [1, 2]
Liu, Zhang [1, 3]
Du, Yunyan [1, 2]
Yi, Jiawei [1, 2]
Liang, Fuyuan [4]
Wang, Nan [1, 2]
Qian, Jiale [1, 2]
Huang, Sheng [1, 2]
Wang, Huimeng [5]
Affiliations
[1] Chinese Acad Sci, Inst Geog Sci & Nat Resources Res, State Key Lab Resources & Environm Informat Syst, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
[3] Tencent Inc, Beijing, Peoples R China
[4] Western Illinois Univ, Dept Earth Atmospher & Geog Informat Sci, Macomb, IL 61455 USA
[5] Shandong Jianzhu Univ, Sch Surveying & Geoinformat, Jinan, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Dynamic population distribution; Digital footprint; Geospatial big data; Ensemble learning; Spatial dependence; URBAN-POPULATION; LAND-COVER; PREDICTION; PLATEAU; POVERTY;
DOI
10.1016/j.jag.2022.102709
Chinese Library Classification number
TP7 [Remote sensing technology];
Discipline classification codes
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
Abstract
Fine-scale population datasets are essential to many health and development applications. Many population estimation approaches have been proposed, and multiple gridded population datasets have been produced. However, accurately estimating daily and even hourly population dynamics remains a challenge. In this study, we present an ensemble learning approach that addresses this challenge by integrating a digital footprint dataset with multiple geospatial ancillary datasets to estimate population dynamics. More specifically, we used a geographically weighted regression model to integrate two aspatial tree-based learning models and generated preliminary hourly and daily gridded population estimates. We then adjusted the fine-scale population estimates based on the county-level estimates and their nonlinear relationship with the grid-level covariates. After sufficient model training and parameter tuning, we produced a series of 0.01-degree gridded population maps (FinePop) of China for 2018, including a nationwide daily-average map and provincial hourly-average maps. FinePop is more accurate than the WorldPop and LandScan datasets, as indicated by its R² of 0.72, the highest of the three, in a comparison against township-level population census data. The root mean squared errors of the township population density estimates for FinePop, WorldPop, and LandScan are 3162, 3327, and 3423, respectively. FinePop also shows advantages in revealing transportation networks and diurnal-nocturnal population migration patterns in both small and large cities.
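The abstract describes blending two tree-based models with a geographically weighted regression and scoring the result with R² and root mean squared error. The following is a minimal illustrative sketch of that general idea, not the authors' implementation. The function names `gwr_ensemble` and `r2_rmse`, the Gaussian distance kernel, the bandwidth value, and the synthetic inputs standing in for the two base-model predictions are all assumptions made for illustration.

```python
import numpy as np


def gwr_ensemble(coords, preds, y, bandwidth):
    """Blend base-model predictions with a geographically weighted regression.

    coords: (n, 2) point locations; preds: (n, k) base-model predictions;
    y: (n,) reference values. A Gaussian kernel (an assumption of this sketch)
    weights neighbours by distance. Returns in-sample fitted values and the
    local coefficients (intercept first) estimated at every location.
    """
    n = coords.shape[0]
    X = np.column_stack([np.ones(n), preds])  # intercept + base predictions
    fitted = np.empty(n)
    coefs = np.empty((n, X.shape[1]))
    for i in range(n):
        d2 = np.sum((coords - coords[i]) ** 2, axis=1)
        w = np.exp(-0.5 * d2 / bandwidth ** 2)
        sw = np.sqrt(w)
        # Weighted least squares solved as ordinary least squares on
        # rows scaled by the square root of the kernel weights.
        beta, *_ = np.linalg.lstsq(X * sw[:, None], y * sw, rcond=None)
        coefs[i] = beta
        fitted[i] = X[i] @ beta
    return fitted, coefs


def r2_rmse(y_true, y_pred):
    """Return the coefficient of determination and root mean squared error."""
    resid = y_true - y_pred
    rmse = float(np.sqrt(np.mean(resid ** 2)))
    ss_tot = np.sum((y_true - y_true.mean()) ** 2)
    r2 = float(1.0 - np.sum(resid ** 2) / ss_tot)
    return r2, rmse


# Synthetic demonstration data (hypothetical, not from the paper).
rng = np.random.default_rng(0)
coords = rng.uniform(0, 10, size=(200, 2))
p1 = rng.uniform(0, 100, 200)   # stand-in for the first tree model's output
p2 = rng.uniform(0, 100, 200)   # stand-in for the second tree model's output
y = 0.6 * p1 + 0.4 * p2         # reference values built as a known blend

fitted, coefs = gwr_ensemble(coords, np.column_stack([p1, p2]), y, bandwidth=2.0)
r2, rmse = r2_rmse(y, fitted)
```

Because the synthetic reference values are an exact linear blend of the two inputs, the local regressions should recover that blend almost perfectly. Real population data would show spatially varying coefficients and nonzero error, and a fair comparison would score held-out units rather than the in-sample fit used here.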
Pages: 13