A machine learning-based analysis of 311 requests in the Miami-Dade County

Cited by: 4
Authors
Cheng, Shaoming [1 ]
Ganapati, Sukumar [1 ]
Narasimhan, Giri [2 ]
Yusuf, Farzana Beente [2 ]
Affiliations
[1] Florida Int Univ, Dept Publ Policy & Adm, Miami, FL 33199 USA
[2] Florida Int Univ, Sch Comp & Informat Sci, Miami, FL USA
Funding
U.S. National Science Foundation
Keywords
PUBLIC-SERVICES; OPEN GOVERNMENT; COPRODUCTION; CLASSIFICATION; PARTICIPATION;
DOI
10.1111/grow.12578
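Chinese Library Classification (CLC) number
F0 [Economics]; F1 [World economic conditions, economic history, economic geography]; C [Social sciences, general];
Discipline classification codes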
0201 ; 020105 ; 03 ; 0303 ;
Abstract
This paper illustrates the application of machine learning algorithms in predictive analytics for local governments using administrative data. The developed and tested machine learning predictive algorithm overcomes known limitations of the conventional ordinary least squares method. These limitations include, but are not limited to, imposed linearity, presumed causality (with independent variables as presumed causes and dependent variables as presumed results), likely high multicollinearity among features, and spatial autocorrelation. The study applies the algorithms to 311 non-emergency service requests in Miami-Dade County. The algorithms are applied to predict the volume of 311 service requests and to identify the community characteristics affecting that volume across Census tract neighborhoods. Four common families of algorithms, and an ensemble of them, are applied: random forest, support vector machines, lasso and elastic-net regularized generalized linear models, and extreme gradient boosting. Two feature selection methods, Boruta and fscaret, are applied to identify the significant community characteristics. The results show that the machine learning algorithms capture spatial autocorrelation and clustering. The features generated by the fscaret algorithms are parsimonious in predicting the 311 service request volume.
Pages: 1627-1645
Page count: 19