Dynamic data science and official statistics

Cited by: 5
Authors
Thompson, Mary E. [1]
Affiliations
[1] Univ Waterloo, Stat & Actuarial Sci, Waterloo, ON N2L 3G1, Canada
Source
CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE | 2018 / Vol. 46 / No. 01
Funding
Natural Sciences and Engineering Research Council of Canada
Keywords
Combining data sources; Dimension reduction; large-scale data; recursive methods; visualization; CROP YIELD; BIG DATA; MODEL; REGULARIZATION; DESIGN;
DOI
10.1002/cjs.11322
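Chinese Library Classification (CLC) number
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics];
Discipline classification code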
020208 ; 070103 ; 0714 ;
Abstract
Many of the challenges and opportunities of data science have to do with dynamic factors: a growing volume of administrative and commercial data on individuals and establishments, continuous flows of data and the capacity to analyze and summarize them in real time, and the necessity for resources to maintain them. With its emphasis on data quality and supportable results, the practice of Official Statistics faces a variety of statistical and data science issues. This article discusses the importance of population frames and their maintenance; the potential for use of multi-frame methods and linkages; how the use of large-scale non-survey data may shape the objects of inference; the complexity of models for large data sets; the importance of recursive methods and regularization; and the benefits of sophisticated spatial visualization tools in capturing spatial variation and temporal change. The Canadian Journal of Statistics 46: 10-23; 2018 © 2017 Statistical Society of Canada
Pages: 10-23
Number of pages: 14