Official Statistics Data Integration for Enhanced Information Quality

被引:14
|
作者
Dalla Valle, Luciana [1 ]
Kenett, Ron S. [2 ,3 ,4 ]
机构
[1] Univ Plymouth, Stat, Plymouth PL4 8AA, Devon, England
[2] KPA Ltd, Tel Aviv, Israel
[3] Univ Turin, Turin, Italy
[4] NYU Poly, Ctr Risk Engn, New York, NY USA
关键词
information quality (InfoQ); data integration; Bayesian networks; administrative data; official statistics;
D O I
10.1002/qre.1859
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
This work is about integrated analysis of data collected as official statistics with administrative data from operational systems in order to increase the quality of information. Information quality, or InfoQ, is the potential of a data set to achieve a specific goal by using a given empirical analysis method'. InfoQ is based on the identification of four interacting components: the analysis goal, the data, the data analysis and the utility, and it is assessed through eight dimensions: data resolution, data structure, data integration, temporal relevance, generalizability, chronology of data and goal, construct operationalization and communication. The paper illustrates, through case studies, a novel strategy to increase InfoQ based on the integration of official statistics with administrative data using copulas and Bayesian Networks. Official statistics are extraordinary sources of information. However, because of temporal relevance and chronology of data and goals, these fundamental sources of information are often not properly leveraged resulting in a poor level of InfoQ in the use of official statistics. This leads to low valued statistical analyses and to the lack of sufficiently informative results. By improving temporal relevance and chronology of data and goals, the use of Bayesian Networks allows us to calibrate official with administrative data, thus strengthening the quality of the information derived from official surveys, and, overall, enhancing InfoQ. We show, with examples, how to design and implement such a calibration strategy. Copyright (c) 2015 John Wiley & Sons, Ltd.
引用
收藏
页码:1281 / 1300
页数:20
相关论文
共 50 条
  • [41] Making use of official statistics in research on gender and health status: recent British data
    Dunnell, K
    Fitzpatrick, J
    Bunting, J
    SOCIAL SCIENCE & MEDICINE, 1999, 48 (01) : 117 - 127
  • [42] Usage of R in Official Statistics - Survey Data Analysis at the Statistical Office of the Republic of Slovenia
    Pikelj, Jerneja
    ROMANIAN STATISTICAL REVIEW, 2015, (02) : 96 - 103
  • [43] Integrating probability and big non-probability samples data to produce Official Statistics
    Golini, Natalia
    Righi, Paolo
    STATISTICAL METHODS AND APPLICATIONS, 2024, 33 (02) : 555 - 580
  • [44] Enhanced Data Integration for LabVIEW Laboratory Systems
    Olteanu, Adriana
    Stamatescu, Grigore
    Ionita, Anca Daniela
    Sgarciu, Valentin
    2013 8TH INTERNATIONAL SYMPOSIUM ON ADVANCED TOPICS IN ELECTRICAL ENGINEERING (ATEE), 2013,
  • [45] Data Integration Technology of Industrial Information Systems
    Pavlenko, Petro
    Shendryk, Vira
    Balushok, Kostyantyn
    Doroshenko, Stanislav
    ADVANCES IN DESIGN, SIMULATION AND MANUFACTURING II, 2020, : 248 - 258
  • [46] Data integration for information technology infrastructure in mining
    Dessureault, S
    Porter, J
    Woodhall, M
    CIM BULLETIN, 2004, 97 (1076): : 49 - 56
  • [47] Data source selection for information integration in big data era
    Lin, Yiming
    Wang, Hongzhi
    Li, Jianzhong
    Gao, Hong
    INFORMATION SCIENCES, 2019, 479 : 197 - 213
  • [48] A Review of Big Data and Machine Learning Operations in Official Statistics: MLOps and Feature Store Adoption
    Ramos Nunes, Carlos Eduardo
    Ashofteh, Afshin
    2024 IEEE 48TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE, COMPSAC 2024, 2024, : 711 - 718
  • [49] Automation of Publications in Official Statistics using R
    Schulz, Guido
    ROMANIAN STATISTICAL REVIEW, 2018, (01) : 31 - 44
  • [50] Official statistics, public policy and public trust
    Holt, D. Tim
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2008, 171 : 323 - 346