Official Statistics Data Integration for Enhanced Information Quality

Cited by: 14
Authors
Dalla Valle, Luciana [1]
Kenett, Ron S. [2, 3, 4]
Affiliations
[1] Univ Plymouth, Stat, Plymouth PL4 8AA, Devon, England
[2] KPA Ltd, Tel Aviv, Israel
[3] Univ Turin, Turin, Italy
[4] NYU Poly, Ctr Risk Engn, New York, NY USA
Keywords
information quality (InfoQ); data integration; Bayesian networks; administrative data; official statistics
DOI
10.1002/qre.1859
Chinese Library Classification (CLC) number
T [Industrial Technology]
Discipline classification code
08
Abstract
This work concerns the integrated analysis of data collected as official statistics with administrative data from operational systems, with the aim of increasing the quality of information. Information quality, or InfoQ, is 'the potential of a data set to achieve a specific goal by using a given empirical analysis method'. InfoQ is based on the identification of four interacting components: the analysis goal, the data, the data analysis and the utility. It is assessed through eight dimensions: data resolution, data structure, data integration, temporal relevance, generalizability, chronology of data and goal, construct operationalization and communication. The paper illustrates, through case studies, a novel strategy to increase InfoQ based on the integration of official statistics with administrative data using copulas and Bayesian Networks. Official statistics are extraordinary sources of information. However, because of temporal relevance and chronology of data and goals, these fundamental sources of information are often not properly leveraged, resulting in a poor level of InfoQ in the use of official statistics. This leads to low-value statistical analyses and to results that are not sufficiently informative. By improving temporal relevance and chronology of data and goals, the use of Bayesian Networks allows us to calibrate official with administrative data, thus strengthening the quality of the information derived from official surveys and, overall, enhancing InfoQ. We show, with examples, how to design and implement such a calibration strategy. Copyright (c) 2015 John Wiley & Sons, Ltd.
Pages: 1281-1300
Page count: 20