Moment-based density estimation of confidential micro-data: a computational statistics approach

被引:1
|
作者
Wakefield, Bradley [1 ]
Lin, Yan-Xia [1 ]
Sarathy, Rathin [2 ]
Muralidhar, Krishnamurty [3 ]
机构
[1] Univ Wollongong, Natl Inst Appl Stat Res Australia, Northfields Ave, Wollongong, NSW 2522, Australia
[2] Oklahoma State Univ, Spears Sch Business, Dept Management Sci & Informat Syst, Stillwater, OK 74078 USA
[3] Univ Oklahoma, Mkt & Supply Chain Management, Norman, OK 73019 USA
关键词
Density estimation; Confidential data; Moment problems; Multidimensional approximations; COPULAS;
D O I
10.1007/s11222-022-10203-1
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Providing access to synthetic micro-data in place of confidential data to protect the privacy of participants is common practice. For the synthetic data to be useful for analysis, it is necessary that the density function of the synthetic data closely approximate the confidential data. Hence, accurately estimating the density function based on sample micro-data is important. Existing kernel-based, copula-based, and machine learning methods of joint density estimation may not be viable. Applying the multivariate moments' problem to sample-based density estimation has long been considered impractical due to the computational complexity and intractability of optimal parameter selection of the density estimate when the true joint density function is unknown. This paper introduces a generalised form of the sample moment-based density estimate, which can be used to estimate joint density functions when only the information of empirical moments is available. We demonstrate optimal parametrisation of the moment-based density estimate based solely on sample data by employing a computational strategy for parameter selection. We compare the performance of the moment-based estimate to that of existing non-parametric and parametric density estimation methods. The results show that using empirical moments can provide a reasonable, robust non-parametric approximation of a joint density function that is comparable to existing non-parametric methods. We provide an example of synthetic data generation from the moment-based density estimate and show that the resulting synthetic data provides a reasonable disclosure-protected alternative for public release.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] Performance Analysis of Moment-based Blind SNR Estimation Algorithm
    Lu Manjun
    Si Xicai
    Yu Zhiming
    ITESS: 2008 PROCEEDINGS OF INFORMATION TECHNOLOGY AND ENVIRONMENTAL SYSTEM SCIENCES, PT 1, 2008, : 1090 - 1095
  • [42] Moment-based estimation of extendible Marshall-Olkin copulas
    Hering, Christian
    Mai, Jan-Frederik
    METRIKA, 2012, 75 (05) : 601 - 620
  • [43] Moment-based power estimation in very deep submicron technologies
    Garcia-Ortiz, A
    Kabulepa, L
    Murgan, T
    Glesner, M
    ICCAD-2003: IEEE/ACM DIGEST OF TECHNICAL PAPERS, 2003, : 107 - 112
  • [44] Moment-based estimation of the Nakagami-m fading parameter
    Cheng, J
    Beaulieu, NC
    2001 IEEE PACIFIC RIM CONFERENCE ON COMMUNICATIONS, COMPUTERS AND SIGNAL PROCESSING, VOLS I AND II, CONFERENCE PROCEEDINGS, 2001, : 361 - 364
  • [45] Moment-based estimation of extendible Marshall-Olkin copulas
    Christian Hering
    Jan-Frederik Mai
    Metrika, 2012, 75 : 601 - 620
  • [46] Moment-based approaches in imaging. part 3: Computational considerations
    Shu, Huazhong
    Luo, Limin
    Coatrieux, Jean-Louis
    IEEE ENGINEERING IN MEDICINE AND BIOLOGY MAGAZINE, 2008, 27 (03): : 89 - 91
  • [47] Chinese residential electricity consumption: Estimation and forecast using micro-data
    Cao, Jing
    Ho, Mun Sing
    Li, Yating
    Newell, Richard G.
    Pizer, William A.
    RESOURCE AND ENERGY ECONOMICS, 2019, 56 : 6 - 27
  • [48] Moment-Based Approximations of Probability Mass Functions with Applications Involving Order Statistics
    Provost, Serge B.
    Jiang, Min
    Ha, Hyung-Tae
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2009, 38 (12) : 1969 - 1981
  • [49] Moment-based multivariate permutation tests for ordinal categorical data
    Giancristofaro, Rosa Arboretti
    Bonnini, Stefano
    JOURNAL OF NONPARAMETRIC STATISTICS, 2008, 20 (05) : 383 - 393
  • [50] SEGMENTATION OF RANGE IMAGES - AN ORTHOGONAL MOMENT-BASED INTEGRATED APPROACH
    GHOSAL, S
    MEHROTRA, R
    IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, 1993, 9 (04): : 385 - 399