Secure Bayesian model averaging for horizontally partitioned data

Cited by: 9
Authors
Ghosh, Joyee [1]
Reiter, Jerome P. [2]
Affiliations
[1] Univ Iowa, Dept Stat & Actuarial Sci, Iowa City, IA 52242 USA
[2] Duke Univ, Dept Stat Sci, Durham, NC 27706 USA
Funding
National Science Foundation (USA)
Keywords
Bayesian model averaging; Data confidentiality; Disclosure limitation; Markov chain Monte Carlo; Regression; Variable selection; Normalizing constants; Approximations; Dimension; Binary
DOI
10.1007/s11222-011-9312-6
Chinese Library Classification (CLC) number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
When multiple data owners possess records on different subjects with the same set of attributes (known as horizontally partitioned data), the data owners can improve analyses by concatenating their databases. However, concatenating the data may be infeasible because of confidentiality concerns. In such settings, the data owners can use secure computation techniques to obtain the results of certain analyses on the integrated database without sharing individual records. We present secure computation protocols for Bayesian model averaging and model selection for both linear regression and probit regression. Using simulations based on genuine data, we illustrate the approach for probit regression and show that it can provide reasonable model selection outputs.
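The abstract does not say which secure computation protocol the authors use. As a rough illustration of how owners of horizontally partitioned data might obtain pooled regression quantities without exchanging records, the sketch below simulates a masked ring-pass secure summation of the sufficient statistics X'X and X'y. The function names, the fixed-point `SCALE`, and the `MODULUS` are assumptions made for this example and are not taken from the paper.

```python
import secrets

MODULUS = 2**64   # assumed ring size; must exceed the range of the true totals
SCALE = 10**6     # assumed fixed-point precision for encoding floats as integers


def encode(value):
    """Map a float to a residue modulo MODULUS using fixed-point scaling."""
    return round(value * SCALE) % MODULUS


def decode(residue):
    """Map a residue back to a signed float."""
    if residue >= MODULUS // 2:
        residue -= MODULUS
    return residue / SCALE


def local_statistics(rows, responses):
    """Return one owner's flattened X'X followed by X'y."""
    p = len(rows[0])
    xtx = [sum(r[i] * r[j] for r in rows) for i in range(p) for j in range(p)]
    xty = [sum(r[i] * y for r, y in zip(rows, responses)) for i in range(p)]
    return xtx + xty


def secure_sum(local_vectors):
    """Simulate a ring pass: the first owner adds random masks, every other
    owner adds its own values to the running total, and the first owner
    removes the masks at the end. Each owner only ever sees a masked total."""
    length = len(local_vectors[0])
    masks = [secrets.randbelow(MODULUS) for _ in range(length)]
    running = [(m + encode(v)) % MODULUS
               for m, v in zip(masks, local_vectors[0])]
    for vector in local_vectors[1:]:
        running = [(r + encode(v)) % MODULUS
                   for r, v in zip(running, vector)]
    return [decode((r - m) % MODULUS) for r, m in zip(running, masks)]


# Hypothetical toy data: two owners, same two attributes, different subjects.
owner_a = ([[1.0, 2.0], [1.0, 3.0]], [1.0, 2.0])
owner_b = ([[1.0, 5.0]], [4.0])
pooled = secure_sum([local_statistics(*owner_a), local_statistics(*owner_b)])
```

Here `pooled` holds the entries of the combined X'X followed by the combined X'y, which is what a linear regression fit needs from the data. With only two owners, either one can subtract its own contribution from the total, so a protocol of this kind hides individual contributions only when three or more owners take part.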
Pages: 311-322
Page count: 12