Generalized integration model for improved statistical inference by leveraging external summary data

Cited by: 41
Authors
Zhang, Han [1]
Deng, Lu [1]
Schiffman, Mark [1]
Qin, Jing [2]
Yu, Kai [1]
Affiliations
[1] NCI, Div Canc Epidemiol & Genet, 9609 Med Ctr Dr, Bethesda, MD 20892 USA
[2] NIAID, NIH, 6700B Rockledge Dr, Bethesda, MD 20892 USA
Keywords
Constraint maximum likelihood estimate; Empirical likelihood; Estimating equation; Lagrange multiplier; Meta-analysis; EMPIRICAL-LIKELIHOOD; INFORMATION; ESTIMATORS;
DOI
10.1093/biomet/asaa014
Chinese Library Classification number
Q [Biological sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract
Meta-analysis has become a powerful tool for improving inference by gathering evidence from multiple sources. It pools summary-level data from different studies to improve estimation efficiency with the assumption that all participating studies are analysed under the same statistical model. It is challenging to integrate external summary data calculated from different models with a newly conducted internal study in which individual-level data are collected. We develop a novel statistical inference framework that can effectively synthesize internal and external data for the integrative analysis. The new framework is versatile enough to assimilate various types of summary data from multiple sources. We establish asymptotic properties for the proposed procedure and prove that the new estimate is theoretically more efficient than the internal data based maximum likelihood estimate, as well as a recently developed constrained maximum likelihood approach that incorporates the external information. We illustrate an application of our method by evaluating cervical cancer risk using data from a large cervical screening program.
Pages: 689-703
Number of pages: 15
Related papers
22 in total
  • [1] Constrained Maximum Likelihood Estimation for Model Calibration Using Summary-Level Information From External Big Data Sources
    Chatterjee, Nilanjan
    Chen, Yi-Hau
    Maas, Paige
    Carroll, Raymond J.
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2016, 111 (513) : 107 - 117
  • [2] Generalized linear models incorporating population level information: an empirical-likelihood-based approach
    Chaudhuri, Sanjay
    Handcock, Mark S.
    Rendall, Michael S.
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2008, 70 : 311 - 328
  • [3] Using empirical likelihood methods to obtain range restricted weights in regression estimators for surveys
    Chen, J
    Sitter, RR
    Wu, C
    [J]. BIOMETRIKA, 2002, 89 (01) : 230 - 237
  • [4] Chen JH, 1999, STAT SINICA, V9, P385
  • [5] CHEN JH, 1993, BIOMETRIKA, V80, P107, DOI 10.1093/biomet/80.1.107
  • [6] CHENG W., 2018, APPL STAT, V68, P121
  • [7] Improving estimation and prediction in linear regression incorporating external information from an established reduced model
    Cheng, Wenting
    Taylor, Jeremy M. G.
    Vokonas, Pantel S.
    Park, Sung Kyun
    Mukherjee, Bhramar
    [J]. STATISTICS IN MEDICINE, 2018, 37 (09) : 1515 - 1530
  • [8] Comment
    Han, Peisong
    Lawless, Jerald F.
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2016, 111 (513) : 118 - 121
  • [9] LARGE SAMPLE PROPERTIES OF GENERALIZED-METHOD OF MOMENTS ESTIMATORS
    HANSEN, LP
    [J]. ECONOMETRICA, 1982, 50 (04) : 1029 - 1054
  • [10] Additive hazards model with auxiliary subgroup survival information
    He, Jie
    Li, Hui
    Zhang, Shumei
    Duan, Xiaogang
    [J]. LIFETIME DATA ANALYSIS, 2019, 25 (01) : 128 - 149