Density ratio model with data-adaptive basis function

被引:0
作者
Zhang, Archer Gong [1 ]
Chen, Jiahua [1 ]
机构
[1] Univ British Columbia, Dept Stat, Vancouver, BC V6T 1Z4, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Empirical likelihood; Functional principal component analysis; Multiple density estimation; Multiple distributions; Multiple quantile estimation; GOODNESS-OF-FIT; EMPIRICAL LIKELIHOOD; LOGISTIC-REGRESSION; DISTRIBUTIONS; INFERENCE;
D O I
10.1016/j.jmva.2022.105043
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In many applications, we collect samples from multiple interconnected populations. These population distributions share some latent structure, so it is advantageous to jointly analyze the samples to make efficient inferences on the multiple distributions and their functionals. One effective way to connect the distributions is the density ratio model (DRM). A key ingredient of the DRM is that the log density ratios are linear combinations of prespecified functions; the vector formed by these functions is called the basis function. The benefit of DRM relies on correctly specifying the basis function to a large degree. In applications, the user may not have a complete knowledge to enable a suitable choice of the basis function, and many discussions have been devoted to this topic. In this article, we consider the still open problem of a data-adaptive choice of the basis function that can alleviate the risk of severe model misspecification. We propose a data-adaptive approach to the choice of basis function based on functional principal component analysis. Under some conditions, we show that this approach leads to consistent basis function estimation. Our simulation results show that the proposed adaptive choice can achieve an efficiency gain. We use a real-data example from economics to demonstrate the efficiency gain and the ease of our approach. (C) 2022 Elsevier Inc. All rights reserved.
引用
收藏
页数:18
相关论文
共 47 条
  • [1] Abe M., 2019, OPEN REV
  • [2] Akaike H., 1973, 2 INT S INF THEOR, P267
  • [3] ANDERSON JA, 1979, BIOMETRIKA, V66, P17, DOI 10.1093/biomet/66.1.17
  • [4] [Anonymous], 2007, Graduate Texts in Mathematics
  • [5] Arveson W., 2002, GRAD TEXTS MATH, V209
  • [6] HYPOTHESIS TESTING IN THE PRESENCE OF MULTIPLE SAMPLES UNDER DENSITY RATIO MODELS
    Cai, Song
    Chen, Jiahua
    Zidek, James V.
    [J]. STATISTICA SINICA, 2017, 27 (02) : 761 - 783
  • [7] Chen J., 2022, ANN APPL STAT
  • [8] EMPIRICAL LIKELIHOOD ESTIMATION FOR FINITE POPULATIONS AND THE EFFECTIVE USAGE OF AUXILIARY INFORMATION
    CHEN, JH
    QIN, J
    [J]. BIOMETRIKA, 1993, 80 (01) : 107 - 116
  • [9] Composite empirical likelihood for multisample clustered data
    Chen, Jiahua
    Li, Pengfei
    Liu, Yukun
    Zidek, James V.
    [J]. JOURNAL OF NONPARAMETRIC STATISTICS, 2021, 33 (01) : 60 - 81
  • [10] QUANTILE AND QUANTILE-FUNCTION ESTIMATIONS UNDER DENSITY RATIO MODEL
    Chen, Jiahua
    Liu, Yukun
    [J]. ANNALS OF STATISTICS, 2013, 41 (03) : 1669 - 1692