Matching a discrete distribution by Poisson matching quantiles estimation

被引:0
作者
Lim, Hyungjun [1 ]
Kim, Arlene K. H. [1 ]
机构
[1] Korea Univ, Dept Stat, 145 Anam Ro, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
Matching distributions; PMQE; discrete variable; unpaired data analysis; deviance; SAMPLE QUANTILES; REGRESSION;
D O I
10.1080/02664763.2024.2337082
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Analyzing the data collected from different sources requires unpaired data analysis to account for the absence of correspondence between the random variable Y and the covariates $ \boldsymbol {X} $ X. Several attempts have been made to analyze continuous Y, but it may follow a discrete distribution, which previous methodologies have overlooked. To address these limitations, we propose Poisson matching quantiles estimation (PMQE), the first unpaired data analysis method designed to examine the discrete Y and the unpaired continuous covariates $ \boldsymbol{X} $ X. Using their order statistics, the PMQE method matches the linear combination of random variables $ \boldsymbol{\beta} <^>{T} \boldsymbol{X} $ beta TX to $ {\rm log}(Y) $ log(Y). We further improve the performance of the proposed method by $ \ell _1 $ l1 penalizing $ \boldsymbol{\beta} $ beta, leading to the PMQE LASSO. An effective algorithm and simulation results are presented, along with the convergence results. We illustrate the practical application of PMQE using real data.
引用
收藏
页码:3102 / 3124
页数:23
相关论文
共 30 条
[1]  
Abid A., 2017, ARXIV
[2]  
Abid A, 2018, ANN ALLERTON CONF, P470, DOI 10.1109/ALLERTON.2018.8635907
[3]  
Balabdaoui F, 2021, J MACH LEARN RES, V22
[4]  
BRESLOW NE, 1984, J R STAT SOC C-APPL, V33, P38
[5]  
Carpentier A, 2016, JMLR WORKSH CONF PRO, V51, P658
[6]   GENERALIZATION OF POISSON DISTRIBUTION [J].
CONSUL, PC ;
JAIN, GC .
TECHNOMETRICS, 1973, 15 (04) :791-799
[7]   TESTS FOR DETECTING OVERDISPERSION IN POISSON REGRESSION-MODELS [J].
DEAN, C ;
LAWLESS, JF .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1989, 84 (406) :467-472
[8]   Facility Dependent Distance Decay in Competitive Location [J].
Drezner, Tammy ;
Drezner, Zvi ;
Zerom, Dawit .
NETWORKS & SPATIAL ECONOMICS, 2020, 20 (04) :915-934
[9]  
Fang G., 2023, INT C MACH LEARN PML, P9716
[10]   Modeling underdispersed count data with generalized Poisson regression [J].
Harris, Tammy ;
Yang, Zhao ;
Hardin, James W. .
STATA JOURNAL, 2012, 12 (04) :736-747