rox: A Statistical Model for Regression with Missing Values

被引:1
|
作者
Buyukozkan, Mustafa [1 ]
Benedetti, Elisa [1 ]
Krumsiek, Jan [1 ]
机构
[1] Weill Cornell Med, Inst Computat Biomed, Dept Physiol & Biophys, New York, NY 10021 USA
基金
中国国家自然科学基金;
关键词
missing values; regression analysis; limit-of-detection; PARAMETERS; IMPUTATION;
D O I
10.3390/metabo13010127
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
High-dimensional omics datasets frequently contain missing data points, which typically occur due to concentrations below the limit of detection (LOD) of the profiling platform. The presence of such missing values significantly limits downstream statistical analysis and result interpretation. Two common techniques to deal with this issue include the removal of samples with missing values and imputation approaches that substitute the missing measurements with reasonable estimates. Both approaches, however, suffer from various shortcomings and pitfalls. In this paper, we present "rox", a novel statistical model for the analysis of omics data with missing values without the need for imputation. The model directly incorporates missing values as "low" concentrations into the calculation. We show the superiority of rox over common approaches on simulated data and on six metabolomics datasets. Fully leveraging the information contained in LOD-based missing values, rox provides a powerful tool for the statistical analysis of omics data.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] An Augmented Regression Model for Tensors With Missing Values
    Wang, Feng
    Gahrooei, Mostafa Reisi
    Zhong, Zhen
    Tang, Tao
    Shi, Jianjun
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2022, 19 (04) : 2968 - 2984
  • [2] A statistical emulator for multivariate model outputs with missing values
    Finazzi, Francesco
    Napier, Yoana
    Scott, Marian
    Hills, Alan
    Cameletti, Michela
    ATMOSPHERIC ENVIRONMENT, 2019, 199 : 415 - 422
  • [3] MISSING VALUES IN QUADRATIC REGRESSION
    REED, AH
    BIOMETRICS, 1969, 25 (04) : 801 - &
  • [4] Regression imputation in the functional linear model with missing values in the response
    Crambes, Christophe
    Henchiri, Yousri
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2019, 201 : 103 - 119
  • [5] MISSING VALUES IN MULTIPLE-REGRESSION
    AFIFI, A
    ELASHOFF, RM
    ANNALS OF MATHEMATICAL STATISTICS, 1964, 35 (03): : 1389 - &
  • [6] Approximation of Missing Values for Classification and Regression
    Su, Yi-juan
    Zhong, Zhi
    Li, Wen-jing
    RESOURCES AND SUSTAINABLE DEVELOPMENT, PTS 1-4, 2013, 734-737 : 2938 - 2944
  • [7] Multivariate linear regression with missing values
    Beyad, Yaser
    Maeder, Marcel
    ANALYTICA CHIMICA ACTA, 2013, 796 : 38 - 41
  • [8] On sliced inverse regression with missing values
    Dong, Yuexiao
    Li, Zeda
    JOURNAL OF NONPARAMETRIC STATISTICS, 2018, 30 (04) : 990 - 1002
  • [9] Logistic regression with missing values in the covariates
    1600, American Statistical Assoc, Alexandria, VA, USA (37):
  • [10] COMMENTS ON THE WEIGHTED REGRESSION APPROACH TO MISSING VALUES
    CONNIFFE, D
    ECONOMIC AND SOCIAL REVIEW, 1983, 14 (04) : 259 - 272