Mixture density networks for the indirect estimation of reference intervals

被引:3
|
作者
Hepp, Tobias [1 ,2 ]
Zierk, Jakob [3 ]
Rauh, Manfred [3 ]
Metzler, Markus [3 ]
Seitz, Sarem [4 ]
机构
[1] Friedrich Alexander Univ Erlangen Nurnberg, Dept Med Informat Biometry & Epidemiol, Waldstr 6, D-91054 Erlangen, Germany
[2] Georg August Univ Gottingen, Chair Spatial Data Sci & Stat Learning, Pl Gottinger Sieben 3, D-37073 Gottingen, Germany
[3] Univ Hosp Erlangen, Dept Pediat & Adolescent Med, Loschgestr 15, D-91054 Erlangen, Germany
[4] Otto Friedrich Univ Bamberg, Dept Informat Syst & Appl Comp Sci, Kapuzinerstr 16, D-96047 Bamberg, Germany
关键词
Mixture density networks; Reference intervals; Latent class regression; Distributional regression; PEDIATRIC REFERENCE INTERVALS; MAXIMUM-LIKELIHOOD;
D O I
10.1186/s12859-022-04846-0
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background Reference intervals represent the expected range of physiological test results in a healthy population and are essential to support medical decision making. Particularly in the context of pediatric reference intervals, where recruitment regulations make prospective studies challenging to conduct, indirect estimation strategies are becoming increasingly important. Established indirect methods enable robust identification of the distribution of "healthy" samples from laboratory databases, which include unlabeled pathologic cases, but are currently severely limited when adjusting for essential patient characteristics such as age. Here, we propose the use of mixture density networks (MDN) to overcome this problem and model all parameters of the mixture distribution in a single step. Results Estimated reference intervals from varying settings with simulated data demonstrate the ability to accurately estimate latent distributions from unlabeled data using different implementations of MDNs. Comparing the performance with alternative estimation approaches further highlights the importance of modeling the mixture component weights as a function of the input in order to avoid biased estimates for all other parameters and the resulting reference intervals. We also provide a strategy to generate partially customized starting weights to improve proper identification of the latent components. Finally, the application on real-world hemoglobin samples provides results in line with current gold standard approaches, but also suggests further investigations with respect to adequate regularization strategies in order to prevent overfitting the data. Conclusions Mixture density networks provide a promising approach capable of extracting the distribution of healthy samples from unlabeled laboratory databases while simultaneously and explicitly estimating all parameters and component weights as non-linear functions of the covariate(s), thereby allowing the estimation of age-dependent reference intervals in a single step. Further studies on model regularization and asymmetric component distributions are warranted to consolidate our findings and expand the scope of applications.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Unsupervised machine learning method for indirect estimation of reference intervals for chronic kidney disease in the Puerto Rican population
    Velev, Julian
    LeBien, Jack
    Roche-Lima, Abiel
    SCIENTIFIC REPORTS, 2023, 13 (01):
  • [32] Indirect estimation of reference intervals using first or last results and results from patients without repeated measurements
    Arzideh, Farhad
    Oezcueruemez, Mustafa
    Albers, Eike
    Haeckel, Rainer
    Streichert, Thomas
    JOURNAL OF LABORATORY MEDICINE, 2021, 45 (02) : 103 - 109
  • [33] Indirect methods for reference intervals based on current data - Response
    Grossi, E
    Colombo, R
    Cavuto, S
    Franzini, C
    CLINICAL CHEMISTRY, 2006, 52 (02) : 337 - 338
  • [34] Indirect determination of biochemistry reference intervals using outpatient data
    Martinez-Sanchez, Luisa
    Cobbaert, Christa M.
    Noordam, Raymond
    Brouwer, Nannette
    Blanco-Grau, Albert
    Villena-Ortiz, Yolanda
    Thelen, Marc
    Ferrer-Costa, Roser
    Casis, Ernesto
    Rodriguez-Frias, Francisco
    den Elzen, Wendy P. J.
    PLOS ONE, 2022, 17 (05):
  • [35] MICROCOMPUTER-ASSISTED ESTIMATION OF REFERENCE INTERVALS
    FENTON, JJ
    CLINICAL CHEMISTRY, 1986, 32 (06) : 1180 - 1180
  • [36] Indirect reference intervals for TSH in a sample of lebanese pregnant women
    Eid, Dollen
    El Bcherawi, Nizar
    Tayeh, Georges Abi
    El Ghorayeb, Nada
    Gannage-Yared, Marie-Helene
    PRACTICAL LABORATORY MEDICINE, 2025, 44
  • [37] Calculation of reference intervals for thyroid hormones using an indirect approach
    Rolic, T.
    Aas, F. E.
    Westbye, A. B.
    Mandic, S.
    Thorsby, P. M.
    CLINICA CHIMICA ACTA, 2024, 558
  • [38] Unsupervised machine learning method for indirect estimation of reference intervals for chronic kidney disease in the Puerto Rican population
    Julian Velev
    Jack LeBien
    Abiel Roche-Lima
    Scientific Reports, 13 (1)
  • [39] INDIRECT ESTABLISHING OF REFERENCE INTERVALS - IS ITS VALUE ALWAYS QUESTIONABLE
    PIECHOTA, W
    SYMONOWICZ, N
    PASTUSIAK, Z
    JOURNAL OF AUTOMATIC CHEMISTRY, 1982, 4 (02): : 99 - 100
  • [40] Reference intervals for thyroid disorders calculated by indirect method and comparison with reference change values
    Yildiz, Zeynep
    Dagdelen, Lale Koeroglu
    BIOCHEMIA MEDICA, 2023, 33 (01)