Mixture density networks for the indirect estimation of reference intervals

被引:3
|
作者
Hepp, Tobias [1 ,2 ]
Zierk, Jakob [3 ]
Rauh, Manfred [3 ]
Metzler, Markus [3 ]
Seitz, Sarem [4 ]
机构
[1] Friedrich Alexander Univ Erlangen Nurnberg, Dept Med Informat Biometry & Epidemiol, Waldstr 6, D-91054 Erlangen, Germany
[2] Georg August Univ Gottingen, Chair Spatial Data Sci & Stat Learning, Pl Gottinger Sieben 3, D-37073 Gottingen, Germany
[3] Univ Hosp Erlangen, Dept Pediat & Adolescent Med, Loschgestr 15, D-91054 Erlangen, Germany
[4] Otto Friedrich Univ Bamberg, Dept Informat Syst & Appl Comp Sci, Kapuzinerstr 16, D-96047 Bamberg, Germany
关键词
Mixture density networks; Reference intervals; Latent class regression; Distributional regression; PEDIATRIC REFERENCE INTERVALS; MAXIMUM-LIKELIHOOD;
D O I
10.1186/s12859-022-04846-0
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background Reference intervals represent the expected range of physiological test results in a healthy population and are essential to support medical decision making. Particularly in the context of pediatric reference intervals, where recruitment regulations make prospective studies challenging to conduct, indirect estimation strategies are becoming increasingly important. Established indirect methods enable robust identification of the distribution of "healthy" samples from laboratory databases, which include unlabeled pathologic cases, but are currently severely limited when adjusting for essential patient characteristics such as age. Here, we propose the use of mixture density networks (MDN) to overcome this problem and model all parameters of the mixture distribution in a single step. Results Estimated reference intervals from varying settings with simulated data demonstrate the ability to accurately estimate latent distributions from unlabeled data using different implementations of MDNs. Comparing the performance with alternative estimation approaches further highlights the importance of modeling the mixture component weights as a function of the input in order to avoid biased estimates for all other parameters and the resulting reference intervals. We also provide a strategy to generate partially customized starting weights to improve proper identification of the latent components. Finally, the application on real-world hemoglobin samples provides results in line with current gold standard approaches, but also suggests further investigations with respect to adequate regularization strategies in order to prevent overfitting the data. Conclusions Mixture density networks provide a promising approach capable of extracting the distribution of healthy samples from unlabeled laboratory databases while simultaneously and explicitly estimating all parameters and component weights as non-linear functions of the covariate(s), thereby allowing the estimation of age-dependent reference intervals in a single step. Further studies on model regularization and asymmetric component distributions are warranted to consolidate our findings and expand the scope of applications.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Harmonization of indirect reference intervals calculation by the Bhattacharya method
    Martinez-Sanchez, Luisa
    Gabriel-Medina, Pablo
    Villena-Ortiz, Yolanda
    Garcia-Fernandez, Alba E.
    Blanco-Grau, Albert
    Cobbaert, Christa M.
    Bravo-Nieto, Daniel
    Garriga-Edo, Sarai
    Sanz-Gea, Clara
    Gonzalez-Silva, Gonzalo
    Lopez-Hellin, Joan
    Ferrer-Costa, Roser
    Casis, Ernesto
    Rodriguez-Frias, Francisco
    den Elzen, Wendy P. J.
    CLINICAL CHEMISTRY AND LABORATORY MEDICINE, 2023, 61 (02) : 266 - 274
  • [22] Indirect determination of pediatric blood count reference intervals
    Zierk, J.
    Arzideh, F.
    Haeckel, R.
    Rascher, W.
    Rauh, M.
    Metzler, M.
    KLINISCHE PADIATRIE, 2012, 224 (03): : 225 - 225
  • [23] ESTIMATION OF REFERENCE INTERVALS FOR TRANSCRANIAL MAGNETIC STIMULATION: DERIVED PARAMETERS VIA THE HOFFMAN INDIRECT METHOD Reply
    Nandedkar, Sanjeev D.
    Sanders, Donald B.
    Hobson-Webb, Lisa D.
    Billlakota, Santoshi
    Barkhaus, Paul E.
    Stalberg, Erik V.
    MUSCLE & NERVE, 2018, 58 (04) : E31 - E32
  • [24] Use of total patient data for indirect estimation of reference intervals for 40 clinical chemical analytes in Turkey
    Ilcol, Yesim Ozarda
    Aslan, Diler
    CLINICAL CHEMISTRY AND LABORATORY MEDICINE, 2006, 44 (07) : 867 - 876
  • [25] Plasma lipids paediatric reference intervals: Indirect estimation using a large 14-year database
    Galozzi, Paola
    Padoan, Andrea
    Moretti, Carlo
    Aita, Ada
    Basso, Daniela
    JOURNAL OF PEDIATRIC GASTROENTEROLOGY AND NUTRITION, 2024, 79 (02): : 259 - 268
  • [26] Distributed Unsupervised Gaussian Mixture Learning for Density Estimation in Sensor Networks
    Safarinejadian, Behrooz
    Menhaj, Mohammad B.
    Karrari, Mehdi
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2010, 59 (09) : 2250 - 2260
  • [27] Estimation of density by indirect observations
    Solev V.N.
    Journal of Mathematical Sciences, 2013, 188 (6) : 753 - 757
  • [28] Indirect estimation of pediatric reference interval via density graph deep embedded clustering
    Zheng, Jianguo
    Tang, Yongqiang
    Peng, Xiaoxia
    Zhao, Jun
    Chen, Rui
    Yan, Ruohua
    Peng, Yaguang
    Zhang, Wensheng
    COMPUTERS IN BIOLOGY AND MEDICINE, 2024, 169
  • [29] Confidence intervals for kernel density estimation
    Fiorio, Carlo V.
    STATA JOURNAL, 2004, 4 (02): : 168 - 179
  • [30] Confidence intervals for logspline density estimation
    Kooperberg, C
    Stone, CJ
    NONLINEAR ESTIMATION AND CLASSIFICATION, 2003, 171 : 285 - 295