Mixture density networks for the indirect estimation of reference intervals

被引:3
|
作者
Hepp, Tobias [1 ,2 ]
Zierk, Jakob [3 ]
Rauh, Manfred [3 ]
Metzler, Markus [3 ]
Seitz, Sarem [4 ]
机构
[1] Friedrich Alexander Univ Erlangen Nurnberg, Dept Med Informat Biometry & Epidemiol, Waldstr 6, D-91054 Erlangen, Germany
[2] Georg August Univ Gottingen, Chair Spatial Data Sci & Stat Learning, Pl Gottinger Sieben 3, D-37073 Gottingen, Germany
[3] Univ Hosp Erlangen, Dept Pediat & Adolescent Med, Loschgestr 15, D-91054 Erlangen, Germany
[4] Otto Friedrich Univ Bamberg, Dept Informat Syst & Appl Comp Sci, Kapuzinerstr 16, D-96047 Bamberg, Germany
关键词
Mixture density networks; Reference intervals; Latent class regression; Distributional regression; PEDIATRIC REFERENCE INTERVALS; MAXIMUM-LIKELIHOOD;
D O I
10.1186/s12859-022-04846-0
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background Reference intervals represent the expected range of physiological test results in a healthy population and are essential to support medical decision making. Particularly in the context of pediatric reference intervals, where recruitment regulations make prospective studies challenging to conduct, indirect estimation strategies are becoming increasingly important. Established indirect methods enable robust identification of the distribution of "healthy" samples from laboratory databases, which include unlabeled pathologic cases, but are currently severely limited when adjusting for essential patient characteristics such as age. Here, we propose the use of mixture density networks (MDN) to overcome this problem and model all parameters of the mixture distribution in a single step. Results Estimated reference intervals from varying settings with simulated data demonstrate the ability to accurately estimate latent distributions from unlabeled data using different implementations of MDNs. Comparing the performance with alternative estimation approaches further highlights the importance of modeling the mixture component weights as a function of the input in order to avoid biased estimates for all other parameters and the resulting reference intervals. We also provide a strategy to generate partially customized starting weights to improve proper identification of the latent components. Finally, the application on real-world hemoglobin samples provides results in line with current gold standard approaches, but also suggests further investigations with respect to adequate regularization strategies in order to prevent overfitting the data. Conclusions Mixture density networks provide a promising approach capable of extracting the distribution of healthy samples from unlabeled laboratory databases while simultaneously and explicitly estimating all parameters and component weights as non-linear functions of the covariate(s), thereby allowing the estimation of age-dependent reference intervals in a single step. Further studies on model regularization and asymmetric component distributions are warranted to consolidate our findings and expand the scope of applications.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Mixture density networks for the indirect estimation of reference intervals
    Tobias Hepp
    Jakob Zierk
    Manfred Rauh
    Markus Metzler
    Sarem Seitz
    BMC Bioinformatics, 23
  • [2] Indirect Estimation of Reference Intervals for Thyroid Parameters
    Milinkovic, Neda
    Ignjatovic, Svetlana
    Zarkovic, Milos
    Radosavljevic, Branimir
    Majkic-Singh, Nada
    CLINICAL LABORATORY, 2014, 60 (07) : 1083 - 1089
  • [3] Indirect Method for Estimation of Reference Intervals of Inflammatory Markers
    Kang, Taewon
    Yoo, Jeaeun
    Jekarl, Dong Wook
    Chae, Hyojin
    Kim, Myungshin
    Park, Yeon-Joon
    Oh, Eun-Jee
    Kim, Yonggoo
    ANNALS OF LABORATORY MEDICINE, 2023, 43 (01) : 55 - +
  • [4] Diurnal variation of leukocyte counts affects the indirect estimation of reference intervals
    Torge, Antje
    Haeckel, Rainer
    Oezcueruemez, Mustafa
    Krebs, Alexander
    Junker, Ralf
    JOURNAL OF LABORATORY MEDICINE, 2021, 45 (02) : 121 - 124
  • [5] Indirect approaches to estimate reference intervals
    Haeckel, Rainer
    JOURNAL OF LABORATORY MEDICINE, 2021, 45 (02) : 31 - 33
  • [6] TSH continuous reference intervals by indirect methods: A Comparisons to Partitioned Reference Intervals
    Mokhtar, Khelil Mohamed
    CLINICAL BIOCHEMISTRY, 2020, 85 : 53 - 56
  • [7] INDIRECT ESTIMATION OF BLOOD CELLS REFERENCE INTERVALS FROM CLINICAL DATA IN THE COMMERCIAL REFERENCE LABORATORY.
    Sohn, Yong-Hak
    Kim, Min Kyung
    Cha, Choong-Hwan
    INTERNATIONAL JOURNAL OF LABORATORY HEMATOLOGY, 2019, 41 : 162 - 163
  • [8] INDIRECT ESTIMATION OF REFERENCE INTERVALS FOR THYROID PARAMETERS USING ADVIA CENTAUR XP ANALYZER
    Mirjanic-Azaric, Bosa
    Milinkovics, Neda
    Bogavac-Stanojevic, Natasa
    Avram, Sanja
    Stojakovic-Jelisavac, Tanja
    Stojanovic, Darja
    JOURNAL OF MEDICAL BIOCHEMISTRY, 2022, 41 (02) : 238 - 245
  • [9] Indirect Estimation of the Pediatric Reference Intervals for Thyroid Function Tests in Public Health Laboratory
    Turhan, Turan
    Kilinckaya, Muhammed F.
    CLINICAL LABORATORY, 2023, 69 (05) : 925 - 929
  • [10] Causal Interpretability and Uncertainty Estimation in Mixture Density Networks
    Swamy, Gokul
    Das, Arunita
    Niranjan, Shobhit
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT V, 2023, 14258 : 243 - 254