Generalized Hypergeometric Distributions Generated by Birth-Death Process in Bioinformatics

被引:0
作者
Kuznetsov, Vladimir A. [1 ,2 ]
Grageda, Andre [1 ]
Farbod, Davood [3 ]
机构
[1] SUNY Upstate Med Univ, Dept Urol, Dept Biochem & Mol Biol, Syracuse, NY 13210 USA
[2] ASTAR, Bioinformat Inst, Singapore 138671, Singapore
[3] Quchan Univ Technol, Dept Math, Quchan 9477177870, Iran
关键词
Birth-Death Process; generalized hypergeometric distributions; stationary process; limiting process; regular variation; bioinformatics data; MAXIMUM-LIKELIHOOD ESTIMATORS; GENE-EXPRESSION; SCALE-FREE; MODEL; EVOLUTION;
D O I
暂无
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Modern high-throughput biological systems detection methods generate empirical frequency distributions (EFD) which exhibit complex forms and have long right-side tails. Such EFD are often observed in normal and pathological processes, of which the probabilistic properties are essential, but the underlying probability mechanisms are poorly understood. To better understand the probability mechanisms driving biological complexity and the pathological role of extreme values, we propose that the observed skewed discrete distributions are generated by non-linear transition rates of birth and death processes (BDPs). We introduce a (3d+1)-parameter Generalized Gaussian Hypergeometric Probability ((3d+1)-GHP) model with the probabilities defined by a stationary solution of generalized BDP (g-BDP) and represented by generalized hypergeometric series with regularly varying function properties. We study the Regularly Varying 3d-Parameter Generalized Gaussian Hypergeometric Probability (3d-RGHP) function's regular variation properties, asymptotically constant slow varying component, unimodality and upward/downward convexity which allows us to specify a family of 3d-RGHP models and study their analytical and numerical characteristics. The frequency distribution of unique mutations occurring in the human genome of patients with melanoma have been analyzed as an example application of our theory in bioinformatics. The results show that the parameterized model not only fits the 'heavy tail' well, but also the entire EFD taken on the complete experimental outcome space. Our model provides a rigorous and flexible mathematical framework for analysis and application of skewed distributions generated by BDPs which often occur in bioinformatics and big data science.
引用
收藏
页码:303 / 327
页数:25
相关论文
共 50 条
  • [31] Spectral properties of birth-death polynomials
    van Doorn, Erik A.
    JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2015, 284 : 251 - 258
  • [32] Speed of stability for birth-death processes
    Chen, Mu-Fa
    FRONTIERS OF MATHEMATICS IN CHINA, 2010, 5 (03) : 379 - 515
  • [33] Probability Distribution of Tree Age for the Simple Birth-Death Process, with Applications to Distributions of Number of Ancestral Lineages and Divergence Times for Pairs of Taxa in a Yule Tree
    Mulder, Willem H.
    BULLETIN OF MATHEMATICAL BIOLOGY, 2023, 85 (10)
  • [34] FORMULAS FOR AVERAGE TRANSITION TIMES BETWEEN STATES OF THE MARKOV BIRTH-DEATH PROCESS
    Zhernovyi, Yuriy
    Kopytko, Bohdan
    JOURNAL OF APPLIED MATHEMATICS AND COMPUTATIONAL MECHANICS, 2021, 20 (04) : 99 - 110
  • [35] Study on birth-death process in the evolution modes of high-tech effusion
    Wu, H
    Ma, QG
    Yao, ZJ
    VALUE ENGINEERING & TECHNOLOGY INNOVATION, INTERNATIONAL CONFERENCE PROCEEDINGS, 1999, : 177 - 184
  • [36] The Effect of Fossil Sampling on the Estimation of Divergence Times with the Fossilized Birth-Death Process
    O'Reilly, Joseph E.
    Donoghue, Philip C. J.
    SYSTEMATIC BIOLOGY, 2020, 69 (01) : 124 - 138
  • [37] A Birth-Death Process Model on Collision and Coalescence of Drops in Gas/Drop Flows
    Xue, S. S.
    Xu, M.
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON ELECTRICAL, AUTOMATION AND MECHANICAL ENGINEERING (EAME 2015), 2015, 13 : 788 - 791
  • [38] The fossilized birth-death process for coherent calibration of divergence-time estimates
    Heath, Tracy A.
    Huelsenbeck, John P.
    Stadler, Tanja
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2014, 111 (29) : E2957 - E2966
  • [39] The Occurrence Birth-Death Process for Combined-Evidence Analysis in Macroevolution and Epidemiology
    Andreoletti, Jeremy
    Zwaans, Antoine
    Warnock, Rachel C. M.
    Aguirre-Fernandez, Gabriel
    Barido-Sottani, Joelle
    Gupta, Ankit
    Stadler, Tanja
    Manceau, Marc
    SYSTEMATIC BIOLOGY, 2022, 71 (06) : 1440 - 1452
  • [40] Analysis of stationary fluid queue driven by state-dependent birth-death process subject to catastrophes
    Ammar, S. I.
    Samanta, S. K.
    Kilany, N. M.
    Jiang, T.
    SCIENTIA IRANICA, 2024, 31 (14) : 1149 - 1158