Large-scale dependent multiple testing via hidden semi-Markov models

被引:0
|
作者
Wang, Jiangzhou [1 ]
Wang, Pengfei [2 ]
机构
[1] Shenzhen Univ, Inst Stat Sci, Coll Math & Stat, Shenzhen 518060, Peoples R China
[2] Dongbei Univ Finance & Econ, Sch Stat, Dalian 116025, Peoples R China
关键词
FDR; Hidden semi-Markov model; Multiple testing; FALSE DISCOVERY RATE; EMPIRICAL BAYES;
D O I
10.1007/s00180-023-01367-z
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Large-scale multiple testing is common in the statistical analysis of high-dimensional data. Conventional multiple testing procedures usually implicitly assumed that the tests are independent. However, this assumption is rarely established in many practical applications, particularly in "high-throughput" data analysis. Incorporating dependence structure information among tests can improve statistical power and interpretability of discoveries. In this paper, we propose a new large-scale dependent multiple testing procedure based on the hidden semi-Markov model (HSMM), which characterizes local correlations among tests using a semi-Markov process instead of a first-order Markov chain. Our novel approach allows for the number of consecutive null hypotheses to follow any reasonable distribution, enabling a more accurate description of complex local correlations. We show that the proposed procedure minimizes the marginal false non-discovery rate (mFNR) at the same marginal false discovery rate (mFDR) level. To reduce the computational complexity of the HSMM, we make use of the hidden Markov model (HMM) with an expanded state space to approximate it. We provide a forward-backward algorithm and an expectation-maximization (EM) algorithm for implementing the proposed procedure. Finally, we demonstrate the superior performance of the SMLIS procedure through extensive simulations and a real data analysis.
引用
收藏
页码:1093 / 1126
页数:34
相关论文
共 50 条
  • [41] Probabilistic stability and stabilization of human-machine system via hidden semi-Markov modeling approach
    Liu, Yang-Fan
    Wu, Huai-Ning
    APPLIED MATHEMATICS AND COMPUTATION, 2025, 489
  • [42] A novel model for user clicks identification based on hidden semi-Markov
    Xu, C.
    Du, C.
    Zhao, G. F.
    Yu, S.
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2013, 36 (02) : 791 - 798
  • [43] ECG segmentation algorithm based on bidirectional hidden semi-Markov model
    Huo, Rui
    Zhang, Liting
    Liu, Feifei
    Wang, Ying
    Liang, Yesong
    Wei, Shoushui
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 150
  • [44] Network Security Situation Assessment Based on Hidden Semi-Markov Model
    Zhang, Boyun
    Chen, Zhigang
    Yan, Xiai
    Wang, Shulin
    Fan, Qiang
    ADVANCED INTELLIGENT COMPUTING, 2011, 6838 : 509 - +
  • [45] A hidden semi-Markov model-based speech synthesis system
    Zen, Heiga
    Tokuda, Keiichi
    Masuko, Takashi
    Kobayasih, Takao
    Kitamura, Tadashi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2007, E90D (05): : 825 - 834
  • [46] Application of Hidden Semi-Markov Model to 3′ splice sites identification
    Feng, XC
    Qian, MP
    Deng, MH
    Ma, XT
    Yan, XT
    PROGRESS IN BIOCHEMISTRY AND BIOPHYSICS, 2004, 31 (05) : 455 - 458
  • [47] Spectrum Sensing in Cognitive Radio Based on Hidden Semi-Markov Model
    Di, Lujie
    Ding, Xueke
    Li, Mingbing
    Wan, Qun
    ADVANCED HYBRID INFORMATION PROCESSING, ADHIP 2019, PT II, 2019, 302 : 275 - 286
  • [48] Statistical inference and large-scale multiple testing for high-dimensional regression models
    Cai, T. Tony
    Guo, Zijian
    Xia, Yin
    TEST, 2023, 32 (04) : 1135 - 1171
  • [49] Global Testing and Large-Scale Multiple Testing for High-Dimensional Covariance Structures
    Cai, T. Tony
    ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, VOL 4, 2017, 4 : 423 - 446
  • [50] Heteroscedasticity-Adjusted Ranking and Thresholding for Large-Scale Multiple Testing
    Fu, Luella
    Gang, Bowen
    James, Gareth M.
    Sun, Wenguang
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2022, 117 (538) : 1028 - 1040