Large-scale dependent multiple testing via hidden semi-Markov models

被引:0
|
作者
Wang, Jiangzhou [1 ]
Wang, Pengfei [2 ]
机构
[1] Shenzhen Univ, Inst Stat Sci, Coll Math & Stat, Shenzhen 518060, Peoples R China
[2] Dongbei Univ Finance & Econ, Sch Stat, Dalian 116025, Peoples R China
关键词
FDR; Hidden semi-Markov model; Multiple testing; FALSE DISCOVERY RATE; EMPIRICAL BAYES;
D O I
10.1007/s00180-023-01367-z
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Large-scale multiple testing is common in the statistical analysis of high-dimensional data. Conventional multiple testing procedures usually implicitly assumed that the tests are independent. However, this assumption is rarely established in many practical applications, particularly in "high-throughput" data analysis. Incorporating dependence structure information among tests can improve statistical power and interpretability of discoveries. In this paper, we propose a new large-scale dependent multiple testing procedure based on the hidden semi-Markov model (HSMM), which characterizes local correlations among tests using a semi-Markov process instead of a first-order Markov chain. Our novel approach allows for the number of consecutive null hypotheses to follow any reasonable distribution, enabling a more accurate description of complex local correlations. We show that the proposed procedure minimizes the marginal false non-discovery rate (mFNR) at the same marginal false discovery rate (mFDR) level. To reduce the computational complexity of the HSMM, we make use of the hidden Markov model (HMM) with an expanded state space to approximate it. We provide a forward-backward algorithm and an expectation-maximization (EM) algorithm for implementing the proposed procedure. Finally, we demonstrate the superior performance of the SMLIS procedure through extensive simulations and a real data analysis.
引用
收藏
页码:1093 / 1126
页数:34
相关论文
共 50 条
  • [31] A Robust Method for Large-Scale Multiple Hypotheses Testing
    Han, Seungbong
    Andrei, Adin-Cristian
    Tsui, Kam-Wah
    BIOMETRICAL JOURNAL, 2010, 52 (02) : 222 - 232
  • [32] An improved noise-robust voice activity detector based on hidden semi-Markov models
    Liang, Yuan
    Liu, Xianglong
    Lou, Yihua
    Shan, Baosong
    PATTERN RECOGNITION LETTERS, 2011, 32 (07) : 1044 - 1053
  • [33] A Hierarchical Hidden Semi-Markov Model for Modeling Mobility Data
    Baratchi, Mitra
    Meratnia, Nirvana
    Havinga, Paul J. M.
    Skidmore, Andrew K.
    Toxopeus, Bert A. K. G.
    UBICOMP'14: PROCEEDINGS OF THE 2014 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING, 2014, : 401 - 412
  • [34] Reconstructing Individual Activity Trajectories by Hidden Semi-Markov Model
    Han, Zixuan
    Wan, Zijian
    Guo, Wanyi
    Ren, Chang
    2018 26TH INTERNATIONAL CONFERENCE ON GEOINFORMATICS (GEOINFORMATICS 2018), 2018,
  • [35] Hidden Markov and Semi-Markov Models with Multivariate Leptokurtic-Normal Components for Robust Modeling of Daily Returns Series
    Maruotti, Antonello
    Punzo, Antonio
    Bagnato, Luca
    JOURNAL OF FINANCIAL ECONOMETRICS, 2019, 17 (01) : 91 - 117
  • [36] A prognosis method using age-dependent hidden semi-Markov model for equipment health prediction
    Peng, Ying
    Dong, Ming
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2011, 25 (01) : 237 - 252
  • [37] Multiple Testing in Nonparametric Hidden Markov Models: An Empirical Bayes Approach
    Abraham, Kweku
    Castillo, Ismael
    Gassiat, Elisabeth
    JOURNAL OF MACHINE LEARNING RESEARCH, 2022, 23
  • [38] Hidden Semi-Markov Models in the Computerized Decoding of Microelectrode Recording Data for Deep Brain Stimulator Placement
    Taghva, Alexander
    WORLD NEUROSURGERY, 2011, 75 (5-6) : 758 - U221
  • [39] Application of hidden semi-Markov models for the seismic hazard assessment of the North and South Aegean Sea, Greece
    Pertsinidou, C. E.
    Tsaklidis, G.
    Papadimitriou, E.
    Limnios, N.
    JOURNAL OF APPLIED STATISTICS, 2017, 44 (06) : 1064 - 1085
  • [40] Large-scale multiple testing under dependence
    Sun, Wenguang
    Cai, T. Tony
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2009, 71 : 393 - 424