Covariate-modulated large-scale multiple testing under dependence
被引:2
|
作者:
Wang, Jiangzhou
论文数: 0引用数: 0
h-index: 0
机构:
Shenzhen Univ, Coll Math & Stat, Shenzhen 518060, Peoples R ChinaShenzhen Univ, Coll Math & Stat, Shenzhen 518060, Peoples R China
Wang, Jiangzhou
[1
]
Cui, Tingting
论文数: 0引用数: 0
h-index: 0
机构:
Northeast Normal Univ, Sch Math & Stat, Key Lab Appl Stat MOE, Changchun, Peoples R ChinaShenzhen Univ, Coll Math & Stat, Shenzhen 518060, Peoples R China
Cui, Tingting
[2
]
Zhu, Wensheng
论文数: 0引用数: 0
h-index: 0
机构:
Northeast Normal Univ, Sch Math & Stat, Key Lab Appl Stat MOE, Changchun, Peoples R ChinaShenzhen Univ, Coll Math & Stat, Shenzhen 518060, Peoples R China
Zhu, Wensheng
[2
]
Wang, Pengfei
论文数: 0引用数: 0
h-index: 0
机构:
Dongbei Univ Finance & Econ, Sch Stat, Dalian 116025, Peoples R ChinaShenzhen Univ, Coll Math & Stat, Shenzhen 518060, Peoples R China
Wang, Pengfei
[3
]
机构:
[1] Shenzhen Univ, Coll Math & Stat, Shenzhen 518060, Peoples R China
[2] Northeast Normal Univ, Sch Math & Stat, Key Lab Appl Stat MOE, Changchun, Peoples R China
[3] Dongbei Univ Finance & Econ, Sch Stat, Dalian 116025, Peoples R China
Large-scale multiple testing, which calls for conducting tens of thousands of hypothesis testings simultaneously, has been applied in many scientific fields. Most conventional multiple testing procedures often focused on the control of false discovery rate (FDR) and largely ignored covariate information and the dependence structure among tests. A FDR control procedure, termed as Covariate-Modulated Local Index of Significance (cmLIS) procedure, which not only takes into account local correlations among tests but also accommodates the covariate information by leveraging a covariate-modulated hidden Markov model (HMM), has been proposed. In the oracle case where all parameters of the covariate-modulated HMM are known, the cmLIS procedure is shown to be valid and optimal in some sense. According to whether the number of mixed components in the nonnull distribution is known, two Bayesian sampling algorithms are provided for parameter estimation. Extensive simulations are conducted to demonstrate the effectiveness of the cmLIS procedure over state-of-the-art multiple testing procedures. Finally, the cmLIS procedure is applied to an RNA sequencing data and a schizophrenia (SCZ) data. (c) 2022 Elsevier B.V. All rights reserved.
机构:
Northeast Normal Univ, Sch Math & Stat, Key Lab Appl Stat MOE, Changchun 130024, Jilin, Peoples R China
Dongbei Univ Finance & Econ, Sch Stat, Dalian, Peoples R ChinaNortheast Normal Univ, Sch Math & Stat, Key Lab Appl Stat MOE, Changchun 130024, Jilin, Peoples R China
Wang, Pengfei
Zhu, Wensheng
论文数: 0引用数: 0
h-index: 0
机构:
Northeast Normal Univ, Sch Math & Stat, Key Lab Appl Stat MOE, Changchun 130024, Jilin, Peoples R ChinaNortheast Normal Univ, Sch Math & Stat, Key Lab Appl Stat MOE, Changchun 130024, Jilin, Peoples R China
机构:
Stanford Univ, Dept Elect Engn, Palo Alto, CA 94304 USAStanford Univ, Dept Elect Engn, Palo Alto, CA 94304 USA
Zhang, Martin J.
Xia, Fei
论文数: 0引用数: 0
h-index: 0
机构:
Stanford Univ, Dept Elect Engn, Palo Alto, CA 94304 USAStanford Univ, Dept Elect Engn, Palo Alto, CA 94304 USA
Xia, Fei
Zou, James
论文数: 0引用数: 0
h-index: 0
机构:
Stanford Univ, Dept Elect Engn, Palo Alto, CA 94304 USA
Stanford Univ, Dept Biomed Data Sci, Palo Alto, CA 94304 USA
Chan Zuckerberg Biohub, San Francisco, CA 94158 USAStanford Univ, Dept Elect Engn, Palo Alto, CA 94304 USA