We consider the problem of feature selection in a high-dimensional regression setting with multiple predictors and multiple responses. Assuming that the regression errors are i.i.d. when they are in fact dependent leads to inconsistent and inefficient feature estimates. We relax the i.i.d. assumption by allowing the errors to exhibit a tree-structured dependence. This permits a Bayesian formulation in which the error dependence structure is treated as an auxiliary variable that can be integrated out analytically with the help of the matrix-tree theorem. Mixing over trees yields a flexible technique for modelling the graphical structure of the regression errors. Furthermore, the analytic integration yields a collapsed Gibbs sampler for feature selection that is computationally efficient. In simulations, our approach offers significant performance gains over competing methods, especially when the features themselves are correlated. In addition to comprehensive simulation studies, we apply our method to a high-dimensional breast cancer data set to identify markers significantly associated with the disease. Copyright (C) 2014 John Wiley & Sons, Ltd.
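As a rough illustration of why the matrix-tree theorem makes integration over trees tractable, the sketch below sums a product of edge weights over all spanning trees in two ways: by a single determinant of a reduced weighted Laplacian, and by brute-force enumeration. It is not the paper's sampler. The function names `tree_weight_sum` and `brute_force_sum` and the weight matrix `W` are assumptions made for this example, and the weights stand in for whatever per-edge factors a tree-structured model would assign.

```python
import itertools
import numpy as np


def tree_weight_sum(W):
    """Sum over all spanning trees of the product of edge weights.

    W is assumed symmetric with a zero diagonal. By the matrix-tree
    theorem, the sum equals any cofactor of the weighted Laplacian.
    """
    W = np.asarray(W, dtype=float)
    laplacian = np.diag(W.sum(axis=1)) - W
    # Delete the first row and column, then take the determinant.
    return np.linalg.det(laplacian[1:, 1:])


def brute_force_sum(W):
    """Same quantity by explicit enumeration (only feasible for tiny graphs)."""
    W = np.asarray(W, dtype=float)
    n = W.shape[0]
    edges = list(itertools.combinations(range(n), 2))
    total = 0.0
    for subset in itertools.combinations(edges, n - 1):
        parent = list(range(n))

        def find(x):
            # Union-find root lookup with path halving.
            while parent[x] != x:
                parent[x] = parent[parent[x]]
                x = parent[x]
            return x

        is_tree = True
        for i, j in subset:
            ri, rj = find(i), find(j)
            if ri == rj:  # adding this edge would close a cycle
                is_tree = False
                break
            parent[ri] = rj
        if is_tree:
            total += np.prod([W[i, j] for i, j in subset])
    return total
```

With unit weights on the complete graph, the determinant simply counts spanning trees, so four nodes give 16 by Cayley's formula. The determinant costs cubic time in the number of nodes, whereas the enumeration grows super-exponentially.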
Perthame, Emeline
Affiliation: Agrocampus Ouest, CNRS, UMR 6625, Inst Rech Math Rennes IRMAR, 65 Rue St Brieuc, F-35042 Rennes, France
Friguet, Chloe
Affiliation: Univ South Brittany, CNRS, UMR 6205, LMBA, Bat Y Coppens, Campus Tohannic, F-56000 Vannes, France
Causeur, David
Affiliation: Agrocampus Ouest, CNRS, UMR 6625, Inst Rech Math Rennes IRMAR, 65 Rue St Brieuc, F-35042 Rennes, France
Dai, Dengluan
Affiliation: Yunnan Univ, Yunnan Key Lab Stat Modeling & Data Anal, Kunming 650091, Peoples R China
Tang, Anmin
Affiliation: Yunnan Univ, Yunnan Key Lab Stat Modeling & Data Anal, Kunming 650091, Peoples R China
Ye, Jinli
Affiliation: Yunnan Univ, Yunnan Key Lab Stat Modeling & Data Anal, Kunming 650091, Peoples R China
Bolón-Canedo V.
Affiliation: Departamento de Computación, Universidade da Coruña, Campus de Elviña s/n, A Coruña
Sánchez-Maroño N.
Affiliation: Departamento de Computación, Universidade da Coruña, Campus de Elviña s/n, A Coruña
Alonso-Betanzos A.
Affiliation: Departamento de Computación, Universidade da Coruña, Campus de Elviña s/n, A Coruña