A TESTING BASED APPROACH TO THE DISCOVERY OF DIFFERENTIALLY CORRELATED VARIABLE SETS

被引:4
作者
Bodwin, Kelly [1 ]
Zhang, Kai [1 ]
Nobel, Andrew [1 ]
机构
[1] Univ North Carolina Chapel Hill, Dept Stat & Operat Res, Chapel Hill, NC 27599 USA
基金
美国国家科学基金会;
关键词
Differential correlation mining; association mining; biostatistics; genomics; high-dimensional data; COVARIANCE-MATRIX; R PACKAGE; EXPRESSION; NETWORKS; CONNECTIVITY; ACTIVATION; HYPOTHESIS; DEPENDENCE; ELEMENTS; MODELS;
D O I
10.1214/17-AOAS1083
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Given data obtained under two sampling conditions, it is often of interest to identify variables that behave differently in one condition than in the other. We introduce a method for differential analysis of second-order behavior called Differential Correlation Mining (DCM). The DCM method identifies differentially correlated sets of variables, with the property that the average pairwise correlation between variables in a set is higher under one sample condition than the other. DCM is based on an iterative search procedure that adaptively updates the size and elements of a candidate variable set. Updates are performed via hypothesis testing of individual variables, based on the asymptotic distribution of their average differential correlation. We investigate the performance of DCM by applying it to simulated data as well as to recent experimental datasets in genomics and brain imaging.
引用
收藏
页码:1180 / 1203
页数:24
相关论文
共 47 条
[1]  
Anderson T., 1959, An Introduction to Multivariate Statistical Analysis
[2]  
Bassi F., 2012, Proceedings of the 2012 IEEE International Symposium on Information Theory - ISIT, P2591, DOI 10.1109/ISIT.2012.6283986
[3]  
Benjamini Y, 2001, ANN STAT, V29, P1165
[4]   COVARIANCE REGULARIZATION BY THRESHOLDING [J].
Bickel, Peter J. ;
Levina, Elizaveta .
ANNALS OF STATISTICS, 2008, 36 (06) :2577-2604
[5]  
Bishop C. M., 2006, Pattern recognition and machine learning (information science and statistics), DOI [DOI 10.1007/978-0-387-45528-0, 10.1007/978-0-387-45528-0]
[6]   New network topology approaches reveal differential correlation patterns in breast cancer [J].
Bockmayr, Michael ;
Klauschen, Frederick ;
Gyoerffy, Balazs ;
Denkert, Carsten ;
Budczies, Jan .
BMC SYSTEMS BIOLOGY, 2013, 7
[7]  
BODWIN K., 2018, TESTING BASED APPR S, DOI [10.1214/17-AOAS1083SUPP, DOI 10.1214/17-AOAS1083SUPP]
[8]   THE ASYMPTOTIC COVARIANCE-MATRIX OF SAMPLE CORRELATION-COEFFICIENTS UNDER GENERAL CONDITIONS [J].
BROWNE, MW ;
SHAPIRO, A .
LINEAR ALGEBRA AND ITS APPLICATIONS, 1986, 82 :169-176
[9]  
Cai T.T., 2014, TECHNICAL REPORT
[10]   LIMITING LAWS OF COHERENCE OF RANDOM MATRICES WITH APPLICATIONS TO TESTING COVARIANCE STRUCTURE AND CONSTRUCTION OF COMPRESSED SENSING MATRICES [J].
Cai, T. Tony ;
Jiang, Tiefeng .
ANNALS OF STATISTICS, 2011, 39 (03) :1496-1525