Stability-driven nonnegative matrix factorization to interpret spatial gene expression and build local gene networks

被引:95
作者
Wu, Siqi [1 ,2 ]
Joseph, Antony [1 ,2 ,3 ]
Hammonds, Ann S. [2 ]
Celniker, Susan E. [2 ]
Yu, Bin [1 ,4 ]
Frise, Erwin [2 ]
机构
[1] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Lawrence Berkeley Natl Lab, Div Environm Genom & Syst Biol, Berkeley, CA 94720 USA
[3] Walmart Labs, San Bruno, CA 94066 USA
[4] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
基金
美国国家科学基金会;
关键词
principal patterns; stability selection; sparse decomposition; spatial gene expression; spatially local networks; INTRATUMOR HETEROGENEITY; TRANSCRIPTION FACTORS; AUTOMATED ANNOTATION; GLOBAL ANALYSIS; DROSOPHILA; PATTERNS; EMBRYO; EXTRACTION; DIVERSITY; EVOLUTION;
D O I
10.1073/pnas.1521171113
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Spatial gene expression patterns enable the detection of local covariability and are extremely useful for identifying local gene interactions during normal development. The abundance of spatial expression data in recent years has led to the modeling and analysis of regulatory networks. The inherent complexity of such data makes it a challenge to extract biological information. We developed staNMF, a method that combines a scalable implementation of nonnegative matrix factorization (NMF) with a new stability-driven model selection criterion. When applied to a set of Drosophila early embryonic spatial gene expression images, one of the largest datasets of its kind, staNMF identified 21 principal patterns (PP). Providing a compact yet biologically interpretable representation of Drosophila expression patterns, PP are comparable to a fate map generated experimentally by laser ablation and show exceptional promise as a data-driven alternative to manual annotations. Our analysis mapped genes to cell-fate programs and assigned putative biological roles to uncharacterized genes. Finally, we used the PP to generate local transcription factor regulatory networks. Spatially local correlation networks were constructed for six PP that span along the embryonic anterior-posterior axis. Using a two-tail 5% cutoff on correlation, we reproduced 10 of the 11 links in the well-studied gap gene network. The performance of PP with the Drosophila data suggests that staNMF provides informative decompositions and constitutes a useful computational lens through which to extract biological insight from complex and often noisy gene expression data.
引用
收藏
页码:4290 / 4295
页数:6
相关论文
共 49 条
[1]   The developmental expression dynamics of Drosophila melanogaster transcription factors [J].
Adryan, Boris ;
Teichmann, Sarah A. .
GENOME BIOLOGY, 2010, 11 (04)
[2]   Genetic and Phenotypic Diversity in Breast Tumor Metastases [J].
Almendro, Vanessa ;
Kim, Hee Jung ;
Cheng, Yu-Kang ;
Goenen, Mithat ;
Itzkovitz, Shalev ;
Argani, Pedram ;
van Oudenaarden, Alexander ;
Sukumar, Saraswati ;
Michor, Franziska ;
Polyak, Kornelia .
CANCER RESEARCH, 2014, 74 (05) :1338-1348
[3]   Inference of Tumor Evolution during Chemotherapy by Computational Modeling and In Situ Analysis of Genetic and Phenotypic Cellular Diversity [J].
Almendro, Vanessa ;
Cheng, Yu-Kang ;
Randles, Amanda ;
Itzkovitz, Shalev ;
Marusyk, Andriy ;
Ametller, Elisabet ;
Gonzalez-Farre, Xavier ;
Munoz, Montse ;
Russnes, Hege G. ;
Helland, Aslaug ;
Rye, Inga H. ;
Borresen-Dale, Anne-Lise ;
Maruyama, Reo ;
van Oudenaarden, Alexander ;
Dowsett, Mitchell ;
Jones, Robin L. ;
Reis-Filho, Jorge ;
Gascon, Pere ;
Goenen, Mithat ;
Michor, Franziska ;
Polyak, Kornelia .
CELL REPORTS, 2014, 6 (03) :514-527
[4]  
Amari S, 1996, ADV NEUR IN, V8, P757
[5]   The mesoderm determinant Snail collaborates with related zinc-finger proteins to control Drosophila neurogenesis [J].
Ashraf, SI ;
Hu, XD ;
Roote, J ;
Ip, YT .
EMBO JOURNAL, 1999, 18 (22) :6426-6438
[6]   Development of multiscale biological image data analysis: Review of 2006 International Workshop on Multiscale Biological Imaging, Data Mining and Informatics, Santa Barbara, USA (BII06) [J].
Auer, Manfred ;
Peng, Hanchuan ;
Singh, Ambuj .
BMC CELL BIOLOGY, 2007, 8 (Suppl 1)
[7]   Tumour heterogeneity in the clinic [J].
Bedard, Philippe L. ;
Hansen, Aaron R. ;
Ratain, Mark J. ;
Siu, Lillian L. .
NATURE, 2013, 501 (7467) :355-364
[8]   Metagenes and molecular pattern discovery using matrix factorization [J].
Brunet, JP ;
Tamayo, P ;
Golub, TR ;
Mesirov, JP .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (12) :4164-4169
[9]   Spatial and temporal diversity in genomic instability processes defines lung cancer evolution [J].
de Bruin, Elza C. ;
McGranahan, Nicholas ;
Mitter, Richard ;
Salm, Max ;
Wedge, David C. ;
Yates, Lucy ;
Jamal-Hanjani, Mariam ;
Shafi, Seema ;
Murugaesu, Nirupa ;
Rowan, Andrew J. ;
Groenroos, Eva ;
Muhammad, Madiha A. ;
Horswell, Stuart ;
Gerlinger, Marco ;
Varela, Ignacio ;
Jones, David ;
Marshall, John ;
Voet, Thierry ;
Van Loo, Peter ;
Rassl, Doris M. ;
Rintoul, Robert C. ;
Janes, Sam M. ;
Lee, Siow-Ming ;
Forster, Martin ;
Ahmad, Tanya ;
Lawrence, David ;
Falzon, Mary ;
Capitanio, Arrigo ;
Harkins, Timothy T. ;
Lee, Clarence C. ;
Tom, Warren ;
Teefe, Enock ;
Chen, Shann-Ching ;
Begum, Sharmin ;
Rabinowitz, Adam ;
Phillimore, Benjamin ;
Spencer-Dene, Bradley ;
Stamp, Gordon ;
Szallasi, Zoltan ;
Matthews, Nik ;
Stewart, Aengus ;
Campbell, Peter ;
Swanton, Charles .
SCIENCE, 2014, 346 (6206) :251-256
[10]   Systematic image-driven analysis of the spatial Drosophila embryonic expression landscape [J].
Frise, Erwin ;
Hammonds, Ann S. ;
Celniker, Susan E. .
MOLECULAR SYSTEMS BIOLOGY, 2010, 6