Unsupervised Extraction of Stable Expression Signatures from Public Compendia with an Ensemble of Neural Networks

被引:62
作者
Tan, Jie [1 ]
Doing, Georgia [2 ]
Lewis, Kimberley A. [2 ]
Price, Courtney E. [2 ]
Chen, Kathleen M. [3 ]
Cady, Kyle C. [4 ,5 ]
Perchuk, Barret [4 ,5 ]
Laub, Michael T. [4 ,5 ]
Hogan, Deborah A. [2 ]
Greene, Casey S. [3 ]
机构
[1] Geisel Sch Med Dartmouth, Dept Mol & Syst Biol, Hanover, NH USA
[2] Geisel Sch Med Dartmouth, Dept Microbiol & Immunol, Hanover, NH USA
[3] Univ Penn, Dept Syst Pharmacol & Translat Therapeut, Philadelphia, PA 19104 USA
[4] MIT, Dept Biol, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[5] Howard Hughes Med Inst, Cambridge, MA USA
关键词
INDEPENDENT COMPONENT ANALYSIS; PSEUDOMONAS-AERUGINOSA; GENE-EXPRESSION; MICROARRAY DATA; CROSS-TALK; CLASS DISCOVERY; IDENTIFICATION; REGULON; PHOB; REPRESENTATION;
D O I
10.1016/j.cels.2017.06.003
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Cross-experiment comparisons in public data compendia are challenged by unmatched conditions and technical noise. The ADAGE method, which performs unsupervised integration with denoising autoencoder neural networks, can identify biological patterns, but because ADAGE models, like many neural networks, are over-parameterized, different ADAGE models perform equally well. To enhance model robustness and better build signatures consistent with biological pathways, we developed an ensemble ADAGE (eADAGE) that integrated stable signatures across models. We applied eADAGE to a compendium of Pseudomonas aeruginosa gene expression profiling experiments performed in 78 media. eADAGE revealed a phosphate starvation response controlled by PhoB in media with moderate phosphate and predicted that a second stimulus provided by the sensor kinase, KinB, is required for this PhoB activation. We validated this relationship using both targeted and unbiased genetic approaches. eADAGE, which captures stable biological patterns, enables cross-experiment comparisons that can highlight measured but undiscovered relationships.
引用
收藏
页码:63 / +
页数:15
相关论文
共 55 条
[1]   Singular value decomposition for genome-wide expression data processing and modeling [J].
Alter, O ;
Brown, PO ;
Botstein, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (18) :10101-10106
[2]  
[Anonymous], AM SOC MICROBIOL J
[3]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[4]  
Beaulieu-Jones BK, 2017, BIOCOMPUT-PAC SYM, P207, DOI 10.1142/9789813207813_0021
[5]   Semi-supervised learning of the electronic health record for phenotype stratification [J].
Beaulieu-Jones, Brett K. ;
Greene, Casey S. .
JOURNAL OF BIOMEDICAL INFORMATICS, 2016, 64 :168-178
[6]   Representation Learning: A Review and New Perspectives [J].
Bengio, Yoshua ;
Courville, Aaron ;
Vincent, Pascal .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828
[7]   Lysogeny at mid-twentieth century: P1, P2, and other experimental, systems [J].
Bertani, G .
JOURNAL OF BACTERIOLOGY, 2004, 186 (03) :595-600
[8]   Cross talk between the response regulators PhoB and TctD allows for the integration of diverse environmental signals in Pseudomonas aeruginosa [J].
Bielecki, Piotr ;
Jensen, Vanessa ;
Schulze, Wiebke ;
Goedeke, Julia ;
Strehmel, Janine ;
Eckweiler, Denitsa ;
Nicolai, Tanja ;
Bielecka, Agata ;
Wille, Thorsten ;
Gerlach, Roman G. ;
Haeussler, Susanne .
NUCLEIC ACIDS RESEARCH, 2015, 43 (13) :6413-6425
[9]   The Effect of pstS and phoB on Quorum Sensing and Swarming Motility in Pseudomonas aeruginosa [J].
Blus-Kadosh, Inna ;
Zilka, Anat ;
Yerushalmi, Gal ;
Banin, Ehud .
PLOS ONE, 2013, 8 (09)
[10]   Knowledge-guided multi-scale independent component analysis for biomarker identification [J].
Chen, Li ;
Xuan, Jianhua ;
Wang, Chen ;
Shih, Ie-Ming ;
Wang, Yue ;
Zhang, Zhen ;
Hoffman, Eric ;
Clarke, Robert .
BMC BIOINFORMATICS, 2008, 9 (1)