Conditional generative adversarial network for gene expression inference

被引:42
作者
Wang, Xiaoqian [1 ]
Dizaji, Kamran Ghasedi [1 ]
Huang, Heng [1 ]
机构
[1] Univ Pittsburgh, Dept Elect & Comp Engn, Pittsburgh, PA 15261 USA
基金
美国国家科学基金会;
关键词
NORMALIZATION;
D O I
10.1093/bioinformatics/bty563
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The rapid progress of gene expression profiling has facilitated the prosperity of recent biological studies in various fields, where gene expression data characterizes various cell conditions and regulatory mechanisms under different experimental circumstances. Despite the widespread application of gene expression profiling and advances in high-throughput technologies, profiling in genome-wide level is still expensive and difficult. Previous studies found that high correlation exists in the expression pattern of different genes, such that a small subset of genes can be informative to approximately describe the entire transcriptome. In the Library of Integrated Network-based Cell-Signature program, a set of similar to 1000 landmark genes have been identified that contain similar to 80% information of the whole genome and can be used to predict the expression of remaining genes. For a cost-effective profiling strategy, traditional methods measure the profiles of landmark genes and then infer the expression of other target genes via linear models. However, linear models do not have the capacity to capture the non-linear associations in gene regulatory networks. Results: As a flexible model with high representative power, deep learning models provide an alternate to interpret the complex relation among genes. In this paper, we propose a deep learning architecture for the inference of target gene expression profiles. We construct a novel conditional generative adversarial network by incorporating both the adversarial and l(1)-norm loss terms in our model. Unlike the smooth and blurry predictions resulted by mean squared error objective, the coupled adversarial and l(1)-norm loss function leads to more accurate and sharp predictions. We validate our method under two different settings and find consistent and significant improvements over all the comparing methods.
引用
收藏
页码:603 / 611
页数:9
相关论文
共 49 条
[1]  
[Anonymous], 151105440 ARXIV
[2]  
[Anonymous], 2012, Technical Report
[3]  
[Anonymous], 2016, ARXIV161109340
[4]  
[Anonymous], 2013, P 30 INT C MACH LEAR
[5]  
[Anonymous], 170310593 ARXIV
[6]   On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation [J].
Bach, Sebastian ;
Binder, Alexander ;
Montavon, Gregoire ;
Klauschen, Frederick ;
Mueller, Klaus-Robert ;
Samek, Wojciech .
PLOS ONE, 2015, 10 (07)
[7]  
Benhenda M., 2017, 170808227 ARXIV
[8]   Stromal gene expression defines poor-prognosis subtypes in colorectal cancer [J].
Calon, Alexandre ;
Lonardo, Enza ;
Berenguer-Llergo, Antonio ;
Espinet, Elisa ;
Hernando-Momblona, Xavier ;
Iglesias, Mar ;
Sevillano, Marta ;
Palomo-Ponce, Sergio ;
Tauriello, Daniele V. F. ;
Byrom, Daniel ;
Cortina, Carme ;
Morral, Clara ;
Barcelo, Carles ;
Tosi, Sebastien ;
Riera, Antoni ;
Attolini, Camille Stephan-Otto ;
Rossell, David ;
Sancho, Elena ;
Batlle, Eduard .
NATURE GENETICS, 2015, 47 (04) :320-U62
[9]   Gene expression inference with deep learning [J].
Chen, Yifei ;
Li, Yi ;
Narayan, Rajiv ;
Subramanian, Aravind ;
Xie, Xiaohui .
BIOINFORMATICS, 2016, 32 (12) :1832-1839
[10]  
Collobert R., 2008, P 25 ICML, P160, DOI [DOI 10.1145/1390156.1390177, 10.1145/1390156.1390177]