Discovering Perturbation of Modular Structure in HIV Progression by Integrating Multiple Data Sources Through Non-Negative Matrix Factorization

被引:2
作者
Ray, Sumanta [1 ]
Maulik, Ujjwal [2 ]
机构
[1] Aliah Univ, Dept Comp Sci & Engn, Kolkata 700156, W Bengal, India
[2] Jadavpur Univ, Dept Comp Sci & Engn, Kolkata 700032, W Bengal, India
关键词
Non negative matrix factorization; data integration; gene expression; protei-protein interaction; IMMUNODEFICIENCY-VIRUS-INFECTION; LARGE GENE LISTS; NETWORK; EXPRESSION; PATHWAYS;
D O I
10.1109/TCBB.2016.2642184
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Detecting perturbation in modular structure during HIV-1 disease progression is an important step to understand stage specific infection pattern of HIV-1 virus in human cell. In this article, we proposed a novel methodology on integration of multiple biological information to identify such disruption in human gene module during different stages of HIV-1 infection. We integrate three different biological information: gene expression information, protein-protein interaction information, and gene ontology information in single gene meta-module, through non negative matrix factorization (NMF). As the identified meta-modules inherit those information so, detecting perturbation of these, reflects the changes in expression pattern, in PPI structure and in functional similarity of genes during the infection progression. To integrate modules of different data sources into strong meta-modules, NMF based clustering is utilized here. Perturbation in meta-modular structure is identified by investigating the topological and intramodular properties and putting rank to those meta-modules using a rank aggregation algorithm. We have also analyzed the preservation structure of significant GO terms in which the human proteins of the meta-modules participate. Moreover, we have performed an analysis to show the change of coregulation pattern of identified transcription factors (TFs) over the HIV progression stages.
引用
收藏
页码:869 / 877
页数:9
相关论文
共 35 条
[1]  
[Anonymous], 2013, P 2013 SIAM INT C DA
[2]   A review of in silico approaches for analysis and prediction of HIV-1-human protein-protein interactions [J].
Bandyopadhyay, Sanghamitra ;
Ray, Sumanta ;
Mukhopadhyay, Anirban ;
Maulik, Ujjwal .
BRIEFINGS IN BIOINFORMATICS, 2015, 16 (05) :830-851
[3]   Is human blood a good surrogate for brain tissue in transcriptional studies? [J].
Cai, Chaochao ;
Langfelder, Peter ;
Fuller, Tova F. ;
Oldham, Michael C. ;
Luo, Rui ;
van den Berg, Leonard H. ;
Ophoff, Roel A. ;
Horvath, Steve .
BMC GENOMICS, 2010, 11
[4]  
Chu C, 2010, AM FAM PHYSICIAN, V81, P1239
[5]   Identifying proteins controlling key disease signaling pathways [J].
Gitter, Anthony ;
Bar-Joseph, Ziv .
BIOINFORMATICS, 2013, 29 (13) :227-236
[6]   Data integration in the era of omics: current and future challenges [J].
Gomez-Cabrero, David ;
Abugessaisa, Imad ;
Maier, Dieter ;
Teschendorff, Andrew ;
Merkenschlager, Matthias ;
Gisel, Andreas ;
Ballestar, Esteban ;
Bongcam-Rudloff, Erik ;
Conesa, Ana ;
Tegner, Jesper .
BMC SYSTEMS BIOLOGY, 2014, 8 :I1
[7]  
Greene D, 2009, LECT NOTES ARTIF INT, V5781, P423, DOI 10.1007/978-3-642-04180-8_45
[8]   Geometric Interpretation of Gene Coexpression Network Analysis [J].
Horvath, Steve ;
Dong, Jun .
PLOS COMPUTATIONAL BIOLOGY, 2008, 4 (08)
[9]   Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources [J].
Huang, Da Wei ;
Sherman, Brad T. ;
Lempicki, Richard A. .
NATURE PROTOCOLS, 2009, 4 (01) :44-57
[10]   Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists [J].
Huang, Da Wei ;
Sherman, Brad T. ;
Lempicki, Richard A. .
NUCLEIC ACIDS RESEARCH, 2009, 37 (01) :1-13