Methods for biological data integration: perspectives and challenges

被引:161
作者
Gligorijevic, Vladimir [1 ]
Przulj, Natasa [1 ]
机构
[1] Univ London Imperial Coll Sci Technol & Med, Dept Comp, London SW7 2AZ, England
基金
欧洲研究理事会; 美国国家科学基金会;
关键词
data fusion; biological networks; non-negative matrix factorization; systems biology; omics data; heterogeneous data integration; PROTEIN-PROTEIN INTERACTIONS; GENOME-WIDE ASSOCIATION; NONNEGATIVE MATRIX FACTORIZATION; MICROARRAY DATA; INTERACTION NETWORKS; FUNCTION PREDICTION; INTERACTION MAP; GENE-FUNCTION; SCALE-FREE; RNA-SEQ;
D O I
10.1098/rsif.2015.0571
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Rapid technological advances have led to the production of different types of biological data and enabled construction of complex networks with various types of interactions between diverse biological entities. Standard network data analysis methods were shown to be limited in dealing with such heterogeneous networked data and consequently, new methods for integrative data analyses have been proposed. The integrative methods can collectively mine multiple types of biological data and produce more holistic, systems-level biological insights. We survey recent methods for collective mining (integration) of various types of networked biological data. We compare different state-of-the-art methods for data integration and highlight their advantages and disadvantages in addressing important biological problems. We identify the important computational challenges of these methods and provide a general guideline for which methods are suited for specific biological problems, or specific data types. Moreover, we propose that recent non-negative matrix factorization-based approaches may become the integration methodology of choice, as they are well suited and accurate in dealing with heterogeneous data and have many opportunities for further development.
引用
收藏
页数:19
相关论文
共 195 条
[1]   Graph-based methods for analysing networks in cell biology [J].
Aittokallio, Tero ;
Schwikowski, Benno .
BRIEFINGS IN BIOINFORMATICS, 2006, 7 (03) :243-255
[2]   Scale-free networks in cell biology [J].
Albert, R .
JOURNAL OF CELL SCIENCE, 2005, 118 (21) :4947-4957
[3]   Network inference, analysis, and modeling in systems biology [J].
Albert, Reke .
PLANT CELL, 2007, 19 (11) :3327-3338
[4]  
Albright R., 2006, Algorithms, initializations, and convergence for the nonnegative matrix factorization
[5]   KeyPathwayMiner 4.0: condition-specific pathway analysis by combining multiple omics studies and networks with Cytoscape [J].
Alcaraz, Nicolas ;
Pauling, Josch ;
Batra, Richa ;
Barbosa, Eudes ;
Junge, Alexander ;
Christensen, Anne G. L. ;
Azevedo, Vasco ;
Ditzel, Henrik J. ;
Baumbach, Jan .
BMC SYSTEMS BIOLOGY, 2014, 8
[6]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[7]  
[Anonymous], 2002, Series: Springer Series in Statistics
[8]  
[Anonymous], 1990, Matrix theory and applications
[9]  
[Anonymous], 2008, BAYESIAN NETWORKS
[10]  
[Anonymous], SYSTEMS BIOL