Reconstructing biological gene regulatory networks: Where optimization meets big data

被引:29
作者
Thomas S.A. [1 ]
Jin Y. [1 ]
机构
[1] Department of Computing, University of Surrey, Guildford, Surrey
基金
英国工程与自然科学研究理事会;
关键词
Big data; Data science; Data-driven optimization; Evolutionary algorithms; Gene regulatory network reconstruction; Metaheuristics;
D O I
10.1007/s12065-013-0098-7
中图分类号
学科分类号
摘要
The importance of 'big data' in biology is increasing as vast quantities of data are being produced from high-throughput experiments. Techniques such as DNA microarrays are providing a genome-wide picture of gene expression levels, allowing us to investigate the structure and interactions of gene networks in biological systems. Inference of gene regulatory network (GRN) is an underdetermined problem suited to Metaheuristic algorithms which can operate on limited information. Thus GRN inference offers a platform for investigations into data intensive sciences and large scale optimization problems. Here we examine the link between data intensive research and optimization problems for the reverse engineering of GRNs. Briefly, we detail the benefit of the data deluge and the study of ALife for modelling GRNs as well as their reconstruction. We discuss how metaheuristics can solve big data problems and the inference of GRNs offer real world problems for both areas of research. We overview some current reconstruction algorithms and investigate some modelling and computational limits of the inference processes and suggest some areas for development. Furthermore we identify links and synergies between optimization and big data, e.g., dynamic, uncertain and large scale optimization problems, and discuss the potential benefit of multi- and many-objective optimization. We stress the importance of data integration techniques in order to maximize the data available, particularly for the case of inferring GRNs from microarray data. Such multi-disciplinary research is vital as biology is rapidly becoming a quantitative, data intensive science. © 2013 Springer-Verlag Berlin Heidelberg.
引用
收藏
页码:29 / 47
页数:18
相关论文
共 170 条
[1]  
Aijo T., Lahdesmaki H., Learning gene regulatory networks from gene expression measurements using non-parametric molecular kinetics, Bioinformatics, 25, 22, pp. 2937-2944, (2009)
[2]  
Akutsu T., Hayashida M., Tamura T., Algorithms for inference, analysis and control of boolean networks, Algebraic biology. Lecture Notes in computer science, 5147, pp. 1-15, (2008)
[3]  
Albert R., Jeong H., Barabasi A.L., Error and attack tolerance of complex networks, Nature, 406, pp. 378-382, (2000)
[4]  
Allenby N.E.E., Laing E., Bucca G., Kierzek A.M., Smith C.P., Diverse control of metabolism and other cellular processes in streptomyces coelicolor by the phop transcription factor: genome-wide identification of in vivo targets, Nucleic Acids Res, 40, 19, pp. 9543-9556, (2012)
[5]  
Alon U., An Introduction to Systems Biology: Design Principles of Biological Circuits, (2006)
[6]  
Alon U., Network motifs: theory and experimental approaches, Nat. Rev. Genet., 8, pp. 450-461, (2007)
[7]  
Alvarez-benitez J.E., Everson R.M., Fieldsend J.E., A mopso algorithm based exclusively on pareto dominance concepts, Proceedings of the third international conference on evolutionary multicriterion optimization, pp. 459-473, (2005)
[8]  
Ando S., Iba H., Inference of gene regulatory model by genetic algorithms, Proceedings of the 2001 congress on evolutionary computation, 2001, 1, pp. 712-719, (2001)
[9]  
Babu M.M., Luscombe N.M., Aravind L., Gerstein M., Teichmann S.A., Structure and evolution of transcriptional regulatory networks, Curr Opin Struct Biol, 14, 3, pp. 283-291, (2004)
[10]  
Bandaru S., Deb K., Automated innovization for simultaneous discovery of multiple rules in bi-objective problems, Evolutionary Multi-Criterion Optimization, Lecture Notes in Computer Science, Vol. 6576, pp. 1-15, (2011)