bioNMF: a versatile tool for non-negative matrix factorization in biology

被引:71
作者
Pascual-Montano, Alberto [1 ]
Carmona-Saez, Pedro
Chagoyen, Monica
Tirado, Francisco
Carazo, Jose M.
Pascual-Marqui, Roberto D.
机构
[1] Univ Complutense Madrid, Fac Ciencias Fis, Comp Architecture Dept, E-28040 Madrid, Spain
[2] Natl Biotechnol Ctr, BioComp Unit, E-28049 Madrid, Spain
[3] Univ Hosp Psychiat, KEY Inst Brain Mind Res, CH-8029 Zurich, Switzerland
关键词
D O I
10.1186/1471-2105-7-366
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: In the Bioinformatics field, a great deal of interest has been given to Non-negative matrix factorization technique (NMF), due to its capability of providing new insights and relevant information about the complex latent relationships in experimental data sets. This method, and some of its variants, has been successfully applied to gene expression, sequence analysis, functional characterization of genes and text mining. Even if the interest on this technique by the bioinformatics community has been increased during the last few years, there are not many available simple standalone tools to specifically perform these types of data analysis in an integrated environment. Results: In this work we propose a versatile and user-friendly tool that implements the NMF methodology in different analysis contexts to support some of the most important reported applications of this new methodology. This includes clustering and biclustering gene expression data, protein sequence analysis, text mining of biomedical literature and sample classification using gene expression. The tool, which is named bioNMF, also contains a user-friendly graphical interface to explore results in an interactive manner and facilitate in this way the exploratory data analysis process. Conclusion: bioNMF is a standalone versatile application which does not require any special installation or libraries. It can be used for most of the multiple applications proposed in the bioinformatics field or to support new research using this method. This tool is publicly available at http://www.dacya.ucm.es/apascual/bioNMF.
引用
收藏
页数:9
相关论文
共 27 条
[1]   Metagenes and molecular pattern discovery using matrix factorization [J].
Brunet, JP ;
Tamayo, P ;
Golub, TR ;
Mesirov, JP .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (12) :4164-4169
[2]   Biclustering of gene expression data by non-smooth non-negative matrix factorization [J].
Carmona-Saez, P ;
Pascual-Marqui, RD ;
Tirado, F ;
Carazo, JM ;
Pascual-Montano, A .
BMC BIOINFORMATICS, 2006, 7 (1)
[3]   High-resolution genomic profiles define distinct clinico-pathogenetic subgroups of multiple myeloma patients [J].
Carrasco, DR ;
Tonon, G ;
Huang, YS ;
Zhang, YY ;
Sinha, R ;
Bin, F ;
Stewart, JP ;
Zhan, FG ;
Khatry, D ;
Protopopova, M ;
Protopopov, A ;
Sukhdeo, K ;
Hanamura, I ;
Stephens, O ;
Barlogie, B ;
Anderson, KC ;
Chin, L ;
Shaughnessy, JD ;
Brennan, C ;
DePinho, RA .
CANCER CELL, 2006, 9 (04) :313-325
[4]   Discovering semantic features in the literature: a foundation for building functional associations [J].
Chagoyen, M ;
Carmona-Saez, P ;
Shatkay, H ;
Carazo, JM ;
Pascual-Montano, A .
BMC BIOINFORMATICS, 2006, 7 (1)
[5]  
Dai JJ, 2006, STAT APPL GENET MOL, V5
[6]   Engene: the processing and exploratory analysis of gene expression data [J].
de la Nava, JG ;
Santaella, DF ;
Alba, JC ;
Carazo, JM ;
Trelles, O ;
Pascual-Montano, A .
BIOINFORMATICS, 2003, 19 (05) :657-658
[7]   Improving molecular cancer class discovery through sparse non-negative matrix factorization [J].
Gao, Y ;
Church, G .
BIOINFORMATICS, 2005, 21 (21) :3970-3975
[8]   Coupled two-way clustering analysis of gene microarray data [J].
Getz, G ;
Levine, E ;
Domany, E .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (22) :12079-12084
[9]   Biologically valid linear factor models of gene expression [J].
Girolami, M ;
Breitling, R .
BIOINFORMATICS, 2004, 20 (17) :3021-3033
[10]   Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring [J].
Golub, TR ;
Slonim, DK ;
Tamayo, P ;
Huard, C ;
Gaasenbeek, M ;
Mesirov, JP ;
Coller, H ;
Loh, ML ;
Downing, JR ;
Caligiuri, MA ;
Bloomfield, CD ;
Lander, ES .
SCIENCE, 1999, 286 (5439) :531-537