The non-negative matrix factorization toolbox for biological data mining

被引:132
|
作者
Li, Yifeng [1 ]
Ngom, Alioune [1 ]
机构
[1] Univ Windsor, Sch Comp Sci, Windsor, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Non-negative matrix factorization; Clustering; Bi-clustering; Feature extraction; Feature selection; Classification; Missing values;
D O I
10.1186/1751-0473-8-10
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Non-negative matrix factorization (NMF) has been introduced as an important method for mining biological data. Though there currently exists packages implemented in R and other programming languages, they either provide only a few optimization algorithms or focus on a specific application field. There does not exist a complete NMF package for the bioinformatics community, and in order to perform various data mining tasks on biological data. Results: We provide a convenient MATLAB toolbox containing both the implementations of various NMF techniques and a variety of NMF-based data mining approaches for analyzing biological data. Data mining approaches implemented within the toolbox include data clustering and bi-clustering, feature extraction and selection, sample classification, missing values imputation, data visualization, and statistical comparison. Conclusions: A series of analysis such as molecular pattern discovery, biological process identification, dimension reduction, disease prediction, visualization, and statistical comparison can be performed using this toolbox.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Multichannel Extensions of Non-Negative Matrix Factorization With Complex-Valued Data
    Sawada, Hiroshi
    Kameoka, Hirokazu
    Araki, Shoko
    Ueda, Naonori
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (05): : 971 - 982
  • [22] Rank-Adaptive Non-Negative Matrix Factorization
    Shan, Dong
    Xu, Xinzheng
    Liang, Tianming
    Ding, Shifei
    COGNITIVE COMPUTATION, 2018, 10 (03) : 506 - 515
  • [23] Discriminative semi-supervised non-negative matrix factorization for data clustering
    Xing, Zhiwei
    Wen, Meng
    Peng, Jigen
    Feng, Jinqian
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 103
  • [24] A zero-inflated non-negative matrix factorization for the deconvolution of mixed signals of biological data
    Kong, Yixin
    Kozik, Ariangela
    Nakatsu, Cindy H.
    Jones-Hall, Yava L.
    Chun, Hyonho
    INTERNATIONAL JOURNAL OF BIOSTATISTICS, 2022, 18 (01) : 203 - 218
  • [25] Biased unconstrained non-negative matrix factorization for clustering
    Deng, Ping
    Zhang, Fan
    Li, Tianrui
    Wang, Hongjun
    Horng, Shi-Jinn
    KNOWLEDGE-BASED SYSTEMS, 2022, 239
  • [26] Constrained Non-negative Matrix Factorization with Graph Laplacian
    Chen, Pan
    He, Yangcheng
    Lu, Hongtao
    Wu, Li
    NEURAL INFORMATION PROCESSING, PT III, 2015, 9491 : 635 - 644
  • [27] Robust non-negative matrix factorization for subspace learning
    Dai, Xiangguang
    Tao, Yingyin
    Zhang, Wei
    Feng, Yuming
    ITALIAN JOURNAL OF PURE AND APPLIED MATHEMATICS, 2020, (44): : 511 - 520
  • [28] Optimization and expansion of non-negative matrix factorization
    Lin, Xihui
    Boutros, Paul C.
    BMC BIOINFORMATICS, 2020, 21 (01)
  • [29] General subspace constrained non-negative matrix factorization for data representation
    Liu, Yong
    Liao, Yiyi
    Tang, Liang
    Tang, Feng
    Liu, Weicong
    NEUROCOMPUTING, 2016, 173 : 224 - 232
  • [30] Novel Algorithm for Non-Negative Matrix Factorization
    Tran Dang Hien
    Do Van Tuan
    Pham Van At
    Le Hung Son
    NEW MATHEMATICS AND NATURAL COMPUTATION, 2015, 11 (02) : 121 - 133