The non-negative matrix factorization toolbox for biological data mining

Cited by: 132
Authors
Li, Yifeng [1]
Ngom, Alioune [1]
Affiliations
[1] University of Windsor, School of Computer Science, Windsor, ON, Canada
Funding
Natural Sciences and Engineering Research Council of Canada
Keywords
Non-negative matrix factorization; Clustering; Bi-clustering; Feature extraction; Feature selection; Classification; Missing values
DOI
10.1186/1751-0473-8-10
Chinese Library Classification number
Q [Biological sciences]
Discipline classification codes
07; 0710; 09
Abstract
Background: Non-negative matrix factorization (NMF) has been introduced as an important method for mining biological data. Although packages implemented in R and other programming languages exist, they either provide only a few optimization algorithms or focus on a specific application field. No complete NMF package exists that lets the bioinformatics community perform various data mining tasks on biological data. Results: We provide a convenient MATLAB toolbox containing both implementations of various NMF techniques and a variety of NMF-based data mining approaches for analyzing biological data. Data mining approaches implemented within the toolbox include data clustering and bi-clustering, feature extraction and selection, sample classification, missing value imputation, data visualization, and statistical comparison. Conclusions: A series of analyses, such as molecular pattern discovery, biological process identification, dimension reduction, disease prediction, visualization, and statistical comparison, can be performed using this toolbox.
Pages: 15
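
The abstract names NMF-based clustering as one of the toolbox's tasks. As a rough illustration of the underlying idea only, the sketch below factorizes a small non-negative matrix X (features in rows, samples in columns) into W and H using the standard multiplicative update rules for the Frobenius-norm objective, then assigns each sample to the factor with the largest coefficient in H. This is not the toolbox's MATLAB code or API: the function names (`nmf`, `matmul`, `transpose`, `frobenius_error`), the parameters, and the toy data are all hypothetical and chosen for illustration, and the record does not say which optimization algorithms the toolbox uses.

```python
import random


def matmul(A, B):
    """Multiply two matrices given as lists of rows."""
    cols_b = list(zip(*B))
    return [[sum(a * b for a, b in zip(row, col)) for col in cols_b] for row in A]


def transpose(A):
    """Return the transpose of a matrix given as a list of rows."""
    return [list(col) for col in zip(*A)]


def frobenius_error(X, W, H):
    """Frobenius norm of the residual X - W H."""
    WH = matmul(W, H)
    return sum((x - y) ** 2 for rx, ry in zip(X, WH) for x, y in zip(rx, ry)) ** 0.5


def nmf(X, rank, n_iter=200, seed=0, eps=1e-9):
    """Approximate X (n_rows x n_cols, non-negative) as W H.

    Uses multiplicative updates for the Frobenius-norm objective.
    All names and defaults here are illustrative assumptions.
    """
    rng = random.Random(seed)
    n_rows, n_cols = len(X), len(X[0])
    # Random strictly positive initialization keeps all later updates non-negative.
    W = [[rng.random() + eps for _ in range(rank)] for _ in range(n_rows)]
    H = [[rng.random() + eps for _ in range(n_cols)] for _ in range(rank)]
    for _ in range(n_iter):
        # H <- H * (W^T X) / (W^T W H), element-wise
        Wt = transpose(W)
        num = matmul(Wt, X)
        den = matmul(matmul(Wt, W), H)
        H = [[h * a / (b + eps) for h, a, b in zip(rh, ra, rb)]
             for rh, ra, rb in zip(H, num, den)]
        # W <- W * (X H^T) / (W H H^T), element-wise
        Ht = transpose(H)
        num = matmul(X, Ht)
        den = matmul(W, matmul(H, Ht))
        W = [[w * a / (b + eps) for w, a, b in zip(rw, ra, rb)]
             for rw, ra, rb in zip(W, num, den)]
    return W, H


if __name__ == "__main__":
    # Toy data (made up): samples 0-1 load on features 0-1, samples 2-3 on features 2-3.
    X = [[5, 4, 0, 0],
         [4, 5, 0, 0],
         [0, 0, 3, 4],
         [0, 0, 4, 3]]
    rank = 2
    W, H = nmf(X, rank)
    # Cluster each sample (column) by its dominant factor in H.
    labels = [max(range(rank), key=lambda k: H[k][j]) for j in range(len(X[0]))]
    print("reconstruction error:", frobenius_error(X, W, H))
    print("sample cluster labels:", labels)
```

The updates keep W and H non-negative by construction, which is what makes the factors readable as additive parts. Because the result depends on the random initialization, cluster labels can be permuted between runs with different seeds.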