Non-negative matrix factorization by maximizing correntropy for cancer clustering

被引:99
作者
Wang, Jim Jing-Yan [1 ]
Wang, Xiaolei [1 ]
Gao, Xin [1 ,2 ]
机构
[1] King Abdullah Univ Sci & Technol, Comp Elect & Math Sci & Engn Div, Thuwal 239556900, Saudi Arabia
[2] King Abdullah Univ Sci & Technol, Computat Biosci Res Ctr, Thuwal 239556900, Saudi Arabia
关键词
GENE-EXPRESSION DATA; COMPREHENSIVE EVALUATION; CLASS DISCOVERY; CLASSIFICATION; PREDICTION; MODEL;
D O I
10.1186/1471-2105-14-107
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Non-negative matrix factorization (NMF) has been shown to be a powerful tool for clustering gene expression data, which are widely used to classify cancers. NMF aims to find two non-negative matrices whose product closely approximates the original matrix. Traditional NMF methods minimize either the l(2) norm or the Kullback-Leibler distance between the product of the two matrices and the original matrix. Correntropy was recently shown to be an effective similarity measurement due to its stability to outliers or noise. Results: We propose a maximum correntropy criterion (MCC)-based NMF method (NMF-MCC) for gene expression data-based cancer clustering. Instead of minimizing the l(2) norm or the Kullback-Leibler distance, NMF-MCC maximizes the correntropy between the product of the two matrices and the original matrix. The optimization problem can be solved by an expectation conditional maximization algorithm. Conclusions: Extensive experiments on six cancer benchmark sets demonstrate that the
引用
收藏
页数:11
相关论文
共 50 条
[31]   Recovering Hidden Diagonal Structures via Non-Negative Matrix Factorization with Multiple Constraints [J].
Yang, Xi ;
Han, Guoqiang ;
Cai, Hongmin ;
Song, Yan .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2019, 16 (05) :1760-1772
[32]   Tumor Classification Based on Non-Negative Matrix Factorization Using Gene Expression Data [J].
Zheng, Chun-Hou ;
Ng, To-Yee ;
Zhang, Lei ;
Shiu, Chi-Keung ;
Wang, Hong-Qiang .
IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2011, 10 (02) :86-93
[33]   Robust Manhattan non-negative matrix factorization for image recovery and representation [J].
Dai, Xiangguang ;
Su, Xiaojie ;
Zhang, Wei ;
Xue, Fangzheng ;
Li, Huaqing .
INFORMATION SCIENCES, 2020, 527 :70-87
[34]   Bayesian Non-Negative Matrix Factorization With Adaptive Sparsity and Smoothness Prior [J].
Tichy, Ondrej ;
Bodiova, Lenka ;
Smidl, Vaclav .
IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (03) :510-514
[35]   A supervised non-negative matrix factorization model for speech emotion recognition [J].
Hou, Mixiao ;
Li, Jinxing ;
Lu, Guangming .
SPEECH COMMUNICATION, 2020, 124 :13-20
[36]   Improving Prediction Accuracy of Microarray Cancer Data with Non-negative Matrix Factorization and Its Variant [J].
Patel, Nakul ;
Passi, Kalpdrum ;
Jain, Chakresh Kumar .
2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, :2227-2234
[37]   Efficient Orthogonal Non-negative Matrix Factorization over Stiefel Manifold [J].
Zhang, Wei Emma ;
Tan, Mingkui ;
Sheng, Quan Z. ;
Yao, Lina ;
Shi, Qingfeng .
CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, :1743-1752
[38]   Distributed non-negative matrix factorization with determination of the number of latent features [J].
Chennupati, Gopinath ;
Vangara, Raviteja ;
Skau, Erik ;
Djidjev, Hristo ;
Alexandrov, Boian .
JOURNAL OF SUPERCOMPUTING, 2020, 76 (09) :7458-7488
[39]   A logistic non-negative matrix factorization approach to binary data sets [J].
Tome, A. M. ;
Schachtner, R. ;
Vigneron, V. ;
Puntonet, C. G. ;
Lang, E. W. .
MULTIDIMENSIONAL SYSTEMS AND SIGNAL PROCESSING, 2015, 26 (01) :125-143
[40]   Network-based integrative clustering of multiple types of genomic data using non-negative matrix factorization [J].
Chalise, Prabhakar ;
Ni, Yonghui ;
Fridley, Brooke L. .
COMPUTERS IN BIOLOGY AND MEDICINE, 2020, 118