HTRPCA: Hypergraph Regularized Tensor Robust Principal Component Analysis for Sample Clustering in Tumor Omics Data

被引:7
作者
Zhao, Yu-Ying [1 ]
Jiao, Cui-Na [1 ]
Wang, Mao-Li [1 ]
Liu, Jin-Xing [1 ,2 ]
Wang, Juan [1 ]
Zheng, Chun-Hou [1 ]
机构
[1] Qufu Normal Univ, Sch Comp Sci, Rizhao, Peoples R China
[2] Rizhao Huilian Zhongchuang Inst Intelligent Techn, Rizhao 276826, Peoples R China
基金
中国国家自然科学基金;
关键词
Low-rank tensor; Hypergraph; Sample clustering; Tensor robust principal component analysis; FACTORIZATION;
D O I
10.1007/s12539-021-00441-8
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In recent years, clustering analysis of cancer genomics data has gained widespread attention. However, limited by the dimensions of the matrix, the traditional methods cannot fully mine the underlying geometric structure information in the data. Besides, noise and outliers inevitably exist in the data. To solve the above two problems, we come up with a new method which uses tensor to represent cancer omics data and applies hypergraph to save the geometric structure information in original data. This model is called hypergraph regularized tensor robust principal component analysis (HTRPCA). The data processed by HTRPCA becomes two parts, one of which is a low-rank component that contains pure underlying structure information between samples, and the other is some sparse interference points. So we can use the low-rank component for clustering. This model can retain complex geometric information between more sample points due to the addition of the hypergraph regularization. Through clustering, we can demonstrate the effectiveness of HTRPCA, and the experimental results on TCGA datasets demonstrate that HTRPCA precedes other advanced methods. [GRAPHICS]
引用
收藏
页码:22 / 33
页数:12
相关论文
共 33 条
[1]   Sparse Bayesian Methods for Low-Rank Matrix Estimation [J].
Babacan, S. Derin ;
Luessi, Martin ;
Molina, Rafael ;
Katsaggelos, Aggelos K. .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2012, 60 (08) :3964-3977
[2]   Predicting Diabetes: Clinical, Biological, and Genetic Approaches Data from the Epidemiological Study on the Insulin Resistance Syndrome (DESIR) [J].
Balkau, Beverley ;
Lange, Celine ;
Fezeu, Leopold ;
Tichet, Jean ;
De Lauzon-Guillain, Blandine ;
Cernichow, Sebastien ;
Fumeron, Frederic ;
Froguel, Philippe ;
Vaxillaire, Martine ;
Cauchi, Stephane ;
Ducimetiere, Pierre ;
Eschwege, Eveline .
DIABETES CARE, 2008, 31 (10) :2056-2061
[3]   Graph Regularized Nonnegative Matrix Factorization for Data Representation [J].
Cai, Deng ;
He, Xiaofei ;
Han, Jiawei ;
Huang, Thomas S. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (08) :1548-1560
[4]   Robust Principal Component Analysis? [J].
Candes, Emmanuel J. ;
Li, Xiaodong ;
Ma, Yi ;
Wright, John .
JOURNAL OF THE ACM, 2011, 58 (03)
[5]   The direct extension of ADMM for multi-block convex minimization problems is not necessarily convergent [J].
Chen, Caihua ;
He, Bingsheng ;
Ye, Yinyu ;
Yuan, Xiaoming .
MATHEMATICAL PROGRAMMING, 2016, 155 (1-2) :57-79
[6]  
Chen CF, 2012, PROC CVPR IEEE, P2618, DOI 10.1109/CVPR.2012.6247981
[7]   Robust Subspace Segmentation Via Low-Rank Representation [J].
Chen, Jinhui ;
Yang, Jian .
IEEE TRANSACTIONS ON CYBERNETICS, 2014, 44 (08) :1432-1445
[8]   Joint L1/2-Norm Constraint and Graph-Laplacian PCA Method for Feature Extraction [J].
Feng, Chun-Mei ;
Gao, Ying-Lian ;
Liu, Jin-Xing ;
Wang, Juan ;
Wang, Dong-Qin ;
Wen, Chang-Gang .
BIOMED RESEARCH INTERNATIONAL, 2017, 2017
[9]   Feature selection in principal component analysis of analytical data [J].
Guo, Q ;
Wu, W ;
Massart, DL ;
Boucon, C ;
de Jong, S .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2002, 61 (1-2) :123-132
[10]   Differentially Expressed Genes Extracted by the Tensor Robust Principal Component Analysis (TRPCA) Method [J].
Hu, Yue ;
Liu, Jin-Xing ;
Gao, Ying-Lian ;
Li, Sheng-Jun ;
Wang, Juan .
COMPLEXITY, 2019, 2019