Multi-View Clustering for Integration of Gene Expression and Methylation Data With Tensor Decomposition and Self-Representation Learning

被引:16
作者
Gao, Xiaowei [1 ]
Wang, Yan [2 ]
Hou, Weimin [3 ]
Liu, Zaiyi [4 ]
Ma, Xiaoke [5 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, Xian 710071, Shaanxi, Peoples R China
[2] Xidian Univ, Dept Lib, Xian 710071, Shaanxi, Peoples R China
[3] Northwest Minzu Univ, Minist Educ, Key Lab Chinas Ethn Languages Informat Technol, Lanzhou 730030, Gansu, Peoples R China
[4] Hebei Univ Sci & Technol, Sch Informat Sci & Engn, Dept Radiol, Shijiazhuang 050018, Hebei, Peoples R China
[5] Guangdong Gen Hosp, Guangdong Acad Med Sci, Dept Radiol, Guangzhou 510080, Peoples R China
基金
中国国家自然科学基金;
关键词
DNA; Epigenetics; Bioinformatics; Genomics; Gene expression; Clustering algorithms; Tensors; DNA methylation; gene clustering; low-rank constraint; multi-view clustering; self-representation learning; DISCOVERY; INSIGHTS;
D O I
10.1109/TCBB.2022.3229678
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The accumulated DNA methylation and gene expression provide a great opportunity to exploit the epigenetic patterns of genes, which is the foundation for revealing the underlying mechanisms of biological systems. Current integrative algorithms are criticized for undesirable performance because they fail to address the heterogeneity of expression and methylation data, and the intrinsic relations among them. To solve this issue, a novel multi-view clustering with self-representation learning and low-rank tensor constraint (MCSL-LTC) is proposed for the integration of gene expression and DNA methylation data, which are treated as complementary views. Specifically, MCSL-LTC first learns the low-dimensional features for each view with the linear projection, and then these features are fused in a unified tensor space with low-rank constraints. In this case, the complementary information of various views is precisely captured, where the heterogeneity of omic data is avoided, thereby enhancing the consistency of different views. Finally, MCSL-LTC obtains a consensus cluster of genes reflecting the structure and features of various views. Experimental results demonstrate that the proposed approach outperforms state-of-the-art baselines in terms of accuracy on both the social and cancer data, which provides an effective and efficient method for the integration of heterogeneous genomic data.
引用
收藏
页码:2050 / 2063
页数:14
相关论文
共 63 条
[1]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[2]   Crosstalk between microRNA expression and DNA methylation drives the hormone-dependent phenotype of breast cancer [J].
Aure, Miriam Ragle ;
Fleischer, Thomas ;
Bjorklund, Sunniva ;
Ankill, Jorgen ;
Castro-Mondragon, Jaime A. ;
Borresen-Dale, Anne-Lise ;
Tost, Jorg ;
Sahlberg, Kristine K. ;
Mathelier, Anthony ;
Tekpli, Xavier ;
Kristensen, Vessela N. .
GENOME MEDICINE, 2021, 13 (01)
[3]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[4]   Diversity-induced Multi-view Subspace Clustering [J].
Cao, Xiaochun ;
Zhang, Changqing ;
Fu, Huazhu ;
Liu, Si ;
Zhang, Hua .
2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, :586-594
[5]   Similarity Fusion via Exploiting High Order Proximity for Cancer Subtyping [J].
Chen, Jiazhou ;
Rong, Wentao ;
Tao, Guihua ;
Cai, Hongmin .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (01) :658-667
[6]  
Chen MS, 2020, AAAI CONF ARTIF INTE, V34, P3513
[7]  
Cheng B, 2011, IEEE I CONF COMP VIS, P2439, DOI 10.1109/ICCV.2011.6126528
[8]   Sparse Subspace Clustering: Algorithm, Theory, and Applications [J].
Elhamifar, Ehsan ;
Vidal, Rene .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (11) :2765-2781
[9]  
Fazel Maryam., 2002, MATRIX RANK MINIMIZA
[10]  
Fleischer T, 2014, GENOME BIOL, V15, DOI [10.1186/s13059-014-0435-x, 10.1186/PREACCEPT-2333349012841587]