GCNPCA: miRNA-Disease Associations Prediction Algorithm Based on Graph Convolutional Neural Networks

被引:10
作者
Liu, Jiwen [1 ]
Kuang, Zhufang [1 ]
Deng, Lei [2 ]
机构
[1] Cent South Univ Forestry & Technol, Sch Comp & Informat Engn, Changsha 410004, Hunan, Peoples R China
[2] Cent South Univ, Sch Comp Sci & Engn, Changsha 410083, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Diseases; Prediction algorithms; Neoplasms; Feature extraction; Semantics; Databases; Principal component analysis; miRNA-disease; graph convolutional network; heterogenous network; principal component analysis; random forest; MICRORNA; SIMILARITY;
D O I
10.1109/TCBB.2022.3203564
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
A growing number of studies have confirmed the important role of microRNAs (miRNAs) in human diseases and the aberrant expression of miRNAs affects the onset and progression of human diseases. The discovery of disease-associated miRNAs as new biomarkers promote the progress of disease pathology and clinical medicine. However, only a small proportion of miRNA-disease correlations have been validated by biological experiments. And identifying miRNA-disease associations through biological experiments is both expensive and inefficient. Therefore, it is important to develop efficient and highly accurate computational methods to predict miRNA-disease associations. A miRNA-disease associations prediction algorithm based on Graph Convolutional neural Networks and Principal Component Analysis (GCNPCA) is proposed in this paper. Specifically, the deep topological structure information is extracted from the heterogeneous network composed of miRNA and disease nodes by a Graph Convolutional neural Network (GCN) with an additional attention mechanism. The internal attribute information of the nodes is obtained by the Principal Component Analysis (PCA). Then, the topological structure information and the node attribute information are combined to construct comprehensive feature descriptors. Finally, the Random Forest (RF) is used to train and classify these feature descriptors. In the five-fold cross-validation experiment, the AUC and AUPR for the GCNPCA algorithm are 0.983 and 0.988 respectively.
引用
收藏
页码:1041 / 1052
页数:12
相关论文
共 41 条
[1]   Microarray profiling of microRNAs reveals frequent coexpression with neighboring miRNAs and host genes [J].
Baskerville, S ;
Bartel, DP .
RNA, 2005, 11 (03) :241-247
[2]   MicroRNA functions [J].
Bushati, Natascha ;
Cohen, Stephen M. .
ANNUAL REVIEW OF CELL AND DEVELOPMENTAL BIOLOGY, 2007, 23 :175-205
[3]   Inferring Potential CircRNA-Disease Associations via Deep Autoencoder-Based Classification [J].
Deepthi, K. ;
Jereesh, A. S. .
MOLECULAR DIAGNOSIS & THERAPY, 2021, 25 (01) :87-97
[4]   Graph2MDA: a multi-modal variational graph embedding model for predicting microbe-drug associations [J].
Deng, Lei ;
Huang, Yibiao ;
Liu, Xuejun ;
Liu, Hui .
BIOINFORMATICS, 2022, 38 (04) :1118-1125
[5]   IDSSIM: an lncRNA functional similarity calculation model based on an improved disease semantic similarity method [J].
Fan, Wenwen ;
Shang, Junliang ;
Li, Feng ;
Sun, Yan ;
Yuan, Shasha ;
Liu, Jin-Xing .
BMC BIOINFORMATICS, 2020, 21 (01)
[6]   Cancer statistics for the year 2020: An overview [J].
Ferlay, Jacques ;
Colombet, Murielle ;
Soerjomataram, Isabelle ;
Parkin, Donald M. ;
Pineros, Marion ;
Znaor, Ariana ;
Bray, Freddie .
INTERNATIONAL JOURNAL OF CANCER, 2021, 149 (04) :778-789
[7]   Graph regularized L2,1-nonnegative matrix factorization for miRNA-disease association prediction [J].
Gao, Zhen ;
Wang, Yu-Tian ;
Wu, Qing-Wen ;
Ni, Jian-Cheng ;
Zheng, Chun-Hou .
BMC BIOINFORMATICS, 2020, 21 (01)
[8]   word2vec, node2vec, graph2vec, X2vec: Towards a Theory of Vector Embeddings of Structured Data [J].
Grohe, Martin .
PODS'20: PROCEEDINGS OF THE 39TH ACM SIGMOD-SIGACT-SIGAI SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2020, :1-16
[9]   MSCNE:Predict miRNA-Disease Associations Using Neural Network Based on Multi-Source Biological Information [J].
Han, Genwei ;
Kuang, Zhufang ;
Deng, Lei .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (05) :2926-2937
[10]   A review on methods for diagnosis of breast cancer cells and tissues [J].
He, Ziyu ;
Chen, Zhu ;
Tan, Miduo ;
Elingarami, Sauli ;
Liu, Yuan ;
Li, Taotao ;
Deng, Yan ;
He, Nongyue ;
Li, Song ;
Fu, Juan ;
Li, Wen .
CELL PROLIFERATION, 2020, 53 (07)