GAE-LGA: integration of multi-omics data with graph autoencoders to identify lncRNA-PCG associations

被引:11
作者
Gao, Meihong [1 ]
Liu, Shuhui [1 ]
Qi, Yang [1 ]
Guo, Xinpeng [1 ]
Shang, Xuequn [1 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci & Engn, Xian, Peoples R China
基金
中国国家自然科学基金;
关键词
long non-coding RNAs; protein-coding genes; lncRNA-PCG associations prediction; cross-omics correlation learning; graph autoencoders; LONG NONCODING RNAS; PREDICTION;
D O I
10.1093/bib/bbac452
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Long non-coding RNAs (lncRNAs) can disrupt the biological functions of protein-coding genes (PCGs) to cause cancer. However, the relationship between lncRNAs and PCGs remains unclear and difficult to predict. Machine learning has achieved a satisfactory performance in association prediction, but to our knowledge, it is currently less used in lncRNA-PCG association prediction. Therefore, we introduce GAE-LGA, a powerful deep learning model with graph autoencoders as components, to recognize potential lncRNA-PCG associations. GAE-LGA jointly explored lncRNA-PCG learning and cross-omics correlation learning for effective lncRNA-PCG association identification. The functional similarity and multi-omics similarity of lncRNAs and PCGs were accumulated and encoded by graph autoencoders to extract feature representations of lncRNAs and PCGs, which were subsequently used for decoding to obtain candidate lncRNA-PCG pairs. Comprehensive evaluation demonstrated that GAE-LGA can successfully capture lncRNA-PCG associations with strong robustness and outperformed other machine learning-based identification methods. Furthermore, multi-omics features were shown to improve the performance of lncRNA-PCG association identification. In conclusion, GAE-LGA can act as an efficient application for lncRNA-PCG association prediction with the following advantages: It fuses multi-omics information into the similarity network, making the feature representation more accurate; it can predict lncRNA-PCG associations for new lncRNAs and identify potential lncRNA-PCG associations with high accuracy.
引用
收藏
页数:9
相关论文
共 52 条
  • [1] MaTAR25 lncRNA regulates the Tensin1 gene to impact breast cancer progression
    Chang, Kung-Chi
    Diermeier, Sarah D.
    Yu, Allen T.
    Brine, Lily D.
    Russo, Suzanne
    Bhatia, Sonam
    Alsudani, Habeeb
    Kostroff, Karen
    Bhuiya, Tawfiqul
    Brogi, Edi
    Pappin, Darryl J.
    Bennett, C. Frank
    Rigo, Frank
    Spector, David L.
    [J]. NATURE COMMUNICATIONS, 2020, 11 (01)
  • [2] ILDMSF: Inferring Associations Between Long Non-Coding RNA and Disease Based on Multi-Similarity Fusion
    Chen, Qingfeng
    Lai, Dehuan
    Lan, Wei
    Wu, Ximin
    Chen, Baoshan
    Liu, Jin
    Chen, Yi-Ping Phoebe
    Wang, Jianxin
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2021, 18 (03) : 1106 - 1112
  • [3] KATZLDA: KATZ measure for the lncRNA-disease association prediction
    Chen, Xing
    [J]. SCIENTIFIC REPORTS, 2015, 5
  • [4] LncRNA2Target v2.0: a comprehensive database for target genes of lncRNAs in human and mouse
    Cheng, Liang
    Wang, Pingping
    Tian, Rui
    Wang, Song
    Guo, Qinghua
    Luo, Meng
    Zhou, Wenyang
    Liu, Guiyou
    Jiang, Huijie
    Jiang, Qinghua
    [J]. NUCLEIC ACIDS RESEARCH, 2019, 47 (D1) : D140 - D144
  • [5] LncRNA UCA1 Antagonizes Arsenic-Induced Cell Cycle Arrest through Destabilizing EZH2 and Facilitating NFATc2 Expression
    Dong, Zheng
    Gao, Ming
    Li, Changying
    Xu, Ming
    Liu, Sijin
    [J]. ADVANCED SCIENCE, 2020, 7 (11)
  • [6] Knockdown of MALAT1 inhibits osteosarcoma progression via regulating the miR-34a/cyclin D1 axis
    Duan, Guangchao
    Zhang, Chuanlin
    Xu, Changke
    Xu, Chao
    Zhang, Lei
    Zhang, Yan
    [J]. INTERNATIONAL JOURNAL OF ONCOLOGY, 2019, 54 (01) : 17 - 28
  • [7] GENCODE 2021
    Frankish, Adam
    Diekhans, Mark
    Jungreis, Irwin
    Lagarde, Julien
    Loveland, Jane E.
    Mudge, Jonathan M.
    Sisu, Cristina
    Wright, James C.
    Armstrong, Joel
    Barnes, If
    Berry, Andrew
    Bignell, Alexandra
    Boix, Carles
    Carbonell Sala, Silvia
    Cunningham, Fiona
    Di Domenico, Tomas
    Donaldson, Sarah
    Fiddes, Ian T.
    Giron, Carlos Garcia
    Gonzalez, Jose Manuel
    Grego, Tiago
    Hardy, Matthew
    Hourlier, Thibaut
    Howe, Kevin L.
    Hunt, Toby
    Izuogu, Osagie G.
    Johnson, Rory
    Martin, Fergal J.
    Martinez, Laura
    Mohanan, Shamika
    Muir, Paul
    Navarro, Fabio C. P.
    Parker, Anne
    Pei, Baikang
    Pozo, Fernando
    Riera, Ferriol Calvet
    Ruffier, Magali
    Schmitt, Bianca M.
    Stapleton, Eloise
    Suner, Marie-Marthe
    Sycheva, Irina
    Uszczynska-Ratajczak, Barbara
    Wolf, Maxim Y.
    Xu, Jinuri
    Yang, Yucheng T.
    Yates, Andrew
    Zerbino, Daniel
    Zhang, Yan
    Choudhary, Jyoti S.
    Gerstein, Mark
    [J]. NUCLEIC ACIDS RESEARCH, 2021, 49 (D1) : D916 - D923
  • [8] RIblast: an ultrafast RNA-RNA interaction prediction system based on a seed-and-extension approach
    Fukunaga, Tsukasa
    Hamada, Michiaki
    [J]. BIOINFORMATICS, 2017, 33 (17) : 2666 - 2674
  • [9] ImReLnc: Identifying Immune-Related LncRNA Characteristics in Human Cancers Based on Heuristic Correlation Optimization
    Gao, Meihong
    Liu, Shuhui
    Qi, Yang
    Guo, Xinpeng
    Shang, Xuequn
    [J]. FRONTIERS IN GENETICS, 2022, 12
  • [10] MechRNA: prediction of lncRNA mechanisms from RNA-RNA and RNA-protein interactions
    Gawronski, Alexander R.
    Uhl, Michael
    Zhang, Yajia
    Lin, Yen-Yi
    Niknafs, Yashar S.
    Ramnarine, Varune R.
    Malik, Rohit
    Feng, Felix
    Chinnaiyan, Arul M.
    Collins, Colin C.
    Sahinalp, S. Cenk
    Backofen, Rolf
    [J]. BIOINFORMATICS, 2018, 34 (18) : 3101 - 3110