Molecular subgraph representation learning based on spatial structure transformer

被引:0
作者
Zhang, Shaoguang [1 ]
Lu, Jianguang [1 ]
Tang, Xianghong [1 ]
机构
[1] Guizhou Univ, State Key Lab Publ Big Data, Guiyang 550025, Guizhou, Peoples R China
关键词
Graph representation learning; Graph transformer; Molecular subgraph; Graph neural network; GRAPH; FRAMEWORK;
D O I
10.1007/s40747-024-01602-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the field of molecular biology, graph representation learning is crucial for molecular structure analysis. However, challenges arise in recognising functional groups and distinguishing isomers due to a lack of spatial structure information. To address these problems, we design a novel graph representation learning method based on a spatial structure information extraction Transformer (SSET). The SSET model comprises the Edge Feature Fusion Subgraph Spatial Structure Extractor (ETSE) module and the Positional Information Encoding Graph Transformer (PEGT) module. The ETSE module extracts spatial structural information by fusing edge features and generating the most-value subgraph (Mv-subgraph). The PEGT module encodes positional information based on the graph transformer, addressing the indistinguishability problem among nodes with identical features. In addition, the SSET model alleviates the burden of high computational complexity by using subgraph. Experiments on real datasets show that the SSET model, built on the graph transformer, considerably improves graph representation learning.
引用
收藏
页码:8197 / 8212
页数:16
相关论文
共 51 条
  • [1] Alon Uri, 2020, ARXIV
  • [2] Beaini Dominique, 2021, P INT C MACH LEARN, P748
  • [3] Improving Graph Neural Network Expressivity via Subgraph Isomorphism Counting
    Bouritsas, Giorgos
    Frasca, Fabrizio
    Zafeiriou, Stefanos
    Bronstein, Michael M.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (01) : 657 - 668
  • [4] Cai D, 2020, AAAI CONF ARTIF INTE, V34, P7464
  • [5] Chen DL, 2020, AAAI CONF ARTIF INTE, V34, P3438
  • [6] Chen HC, 2018, AAAI CONF ARTIF INTE, P2127
  • [7] Corso G, 2020, ADV NEUR IN, V33
  • [8] Cucurull Guillem, 2017, arXiv
  • [9] Defferrard M, 2016, ADV NEUR IN, V29
  • [10] metapath2vec: Scalable Representation Learning for Heterogeneous Networks
    Dong, Yuxiao
    Chawla, Nitesh V.
    Swami, Ananthram
    [J]. KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, : 135 - 144