Contrastive knowledge integrated graph neural networks for Chinese medical text classification

被引:14
作者
Lan, Ge [1 ]
Hu, Mengting [1 ]
Li, Ye [2 ]
Zhang, Yuzhi [1 ]
机构
[1] Nankai Univ, Coll Software, Tianjin 300350, Peoples R China
[2] Nankai Univ, Coll Artificial Intelligence, Tianjin 300350, Peoples R China
关键词
Knowledge graph; Graph neural networks; Medical text classification; Supervised contrastive learning;
D O I
10.1016/j.engappai.2023.106057
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper aims at medical text classification, where texts describe medicines, diseases, or other medical topics. This field is still challenging since medical texts contain intensive specialization and terminology, which require professional semantic and structured knowledge to classify. Based on our observations, medical knowledge graph (KG) can provide such knowledge although they may be ambiguous. To this end, we propose contrastive knowledge integrated graph neural networks (ConKGNN) to make full use of the above knowledge. Specifically, the proposed method builds two graphs for a medical text, i.e. text graph and text-specific subgraph, containing the text information and relevant KG information, respectively. Two graphs are merged into a united graph, which is jointly modeled by graph neural networks (GNN). In this way, our approach adequately learns interactions between neighbors. Meanwhile, it promotes the mutual influences between text and KG. We further propose graph-based supervised contrastive learning. By randomly cutting off nodes from the text graph, an augmented united graph is obtained. Learning it in a contrastive way could enhance the robustness of introducing KG information. Comprehensive experiments are conducted on five Chinese medical datasets and experimental results show our model outperforms strong baselines remarkably. Consequently, our model can serve as an efficient medical text classifier with excellent performance. We release the code at https://github.com/nolongernome/ConKGNN.
引用
收藏
页数:14
相关论文
共 62 条
  • [1] Abdollahi M, 2019, IEEE C EVOL COMPUTAT, P119, DOI [10.1109/cec.2019.8790259, 10.1109/CEC.2019.8790259]
  • [2] A corpus-based semantic kernel for text classification by using meaning values of terms
    Altinel, Berna
    Ganiz, Murat Can
    Diri, Banu
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2015, 43 : 54 - 66
  • [3] [Anonymous], 2014, ENG APPL ARTIF INTEL, DOI DOI 10.1016/J.ENGAPPAI.2014.01.013
  • [4] [Anonymous], 2012, P 11 INT WORKSH DAT, DOI DOI 10.1145/2350176.2350181
  • [5] Bordes A., 2013, ADV NEURAL INFORM PR, V26, P2787, DOI DOI 10.5555/2999792.2999923
  • [6] CAFE: Knowledge graph completion using neighborhood-aware features
    Borrego, Agustin
    Ayala, Daniel
    Hernandez, Inma
    Rivero, Carlos R.
    Ruiz, David
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 103
  • [7] Bosselut A, 2021, AAAI CONF ARTIF INTE, V35, P4923
  • [8] Che W., 2020, arXiv
  • [9] Chen JD, 2019, AAAI CONF ARTIF INTE, P6252
  • [10] Chen T, 2020, PR MACH LEARN RES, V119