Entity understanding with hierarchical graph learning for enhanced text classification

被引:8
作者
Wang, Chao [1 ]
Jiang, Haiyun [3 ]
Chen, Tao [1 ]
Liu, Jingping [2 ]
Wang, Menghui [1 ]
Jiang, Sihang [1 ]
Li, Zhixu [1 ]
Xiao, Yanghua [1 ,4 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai Key Lab Data Sci, Shanghai, Peoples R China
[2] East China Univ Sci & Technol, Sch Informat Sci & Engn, Shanghai, Peoples R China
[3] Tencent AI Lab, Shenzhen, Peoples R China
[4] Fudan Aishu Cognit Intelligence Joint Res Ctr, Shanghai, Peoples R China
关键词
Text classification; Hierarchical graph learning; Soft clustering; NETWORKS;
D O I
10.1016/j.knosys.2022.108576
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text classification plays an important role in the areas of natural language processing and data mining. In general, a text is usually described around a collection of entities, i.e., the entities are the core part of the text. As a result, a deep understanding of the entities in a text benefits the classification of texts. To understand entities, traditional work tends to introduce concepts or web data for entities. However, we argue that the potential relations between entities are also important for the understanding of entity semantics, thus further supporting the classification of texts. In this paper, we focus on enhancing the performance of the existing text classification models by extracting features from entities with hierarchical graph learning. To this end, we mine the concepts of entities and the relations between them for a given text simultaneously, and further construct the semantic graph of the text. Then a novel hierarchical graph learning model is proposed to learn the graph embedding that well captures the node, relation, and graph structure information. Our experiments show that the proposed method has the ability to effectively improve the performance of the existing text classifiers. (c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:11
相关论文
共 52 条
  • [1] Crowdsourcing the character of a place: Character-level convolutional networks for multilingual geographic text classification
    Adams, Benjamin
    McKenzie, Grant
    [J]. TRANSACTIONS IN GIS, 2018, 22 (02) : 394 - 408
  • [2] [Anonymous], 2020, AAAI CONF ARTIF INTE
  • [3] [Anonymous], 2014, PROC 23 ACM INT C C, DOI DOI 10.1145/2661829.2662067
  • [4] Neural Relational Topic Models for Scientific Article Analysis
    Bai, Haoli
    Chen, Zhuangbin
    Lyu, Michael R.
    King, Irwin
    Xu, Zenglin
    [J]. CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 27 - 36
  • [5] Bordes A., 2013, P 26 INT C NEUR INF, V2, P2787
  • [6] Bunescu R., 2005, P HUM LANG TECHN C C, P724
  • [7] Can DC, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P2902
  • [8] Effective Deep Attributed Network Representation Learning With Topology Adapted Smoothing
    Chen, Jia
    Zhong, Ming
    Li, Jianxin
    Wang, Dianhui
    Qian, Tieyun
    Tu, Hang
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (07) : 5935 - 5946
  • [9] Chen JD, 2019, AAAI CONF ARTIF INTE, P6252
  • [10] Conneau A, 2017, 15TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2017), VOL 1: LONG PAPERS, P1107