HyperRank: Hyperbolic Ranking Model for Unsupervised Keyphrase Extraction

Citations: 0
Authors
Song, Mingyang [1]
Liu, Huafeng [1]
Jing, Liping [1]
Affiliations
[1] Beijing Jiaotong University, Beijing Key Lab of Traffic Data Analysis and Mining, Beijing, People's Republic of China
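Source
2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023) | 2023
Funding
National Natural Science Foundation of China; Beijing Natural Science Foundation
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract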
Given the exponential growth in the number of documents on the web in recent years, there is an increasing demand for accurate models to extract keyphrases from such documents. Keyphrase extraction is the task of automatically identifying representative keyphrases from the source document. Typically, candidate keyphrases exhibit latent hierarchical structures embedded with intricate syntactic and semantic information. Moreover, the relationships between candidate keyphrases and the document also form hierarchical structures. Therefore, it is essential to consider these latent hierarchical structures when extracting keyphrases. However, many recent unsupervised keyphrase extraction models overlook this aspect, resulting in incorrect keyphrase extraction. In this paper, we address this issue by proposing a new hyperbolic ranking model (HyperRank). HyperRank is designed to jointly model global and local context information for estimating the importance of each candidate keyphrase within the hyperbolic space, enabling accurate keyphrase extraction. Experimental results demonstrate that HyperRank significantly outperforms recent state-of-the-art baselines.
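As a rough illustration of what ranking "within the hyperbolic space" can look like, the sketch below computes the geodesic distance in the Poincaré ball and sorts candidates by their distance to a document vector. This is not the HyperRank scoring function, which the abstract does not specify. The function names, the toy 2-D vectors, and the choice of distance-to-document as the score are all assumptions made for illustration.

```python
import math


def poincare_distance(u, v):
    """Geodesic distance between two points strictly inside the unit Poincare ball."""
    sq_u = sum(x * x for x in u)
    sq_v = sum(x * x for x in v)
    if sq_u >= 1.0 or sq_v >= 1.0:
        raise ValueError("points must lie strictly inside the unit ball")
    sq_diff = sum((a - b) ** 2 for a, b in zip(u, v))
    # d(u, v) = arcosh(1 + 2 * ||u - v||^2 / ((1 - ||u||^2) * (1 - ||v||^2)))
    return math.acosh(1.0 + 2.0 * sq_diff / ((1.0 - sq_u) * (1.0 - sq_v)))


def rank_candidates(doc_vec, candidates):
    """Return (phrase, distance) pairs sorted by increasing distance to doc_vec.

    Assumption: a smaller hyperbolic distance to the document vector
    means a more representative candidate. This is a hypothetical score.
    """
    scored = [(phrase, poincare_distance(doc_vec, vec)) for phrase, vec in candidates]
    return sorted(scored, key=lambda item: item[1])


# Hypothetical toy embeddings, not taken from the paper.
doc = [0.0, 0.0]
cands = [("graph model", [0.8, 0.0]), ("keyphrase extraction", [0.1, 0.0])]
ranking = rank_candidates(doc, cands)
```

Distances grow quickly near the boundary of the ball, which is the property that makes hyperbolic space a common choice for embedding hierarchical structure.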
Pages: 16070-16080
Page count: 11