Fine-Grained Category Generation for Sets of Entities

被引：0

作者：

Du, Yexing ^{[1
]}

Yu, Jifan ^{[2
]}

Wan, Jing ^{[1
]}

Xu, Jianjun ^{[3
]}

Hou, Lei ^{[2
]}

机构：

[1] Beijing Univ Chem Technol, Beijing 100029, Peoples R China

[2] Tsinghua Univ, Beijing 100084, Peoples R China

[3] Beijing Caizhi Technol Co Ltd, Beijing 100081, Peoples R China

来源：

WEB AND BIG DATA, PT IV, APWEB-WAIM 2023 | 2024年 / 14334卷

基金：

国家重点研发计划;

关键词：

Category generation; Fine-grained; Prompt;

D O I：

10.1007/978-981-97-2421-5_26

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Category systems play an essential role in knowledge bases by groupings of semantically related entities. Category generation task aims to produce category suggestions which can help knowledge editors to expand a category system. Most past research has focused on solving coarse-grained problems, not fine-grained scenarios. In this paper, we propose a two-stage framework to generate fine-grained categories for sets of entities. In the category generation stage, we extract conceptual texts from the context of entities and then employ the Seq2Seq model to generate candidate categories. In the category selection stage, we cluster the entities and design discrete patterns using entity names for prompt ranking, which are further ensembled to preserve the final categories. We construct a new fine-grained category generation dataset based on Wikipedia. Experimental results demonstrate the effectiveness of the framework over the state-of-the-art abstractive summarization methods.

引用

页码：390 / 405

页数：16

共 35 条

[1] MEMORY FOR A LECTURE - EFFECTS OF NOTES, LECTURE RATE, AND INFORMATIONAL DENSITY [J].

AIKEN, EG ;

THOMAS, GS ;

SHENNUM, WA .

JOURNAL OF EDUCATIONAL PSYCHOLOGY, 1975, 67 (03) :439-444

[2]

Ben-David E, 2022, Arxiv, DOI arXiv:2102.12206

[3]

Brown TB, 2020, ADV NEUR IN, V33

[4]

Carlsson F, 2022, PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), P6837

[5]

Choi E, 2018, PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, P87

[6]

Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171

[7]

Ding N., 2021, arXiv, DOI DOI 10.48550/ARXIV.2108.10604

[8]

Han X, 2022, PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), P2241

[9]

Hartigan J. A., 1979, Applied Statistics, V28, P100, DOI 10.2307/2346830

[10]

Hearst MartiA., 1992, COLING, P539, DOI DOI 10.3115/992133.992154

← 1 2 3 4 →