An Improved Nested Named-Entity Recognition Model for Subject Recognition Task under Knowledge Base Question Answering

Cited by: 2
Authors
Wang, Ziming [1 ]
Xu, Xirong [1 ]
Li, Xinzi [1 ]
Li, Haochen [1 ]
Wei, Xiaopeng [1 ]
Huang, Degen [1 ]
Affiliations
[1] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian 116024, Peoples R China
Source
APPLIED SCIENCES-BASEL | 2023, Vol. 13, Issue 20
Keywords
natural language processing; knowledge base question answering; subject recognition; named-entity recognition; deep learning; robustness;
DOI
10.3390/app132011249
Abstract
In the subject recognition (SR) task under Knowledge Base Question Answering (KBQA), a common approach is to train and employ a general flat Named-Entity Recognition (NER) model. However, such models are neither effective nor robust when the recognized entity cannot be exactly matched to any subject in the Knowledge Base (KB). Compared to flat NER models, nested NER models show more flexibility and robustness in general NER tasks, but a nested NER model is difficult to employ directly in an SR task. In this paper, we take advantage of the features of nested NER models and propose an Improved Nested NER Model (INNM) for the SR task under KBQA. In our model, each question token is labeled as an entity token, a start token, or an end token by a modified nested NER model based on semantics. Entity candidates are then generated from these labels, and an approximate matching strategy scores all subjects in the KB by string similarity to find the best-matched subject. Experimental results show that our model is effective and robust on both single-relation and complex questions, outperforming the baseline flat NER model by a margin of 3.3% accuracy on the SimpleQuestions dataset and by a margin of 11.0% accuracy on the WebQuestionsSP dataset.
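To make the pipeline described in the abstract concrete, the following is a minimal sketch of the candidate-generation and approximate-matching steps. All names here (generate_candidates, best_matched_subject) and the label scheme ("START"/"END"/"ENT"/"O") are illustrative assumptions, and difflib's similarity ratio stands in for the paper's string-similarity score; the actual INNM labeling and scoring may differ.

```python
import difflib

def generate_candidates(tokens, labels):
    """Build entity candidates from per-token labels.

    labels[i] is one of "START", "END", "ENT" (plain entity token),
    or "O" (outside) -- a toy version of the scheme in the abstract.
    """
    starts = [i for i, lab in enumerate(labels) if lab == "START"]
    ends = [i for i, lab in enumerate(labels) if lab == "END"]
    candidates = set()
    # Every (start, end) pair with start <= end yields a span candidate,
    # which is what allows nested/overlapping candidates.
    for s in starts:
        for e in ends:
            if s <= e:
                candidates.add(" ".join(tokens[s:e + 1]))
    # Contiguous runs of entity-like tokens also form candidates.
    run = []
    for tok, lab in zip(tokens + [None], labels + ["O"]):
        if lab in ("ENT", "START", "END"):
            run.append(tok)
        elif run:
            candidates.add(" ".join(run))
            run = []
    return candidates

def best_matched_subject(candidates, kb_subjects):
    """Score every KB subject against each candidate by string
    similarity (approximate matching) and return the best match."""
    best, best_score = None, -1.0
    for cand in candidates:
        for subj in kb_subjects:
            score = difflib.SequenceMatcher(
                None, cand.lower(), subj.lower()).ratio()
            if score > best_score:
                best, best_score = subj, score
    return best, best_score

# Toy example: tokens and labels would come from the nested NER model.
tokens = ["who", "directed", "the", "dark", "knight"]
labels = ["O", "O", "O", "START", "END"]
subjects = ["The Dark Knight", "The Dark Knight Rises", "Dark City"]
print(best_matched_subject(generate_candidates(tokens, labels), subjects))
```

In this sketch the approximate matcher recovers "The Dark Knight" even though the recognized span "dark knight" does not exactly match any KB subject, which is the robustness gap over strict matching that the abstract highlights; a practical system would replace the exhaustive KB scan with an index over subject names.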
Pages: 13