Combining Multi-granularity Text Semantics with Graph Relational Semantics for Question Retrieval in CQA

Times cited: 0
Authors
Li, Hong [1, 2]
Li, Jianjun [1 ]
Jin, Huazhong [2 ]
Chen, Zixuan [2 ]
Zou, Wei [3 ]
Affiliations
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan 430074, Peoples R China
[2] Hubei Univ Technol, Coll Comp Sci & Technol, Wuhan 430068, Peoples R China
[3] Hubei Univ Technol, Coll Sci, Wuhan 430068, Peoples R China
Source
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT II, ICIC 2024 | 2024 / Vol. 14876
Funding
National Natural Science Foundation of China;
Keywords
Question Retrieval; Community Question Answering; Text Similarity; Sequence Relevance; Network Embedding;
DOI
10.1007/978-981-97-5666-7_5
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Question retrieval aims to retrieve historical question-answer pairs that are semantically similar or related to newly posted questions. Existing methods rely on measuring the textual similarity between the asked question and the solved question, but suffer from insufficient semantic mining and inaccurate matching feature extraction. To address these issues, we propose a novel model that considers fine-grained word-level similarities and graph-based semantic relationships between questions, as well as potential sequence correlations between questions and answers. Specifically, a tag-enhanced multi-granularity matching strategy is designed to learn the semantic similarity between questions, and a BERT-based correlation mining method is adopted to explore the relevance between questions and answers. In addition, we construct a homogeneous question network based on the pointing relationships between question knowledge units and learn the relational semantics of question nodes through an auxiliary information-enhanced skip-gram algorithm. Evaluation results on two community datasets show that our proposed model significantly improves retrieval accuracy and efficiency compared to state-of-the-art methods.
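The graph side of the abstract (a question network whose nodes are embedded with a skip-gram objective) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how node sequences are produced, so the sketch assumes DeepWalk-style random walks, treats the pointing relationships as undirected links, and omits the auxiliary-information enhancement entirely. All function names and the toy question IDs are hypothetical.

```python
import random
from collections import defaultdict


def build_question_graph(links):
    """Build an adjacency map from (source_id, target_id) link pairs.

    Assumption: links are treated as undirected, so each pair adds
    an edge in both directions.
    """
    graph = defaultdict(set)
    for src, dst in links:
        graph[src].add(dst)
        graph[dst].add(src)
    return graph


def random_walk(graph, start, length, rng):
    """Walk up to `length` nodes from `start`, picking a uniform random neighbor."""
    walk = [start]
    while len(walk) < length:
        neighbors = sorted(graph.get(walk[-1], ()))  # sorted for reproducibility
        if not neighbors:
            break  # isolated node: stop early
        walk.append(rng.choice(neighbors))
    return walk


def skipgram_pairs(walk, window):
    """Return (center, context) pairs for nodes within `window` steps of each other."""
    pairs = []
    for i, center in enumerate(walk):
        lo = max(0, i - window)
        hi = min(len(walk), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                pairs.append((center, walk[j]))
    return pairs


# Toy network: q1 links to q2, q2 to q3, q3 to q4 (hypothetical question IDs).
links = [("q1", "q2"), ("q2", "q3"), ("q3", "q4")]
graph = build_question_graph(links)
rng = random.Random(0)
walk = random_walk(graph, "q1", 5, rng)
pairs = skipgram_pairs(walk, window=1)
```

The resulting pairs are the training examples a skip-gram model would consume; questions that co-occur on many walks would end up with nearby embeddings.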
Pages: 53-64
Page count: 12