Integration of multi-level semantics in PTMs with an attention model for question matching

被引:0
|
作者
Ye, Zheng [1 ,2 ]
Che, Linwei [1 ,2 ]
Ge, Jun [3 ]
Qin, Jun [1 ,2 ]
Liu, Jing [1 ,2 ]
机构
[1] South Cent Minzu Univ, Coll Comp Sci, Natl Ethn Affairs Commiss, Wuhan, Hubei, Peoples R China
[2] South Cent Minzu Univ, Informat Phys Fus Intelligent Comp Key Lab, Natl Ethn Affairs Commiss, Wuhan, Hubei, Peoples R China
[3] Wuhan Text Univ, Coll Int Business Econ, Wuhan, Hubei, Peoples R China
来源
PLOS ONE | 2024年 / 19卷 / 08期
关键词
D O I
10.1371/journal.pone.0305772
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The task of question matching/retrieval focuses on determining whether two questions are semantically equivalent. It has garnered significant attention in the field of natural language processing (NLP) due to its commercial value. While neural network models have made great strides and achieved human-level accuracy, they still face challenges when handling complex scenarios. In this paper, we delve into the utilization of different specializations encoded in different layers of large-scale pre-trained language models (PTMs). We propose a novel attention-based model called ERNIE-ATT that effectively integrates the diverse levels of semantics acquired by PTMs, thereby enhancing robustness. Experimental evaluations on two challenging datasets showcase the superior performance of our proposed model. It outperforms not only traditional models that do not use PTMs but also exhibits a significant improvement over strong PTM-based models. These findings demonstrate the effectiveness of our approach in enhancing the robustness of question matching/retrieval systems.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Multi-level graph bisection and "cocktail" matching
    Haralambides, J
    CSC '05: PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON SCIENTIFIC COMPUTING, 2005, : 122 - 128
  • [42] An image inpainting model based on channel attention gated convolution and multi-level attention mechanism
    Zhao, Sihan
    Li, Chunmeng
    Zhang, Chenyang
    Yang, Xiaozhong
    DISPLAYS, 2025, 87
  • [43] Visual Relation Detection with Multi-Level Attention
    Zheng, Sipeng
    Chen, Shizhe
    Jin, Qin
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 121 - 129
  • [44] Attention as a multi-level system of weights and balances
    Narhi-Martinez, William
    Dube, Blaire
    Golomb, Julie D.
    WILEY INTERDISCIPLINARY REVIEWS-COGNITIVE SCIENCE, 2023, 14 (01)
  • [45] CNNs with Multi-Level Attention for Domain Generalization
    Ballas, Aristotelis
    Diou, Cristos
    PROCEEDINGS OF THE 2023 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2023, 2023, : 592 - 596
  • [46] Multi-level attention for referring expression comprehension
    Sun, Yanfeng
    Zhang, Yunru
    Jiang, Huajie
    Hu, Yongli
    Yin, Baocai
    PATTERN RECOGNITION LETTERS, 2023, 172 : 252 - 258
  • [47] Dynamic Multi-Level Governance - Bringing the Study of Multi-Level Interactions into the Theorising of European Integration
    Littoz-Monnet, Annabelle
    EUROPEAN INTEGRATION ONLINE PAPERS-EIOP, 2010, 14 (01):
  • [48] Joint Deep Model with Multi-Level Attention and Hybrid-Prediction for Recommendation
    Lin, Zhipeng
    Tang, Yuhua
    Zhang, Yongjun
    ENTROPY, 2019, 21 (02):
  • [49] Road Crack Model Based on Multi-Level Feature Fusion and Attention Mechanism
    Song, Rongrong
    Wang, Caiyong
    Tian, Qichuan
    Zhang, Qi
    Computer Engineering and Applications, 2023, 59 (13): : 281 - 288
  • [50] MULTI-LEVEL ATTENTION MODEL WITH DEEP SCATTERING SPECTRUM FOR ACOUSTIC SCENE CLASSIFICATION
    Li, Zhitong
    Hou, Yuanbo
    Xie, Xiang
    Li, Shengchen
    Zhang, Liqiang
    Du, Shixuan
    Liu, Wei
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2019, : 396 - 401