Integration of multi-level semantics in PTMs with an attention model for question matching

被引:0
|
作者
Ye, Zheng [1 ,2 ]
Che, Linwei [1 ,2 ]
Ge, Jun [3 ]
Qin, Jun [1 ,2 ]
Liu, Jing [1 ,2 ]
机构
[1] South Cent Minzu Univ, Coll Comp Sci, Natl Ethn Affairs Commiss, Wuhan, Hubei, Peoples R China
[2] South Cent Minzu Univ, Informat Phys Fus Intelligent Comp Key Lab, Natl Ethn Affairs Commiss, Wuhan, Hubei, Peoples R China
[3] Wuhan Text Univ, Coll Int Business Econ, Wuhan, Hubei, Peoples R China
来源
PLOS ONE | 2024年 / 19卷 / 08期
关键词
D O I
10.1371/journal.pone.0305772
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The task of question matching/retrieval focuses on determining whether two questions are semantically equivalent. It has garnered significant attention in the field of natural language processing (NLP) due to its commercial value. While neural network models have made great strides and achieved human-level accuracy, they still face challenges when handling complex scenarios. In this paper, we delve into the utilization of different specializations encoded in different layers of large-scale pre-trained language models (PTMs). We propose a novel attention-based model called ERNIE-ATT that effectively integrates the diverse levels of semantics acquired by PTMs, thereby enhancing robustness. Experimental evaluations on two challenging datasets showcase the superior performance of our proposed model. It outperforms not only traditional models that do not use PTMs but also exhibits a significant improvement over strong PTM-based models. These findings demonstrate the effectiveness of our approach in enhancing the robustness of question matching/retrieval systems.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] A Multi-Level Attention Model for Remote Sensing Image Captions
    Li, Yangyang
    Fang, Shuangkang
    Jiao, Licheng
    Liu, Ruijiao
    Shang, Ronghua
    REMOTE SENSING, 2020, 12 (06)
  • [22] Multi-level Stereo Attention Model for Center Channel Extraction
    Lim, Wootaek
    Beack, Seungkwon
    Lee, Taejin
    2019 IEEE INTERNATIONAL SYMPOSIUM ON BROADBAND MULTIMEDIA SYSTEMS AND BROADCASTING (BMSB), 2019,
  • [23] Multi-Level Contextual RNNs With Attention Model for Scene Labeling
    Fan, Heng
    Mei, Xue
    Prokhorov, Danil
    Ling, Haibin
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2018, 19 (11) : 3475 - 3485
  • [24] Multi-level attention model for person re-identification
    Yan, Yichao
    Ni, Bingbing
    Liu, Jinxian
    Yang, Xiaokang
    PATTERN RECOGNITION LETTERS, 2019, 127 : 156 - 164
  • [25] MulAttenRec: A Multi-level Attention-Based Model for Recommendation
    Lin, Zhipeng
    Yang, Wenjing
    Zhang, Yongjun
    Wang, Haotian
    Tang, Yuhua
    NEURAL INFORMATION PROCESSING (ICONIP 2018), PT II, 2018, 11302 : 240 - 252
  • [26] Multi-Level Semantics with Vertical Integrity Constraints
    Panisson, Alison R.
    Bordini, Rafael H.
    da Rocha Costa, Antonio Carlos
    ECAI 2016: 22ND EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, 285 : 1708 - 1709
  • [27] Trajectory Similarity Search with Multi-level Semantics
    Zheng, Jianbing
    Wang, Shuai
    Jin, Cheqing
    Gao, Ming
    Zhou, Aoying
    Ni, Liang
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2021, PT III, 2022, 13157 : 602 - 619
  • [28] Strategic Multi-Omics Data Integration via Multi-Level Feature Contrasting and Matching
    Zhang, Jinli
    Ren, Hongwei
    Jiang, Zongli
    Chen, Zheng
    Yang, Ziwei
    Matsubara, Yasuko
    Sakurai, Yasushi
    IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2024, 23 (04) : 579 - 590
  • [29] SACIC: A Semantics-Aware Convolutional Image Captioner Using Multi-level Pervasive Attention
    Parameswaran, Sandeep Narayan
    Das, Sukhendu
    NEURAL INFORMATION PROCESSING (ICONIP 2019), PT III, 2019, 11955 : 64 - 76
  • [30] MLAN: Multi-Level Attention Network
    Qin, Peinuan
    Wang, Qinxuan
    Zhang, Yue
    Wei, Xueyao
    Gao, Meiguo
    IEEE ACCESS, 2022, 10 : 105437 - 105446