Query-induced multi-task decomposition and enhanced learning for aspect-based sentiment quadruple prediction

被引:3
作者
Zhang, Hua [1 ]
Song, Xiawen [1 ]
Jia, Xiaohui [1 ]
Yang, Cheng [1 ]
Chen, Zeqi [1 ]
Chen, Bi [1 ]
Jiang, Bo [1 ]
Wang, Ye [1 ]
Feng, Rui [2 ,3 ]
机构
[1] Zhejiang Gongshang Univ, Sch Comp Sci & Technol, Hangzhou 310018, Peoples R China
[2] Zhejiang Econ & Informat Ctr, Hangzhou 310012, Peoples R China
[3] Zhejiang Univ, Inst Comp Innovat, Hangzhou 310027, Peoples R China
关键词
Aspect sentiment quadruple prediction; Machine reading comprehension; Contrastive learning; Refined inference; EXTRACTION; MODEL;
D O I
10.1016/j.engappai.2024.108609
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A complete sentiment analysis of product and service reviews has attracted growing concerns from merchants to enhance personalized marketing activities. Aspect sentiment quadruple prediction (ASQP) is a demanding and challenging task with the objective to predict four sentiment elements from given reviews. Existing methods for ASQP face certain issues, with pipeline-based non-generative approaches prone to error propagation and generative models at the potential risk of producing unexpected outputs or longer inference times. To avoid these shortcomings, we develop a novel end-to-end non-generative model for ASQP involving multi-task decomposition within machine reading comprehension (MRC) framework. Specifically, the ASQP task is decomposed into six query-induced subtasks by introducing task-specific question templates. The proposed model, named MRCCLRI, is trained with multi-task joint learning. It also incorporates contrastive learning for category identification and sentiment classification to enhance the correlation of the six subtasks. To further promote the quadruple prediction, we present a refined inference algorithm in a bidirectional multi-turn inference procedure to effectively match aspect and opinion terms and optimize two inference hyperparameters: distance threshold and probability threshold. Experimental results exhibit superior performance compared to existing two nongenerative and seven generative baselines. Our proposed MRC-CLRI, as a novel non-generative model, outperforms the best existing generative method by an average F1 score improvement of 1.69% and the best previous non-generative method by an average F1 score improvement of 15.77% across four review datasets. Ablation experiments further validate the efficacy of the designed contrastive learning and the refined inference algorithm.
引用
收藏
页数:16
相关论文
共 32 条
[31]   MTTLm6A: A multi-task transfer learning approach for base-resolution mRNA m6A site prediction based on an improved transformer [J].
Wang, Honglei ;
Zeng, Wenliang ;
Huang, Xiaoling ;
Liu, Zhaoyang ;
Sun, Yanjing ;
Zhang, Lin .
MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2024, 21 (01) :272-299
[32]   MTTLA-DLW: Multi-task TCN-Bi-LSTM transfer learning approach with dynamic loss weights based on feature correlations of the training samples for short-term wind power prediction [J].
Song, Jifeng ;
Peng, Xiaosheng ;
Song, Jiajiong ;
Yang, Zimin ;
Wang, Bo ;
Che, Jianfeng .
WIND ENERGY, 2024, 27 (07) :733-744