Query-induced multi-task decomposition and enhanced learning for aspect-based sentiment quadruple prediction

被引：3

作者：

Zhang, Hua ^{[1
]}

Song, Xiawen ^{[1
]}

Jia, Xiaohui ^{[1
]}

Yang, Cheng ^{[1
]}

Chen, Zeqi ^{[1
]}

Chen, Bi ^{[1
]}

Jiang, Bo ^{[1
]}

Wang, Ye ^{[1
]}

Feng, Rui ^{[2
,3
]}

机构：

[1] Zhejiang Gongshang Univ, Sch Comp Sci & Technol, Hangzhou 310018, Peoples R China

[2] Zhejiang Econ & Informat Ctr, Hangzhou 310012, Peoples R China

[3] Zhejiang Univ, Inst Comp Innovat, Hangzhou 310027, Peoples R China

来源：

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2024年 / 133卷

关键词：

Aspect sentiment quadruple prediction; Machine reading comprehension; Contrastive learning; Refined inference; EXTRACTION; MODEL;

D O I：

10.1016/j.engappai.2024.108609

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

A complete sentiment analysis of product and service reviews has attracted growing concerns from merchants to enhance personalized marketing activities. Aspect sentiment quadruple prediction (ASQP) is a demanding and challenging task with the objective to predict four sentiment elements from given reviews. Existing methods for ASQP face certain issues, with pipeline-based non-generative approaches prone to error propagation and generative models at the potential risk of producing unexpected outputs or longer inference times. To avoid these shortcomings, we develop a novel end-to-end non-generative model for ASQP involving multi-task decomposition within machine reading comprehension (MRC) framework. Specifically, the ASQP task is decomposed into six query-induced subtasks by introducing task-specific question templates. The proposed model, named MRCCLRI, is trained with multi-task joint learning. It also incorporates contrastive learning for category identification and sentiment classification to enhance the correlation of the six subtasks. To further promote the quadruple prediction, we present a refined inference algorithm in a bidirectional multi-turn inference procedure to effectively match aspect and opinion terms and optimize two inference hyperparameters: distance threshold and probability threshold. Experimental results exhibit superior performance compared to existing two nongenerative and seven generative baselines. Our proposed MRC-CLRI, as a novel non-generative model, outperforms the best existing generative method by an average F1 score improvement of 1.69% and the best previous non-generative method by an average F1 score improvement of 15.77% across four review datasets. Ablation experiments further validate the efficacy of the designed contrastive learning and the refined inference algorithm.

引用

页数：16

共 32 条

[31] MTTLm6A: A multi-task transfer learning approach for base-resolution mRNA m6A site prediction based on an improved transformer [J].

Wang, Honglei ;

Zeng, Wenliang ;

Huang, Xiaoling ;

Liu, Zhaoyang ;

Sun, Yanjing ;

Zhang, Lin .

MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2024, 21 (01) :272-299

[32] MTTLA-DLW: Multi-task TCN-Bi-LSTM transfer learning approach with dynamic loss weights based on feature correlations of the training samples for short-term wind power prediction [J].

Song, Jifeng ;

Peng, Xiaosheng ;

Song, Jiajiong ;

Yang, Zimin ;

Wang, Bo ;

Che, Jianfeng .

WIND ENERGY, 2024, 27 (07) :733-744

← 1 2 3 4 →