Nonautoregressive Encoder-Decoder Neural Framework for End-to-End Aspect-Based Sentiment Triplet Extraction

Cited by: 47

Authors:

Fei, Hao [1]

Ren, Yafeng [2]

Zhang, Yue [3]

Ji, Donghong [1]

Affiliations:

[1] Wuhan Univ, Sch Cyber Sci & Engn, Wuhan 430072, Peoples R China

[2] Guangdong Univ Foreign Studies, Lab Language & Artificial Intelligence, Guangzhou 510420, Peoples R China

[3] Westlake Univ, Sch Engn, Hangzhou 310024, Peoples R China

Source:

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2023 / Vol. 34 / No. 09

Funding:

National Natural Science Foundation of China;

Keywords:

Task analysis; Decoding; Sentiment analysis; Predictive models; Labeling; Analytical models; Transformers; Bipartite matching loss; encoder-decoder framework; natural language processing (NLP); nonautoregressive decoding; pointer network; sentiment analysis; NETWORK; MODEL;

DOI:

10.1109/TNNLS.2021.3129483

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Aspect-based sentiment triplet extraction (ASTE) aims at recognizing joint triplets from texts, i.e., aspect terms, opinion expressions, and their correlated sentiment polarities. As a newly proposed task, ASTE depicts the complete sentiment picture from different perspectives to better facilitate real-world applications. Unfortunately, several major challenges, such as the overlapping issue and long-distance dependency, have not been addressed effectively by existing ASTE methods, which limits performance on the task. In this article, we present an innovative encoder-decoder framework for end-to-end ASTE. Specifically, the ASTE task is first modeled as an unordered triplet set prediction problem, which is well suited to a nonautoregressive decoding paradigm with a pointer network. Second, a novel high-order aggregation mechanism is proposed for fully integrating the underlying interactions between the overlapping structures of aspect and opinion terms. Third, a bipartite matching loss is introduced to facilitate the training of our nonautoregressive system. Experimental results on benchmark datasets show that our proposed framework significantly outperforms the state-of-the-art methods. Further analysis demonstrates the advantages of the proposed framework in handling the overlapping issue, relieving long-distance dependency, and improving decoding efficiency.
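The set-prediction idea in the abstract can be illustrated with a small sketch of a bipartite matching loss. This is not the authors' implementation: the slot representation (per-field probability tables), the names `slot_cost` and `bipartite_matching_loss`, and the brute-force search over assignments are all assumptions made for illustration. A practical system would likely use an efficient assignment solver and would also penalize unmatched slots, which is omitted here.

```python
from itertools import permutations
import math

EPS = 1e-9  # assumed floor to avoid log(0) for values a slot gives no probability

def slot_cost(pred, gold):
    """Negative log-likelihood of one gold triplet under one prediction slot."""
    return -sum(math.log(pred[field].get(value, EPS)) for field, value in gold.items())

def bipartite_matching_loss(preds, golds):
    """Minimum total cost over one-to-one assignments of gold triplets to slots.

    Brute force over permutations, so only suitable for a handful of slots.
    Returns (loss, assignment), where assignment[i] is the slot matched to golds[i].
    """
    assert len(preds) >= len(golds)
    best_loss, best_assign = math.inf, None
    for slots in permutations(range(len(preds)), len(golds)):
        loss = sum(slot_cost(preds[s], g) for s, g in zip(slots, golds))
        if loss < best_loss:
            best_loss, best_assign = loss, slots
    return best_loss, best_assign

# Hypothetical example: spans are (start, end) token indices, values are made up.
golds = [
    {"aspect": (1, 1), "opinion": (3, 3), "polarity": "POS"},
    {"aspect": (5, 6), "opinion": (8, 8), "polarity": "NEG"},
]
preds = [
    {"aspect": {(5, 6): 0.9, (1, 1): 0.1},
     "opinion": {(8, 8): 0.8, (3, 3): 0.2},
     "polarity": {"NEG": 0.7, "POS": 0.3}},
    {"aspect": {(1, 1): 0.8, (5, 6): 0.2},
     "opinion": {(3, 3): 0.9, (8, 8): 0.1},
     "polarity": {"POS": 0.9, "NEG": 0.1}},
    {"aspect": {(1, 1): 0.5, (5, 6): 0.5},
     "opinion": {(3, 3): 0.5, (8, 8): 0.5},
     "polarity": {"POS": 0.5, "NEG": 0.5}},
]
loss, assign = bipartite_matching_loss(preds, golds)
print(assign)  # slot index chosen for each gold triplet
```

Because the loss is computed on the best assignment rather than on a fixed output order, the prediction slots may emit the triplets in any order, which is what allows all slots to be decoded in parallel.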

Citation

Pages: 5544-5556

Page count: 13
