MEMD-ABSA: a multi-element multi-domain dataset for aspect-based sentiment analysis

被引:0
作者
Cai, Hongjie [1 ]
Song, Nan [1 ]
Wang, Zengzhi [1 ]
Xie, Qiming [1 ]
Zhao, Qiankun [1 ]
Li, Ke [1 ]
Wu, Siwei [1 ]
Liu, Shijie [1 ]
Ma, Heqing [1 ]
Yu, Jianfei [1 ]
Xia, Rui [1 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing, Peoples R China
关键词
Natural language processing; Aspect-based sentiment analysis; Opinion mining; Implicit expression;
D O I
10.1007/s10579-025-09820-9
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Aspect-based sentiment analysis is a long-standing research interest in the field of opinion mining, and in recent years, researchers have gradually shifted their focus from simple ABSA subtasks to end-to-end multi-element ABSA tasks. However, the datasets currently used in the research are limited to individual elements of specific tasks, usually focusing on in-domain settings, ignoring implicit aspects and opinions, and with a small data scale. To address these issues, we propose a large-scale Multi-Element Multi-Domain dataset (MEMD) that covers the four elements across five domains, including nearly 20,000 review sentences and 30,000 quadruples annotated with both explicit and implicit aspects and opinions for ABSA research. Meanwhile, we conduct experiments on multiple ABSA subtasks under the open domain setting to verify the effectiveness of several generative and non-generative baselines, and the results show that open domain ABSA as well as mining implicit aspects and opinions remain ongoing challenges to be addressed.
引用
收藏
页数:29
相关论文
共 46 条
  • [1] A Proposal for Book Oriented Aspect Based Sentiment Analysis: Comparison over Domains
    Alvarez-Lopez, Tamara
    Fernandez-Gavilanes, Milagros
    Costa-Montenegro, Enrique
    Bellot, Patrice
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2018), 2018, 10859 : 3 - 14
  • [2] Cai HJ, 2021, 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), P340
  • [3] Chen H, 2022, PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), P2974
  • [4] Chen S., 2020, 2020 58 ANN M ASS CO, P6515
  • [5] Chen SW, 2021, AAAI CONF ARTIF INTE, V35, P12666
  • [6] Chen Z, 2020, 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), P3685
  • [7] Chia YK, 2023, Arxiv, DOI arXiv:2305.14434
  • [8] Dai JQ, 2021, Arxiv, DOI arXiv:2104.04986
  • [9] Devlin J, 2019, Arxiv, DOI arXiv:1810.04805
  • [10] Fan ZF, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P2509