Self-Supervised Learning based on Sentiment Analysis with Word Weight Calculation

被引：2

作者：

Son, Dongcheol ^{[1
]}

Ko, Youngjoong ^{[1
]}

机构：

[1] Sungkyunkwan Univ, Suwon, Gyeonggi Do, South Korea

来源：

PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021 | 2021年

基金：

新加坡国家研究基金会;

关键词：

Self-supervised Learning; Sentiment Analysis; Fine-tuning; Word Weight Calculation;

D O I：

10.1145/3459637.3482180

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Learning domain information for a downstream task is important to improve the performance of sentiment analysis. However, the labeling task to obtain a sufficient amount of training data in an application domain tends to be highly time-consuming and tedious. To solve this problem, we propose a novel method to effectively learn domain information and improve sentiment analysis performance with a small amount of training data. We use the masked language model (MLM), which is a self-supervised learning model, to calculate word weights and improve a downstream fine-tuning task for sentiment analysis. In particular, the MLM with the calculated word weights is executed simultaneously with the fine-tuning task. The results show that the proposed model achieves better performances than previous models in four different datasets for sentiment analysis.

引用

页码：3428 / 3432

页数：5

共 14 条

[1]

[Anonymous], 2020, P 28 INT C COMP LING, DOI DOI 10.1109/ICARM49381.2020.9195367

[2]

Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171

[3]

Du JF, 2021, 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), P5408

[4]

Gururangan Suchin, 2020, P 58 ANN M ASS COMP, P8342, DOI DOI 10.18653/V1/2020.ACL-MAIN.740

[5]

Ke P, 2020, PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), P6975

[6]

Liu B., 2012, Synthesis Lectures on Human Language Technologies, V5, P1, DOI DOI 10.2200/S00416ED1V01Y201204HLT016

[7]

Liu Yinhan, 2019, CoRR, DOI DOI 10.48550/ARXIV.1907.11692

[8]

Pang Bo, 2005, "Seeing Stars: Exploiting Class Relationships for Sentiment Categorization With Respect to Rating Scales. In Proceedings of the Annual Meeting of the Association for Computational Linguistics"

[9]

Socher Richard, 2013, P 2013 C EMP METH NA, P1631

[10]

Tian H, 2020, 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), P4067

← 1 2 →