Self-Supervised Learning based on Sentiment Analysis with Word Weight Calculation

被引:2
作者
Son, Dongcheol [1 ]
Ko, Youngjoong [1 ]
机构
[1] Sungkyunkwan Univ, Suwon, Gyeonggi Do, South Korea
来源
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021 | 2021年
基金
新加坡国家研究基金会;
关键词
Self-supervised Learning; Sentiment Analysis; Fine-tuning; Word Weight Calculation;
D O I
10.1145/3459637.3482180
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Learning domain information for a downstream task is important to improve the performance of sentiment analysis. However, the labeling task to obtain a sufficient amount of training data in an application domain tends to be highly time-consuming and tedious. To solve this problem, we propose a novel method to effectively learn domain information and improve sentiment analysis performance with a small amount of training data. We use the masked language model (MLM), which is a self-supervised learning model, to calculate word weights and improve a downstream fine-tuning task for sentiment analysis. In particular, the MLM with the calculated word weights is executed simultaneously with the fine-tuning task. The results show that the proposed model achieves better performances than previous models in four different datasets for sentiment analysis.
引用
收藏
页码:3428 / 3432
页数:5
相关论文
共 14 条
[1]  
[Anonymous], 2020, P 28 INT C COMP LING, DOI DOI 10.1109/ICARM49381.2020.9195367
[2]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[3]  
Du JF, 2021, 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), P5408
[4]  
Gururangan Suchin, 2020, P 58 ANN M ASS COMP, P8342, DOI DOI 10.18653/V1/2020.ACL-MAIN.740
[5]  
Ke P, 2020, PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), P6975
[6]  
Liu B., 2012, Synthesis Lectures on Human Language Technologies, V5, P1, DOI DOI 10.2200/S00416ED1V01Y201204HLT016
[7]  
Liu Yinhan, 2019, CoRR, DOI DOI 10.48550/ARXIV.1907.11692
[8]  
Pang Bo, 2005, "Seeing Stars: Exploiting Class Relationships for Sentiment Categorization With Respect to Rating Scales. In Proceedings of the Annual Meeting of the Association for Computational Linguistics"
[9]  
Socher Richard, 2013, P 2013 C EMP METH NA, P1631
[10]  
Tian H, 2020, 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), P4067