Grouped Contrastive Learning of Self-Supervised Sentence Representation

被引:1
|
作者
Wang, Qian [1 ]
Zhang, Weiqi [1 ]
Lei, Tianyi [1 ]
Peng, Dezhong [1 ,2 ,3 ]
机构
[1] Sichuan Univ, Coll Comp Sci & Technol, Chengdu 610065, Peoples R China
[2] Chengdu Ruibei Yingte Informat Technol Co Ltd, Chengdu 610054, Peoples R China
[3] Sichuan Zhiqian Technol Co Ltd, Chengdu 610065, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 17期
关键词
contrastive learning; self-attention; data augmentation; grouped representation; unsupervised learning;
D O I
10.3390/app13179873
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
This paper proposes a method called Grouped Contrastive Learning of self-supervised Sentence Representation (GCLSR), which can learn an effective and meaningful representation of sentences. Previous works maximize the similarity between two vectors to be the objective of contrastive learning, suffering from the high-dimensionality of the vectors. In addition, most previous works have adopted discrete data augmentation to obtain positive samples and have directly employed a contrastive framework from computer vision to perform contrastive training, which could hamper contrastive training because text data are discrete and sparse compared with image data. To solve these issues, we design a novel framework of contrastive learning, i.e., GCLSR, which divides the high-dimensional feature vector into several groups and respectively computes the groups' contrastive losses to make use of more local information, eventually obtaining a more fine-grained sentence representation. In addition, in GCLSR, we design a new self-attention mechanism and both a continuous and a partial-word vector augmentation (PWVA). For the discrete and sparse text data, the use of self-attention could help the model focus on the informative words by measuring the importance of every word in a sentence. By using the PWVA, GCLSR can obtain high-quality positive samples used for contrastive learning. Experimental results demonstrate that our proposed GCLSR achieves an encouraging result on the challenging datasets of the semantic textual similarity (STS) task and transfer task.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Contrastive self-supervised learning: review, progress, challenges and future research directions
    Kumar, Pranjal
    Rawat, Piyush
    Chauhan, Siddhartha
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2022,
  • [32] Contrastive self-supervised learning: review, progress, challenges and future research directions
    Pranjal Kumar
    Piyush Rawat
    Siddhartha Chauhan
    International Journal of Multimedia Information Retrieval, 2022, 11 : 461 - 488
  • [33] Self-supervised learning representation for abnormal acoustic event detection based on attentional contrastive learning
    Wei, Juan
    Zhang, Qian
    Ning, Weichen
    DIGITAL SIGNAL PROCESSING, 2023, 142
  • [34] Contrastive self-supervised learning: review, progress, challenges and future research directions
    Kumar, Pranjal
    Rawat, Piyush
    Chauhan, Siddhartha
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2022, 11 (04) : 461 - 488
  • [35] Colo-SCRL: Self-Supervised Contrastive Representation Learning for Colonoscopic Video Retrieval
    Chen, Qingzhong
    Cai, Shilun
    Cai, Crystal
    Yu, Zefang
    Qian, Dahong
    Xiang, Suncheng
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1056 - 1061
  • [36] ConCur: Self-supervised graph representation based on contrastive learning with curriculum negative sampling
    Yan, Rong
    Bao, Peng
    NEUROCOMPUTING, 2023, 551
  • [37] Automatic Data Augmentation Selection and Parametrization in Contrastive Self-Supervised Speech Representation Learning
    Zaiem, Salah
    Parcollet, Titouan
    Essid, Slim
    INTERSPEECH 2022, 2022, : 669 - 673
  • [38] DimCL: Dimensional Contrastive Learning for Improving Self-Supervised Learning
    Nguyen, Thanh
    Pham, Trung Xuan
    Zhang, Chaoning
    Luu, Tung M.
    Vu, Thang
    Yoo, Chang D.
    IEEE ACCESS, 2023, 11 : 21534 - 21545
  • [39] Self-Supervised Contrastive Learning for Volcanic Unrest Detection
    Bountos, Nikolaos Ioannis
    Papoutsis, Ioannis
    Michail, Dimitrios
    Anantrasirichai, Nantheera
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [40] CONTRASTIVE SELF-SUPERVISED LEARNING FOR WIRELESS POWER CONTROL
    Naderializadeh, Navid
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 4965 - 4969