A Thangka cultural element classification model based on self-supervised contrastive learning and MS Triplet Attention

被引:1
|
作者
Tang, Wenjing [1 ]
Xie, Qing [1 ,2 ]
机构
[1] Wuhan Univ Technol, Sch Comp Sci & Artificial Intelligence, Wuhan 430070, Peoples R China
[2] Minist Educ, Engn Res Ctr Intelligent Serv Technol Digital Publ, Wuhan, Peoples R China
基金
中国国家自然科学基金;
关键词
Tibetan Thangka classification; Sample imbalance problem; Self-supervised contrastive learning; Gradient Harmonizing Mechanism Loss; Attention mechanism;
D O I
10.1007/s00371-024-03397-0
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Being a significant repository of Buddhist imagery, Thangka images are valuable historical materials of Tibetan studies, which covers many domains such as Tibetan history, politics, culture, social life and even traditional medicine and astronomy. Thangka cultural element images are the essence of Thangka images. Hence, Thangka cultural element images classification is one of the most important works of knowledge representation and mining in the field of Thangka and is the foundation of digital protection of Thangka images. However, due to the limited quantity, high complexity and the intricate textures of Thangka images, the classification of Thangka images is limited to a small number of categories and coarse granularity. Thus, a novel fusion texture feature dual-branch Thangka cultural elements classification model based on the attention mechanism and self-supervised contrastive learning has been proposed in this paper. Specifically, to address the issue of insufficient labeled samples and improve the classification performance, this method utilizes a large amount of unlabeled irrelevant data to pre-train the feature extractor through self-supervised learning. During the fine-tuning stage of the downstream task, a dual-branch feature extraction structure incorporating texture features has been designed, and MS Triplet Attention proposed by us is used for the integration of important features. Additionally, to address the problem of sample imbalance and the existence of a large number of difficult samples in the Thangka cultural element dataset, the Gradient Harmonizing Mechanism Loss has been adopted, and it has been improved by introducing a self-designed adaptive mechanism. The experimental results on Thangka cultural elements dataset prove the superiority of the proposed method over the state-of-the-art methods. The source code of our proposed algorithm and the related datasets is available at https://github.com/WiniTang/MS-BiCLR.
引用
收藏
页码:3919 / 3935
页数:17
相关论文
共 50 条
  • [1] Self-Supervised Contrastive Learning for Singing Voices
    Yakura, Hiromu
    Watanabe, Kento
    Goto, Masataka
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 1614 - 1623
  • [2] SWaCo: Safe Wafer Bin Map Classification With Self-Supervised Contrastive Learning
    Kwak, Min Gu
    Lee, Young Jae
    Kim, Seoung Bum
    IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, 2023, 36 (03) : 416 - 424
  • [3] Parkinson's Disease Classification with Self-supervised Learning and Attention Mechanism
    Zhang, Yuchen
    Lei, Haijun
    Huang, Zhongwei
    Zhao, Menglu
    Li, Zhen
    Liu, Chuan-Ming
    Lei, Baiying
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 4601 - 4607
  • [4] Modulation Recognition of Digital Signals Based on Contrastive Self-Supervised Learning
    Liao, Yanping
    Gao, Yang
    Guo, Qiang
    2024 9TH INTERNATIONAL CONFERENCE ON ELECTRONIC TECHNOLOGY AND INFORMATION SCIENCE, ICETIS 2024, 2024, : 432 - 436
  • [5] An Attention Self-supervised Contrastive Learning based Three-stage Model for Hand Shape Feature Representation in Cued Speech
    Wang, Jianrong
    Gu, Nan
    Yu, Mei
    Li, Xuewei
    Fang, Qiang
    Liu, Li
    INTERSPEECH 2021, 2021, : 626 - 630
  • [6] Self-supervised Contrastive Learning for Predicting Game Strategies
    Lee, Young Jae
    Baek, Insung
    Jo, Uk
    Kim, Jaehoon
    Bae, Jinsoo
    Jeong, Keewon
    Kim, Seoung Bum
    INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 1, 2023, 542 : 136 - 147
  • [7] ASDC-FER: attention-guided self-supervised distilled contrastive learning for facial expression recognition
    Yan, Lingyu
    Yang, Jinquan
    Wang, Chunzhi
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (05)
  • [8] Missing nodes detection on graphs with self-supervised contrastive learning
    Liu, Chen
    Cao, Tingting
    Zhou, Lixin
    Shao, Ying
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 132
  • [9] Self-supervised learning representation for abnormal acoustic event detection based on attentional contrastive learning
    Wei, Juan
    Zhang, Qian
    Ning, Weichen
    DIGITAL SIGNAL PROCESSING, 2023, 142
  • [10] Self-supervised contrastive learning for heterogeneous graph based on multi-pretext tasks
    Shuai Ma
    Jian-wei Liu
    Neural Computing and Applications, 2023, 35 : 10275 - 10296