A Thangka cultural element classification model based on self-supervised contrastive learning and MS Triplet Attention

被引:1
|
作者
Tang, Wenjing [1 ]
Xie, Qing [1 ,2 ]
机构
[1] Wuhan Univ Technol, Sch Comp Sci & Artificial Intelligence, Wuhan 430070, Peoples R China
[2] Minist Educ, Engn Res Ctr Intelligent Serv Technol Digital Publ, Wuhan, Peoples R China
基金
中国国家自然科学基金;
关键词
Tibetan Thangka classification; Sample imbalance problem; Self-supervised contrastive learning; Gradient Harmonizing Mechanism Loss; Attention mechanism;
D O I
10.1007/s00371-024-03397-0
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Being a significant repository of Buddhist imagery, Thangka images are valuable historical materials of Tibetan studies, which covers many domains such as Tibetan history, politics, culture, social life and even traditional medicine and astronomy. Thangka cultural element images are the essence of Thangka images. Hence, Thangka cultural element images classification is one of the most important works of knowledge representation and mining in the field of Thangka and is the foundation of digital protection of Thangka images. However, due to the limited quantity, high complexity and the intricate textures of Thangka images, the classification of Thangka images is limited to a small number of categories and coarse granularity. Thus, a novel fusion texture feature dual-branch Thangka cultural elements classification model based on the attention mechanism and self-supervised contrastive learning has been proposed in this paper. Specifically, to address the issue of insufficient labeled samples and improve the classification performance, this method utilizes a large amount of unlabeled irrelevant data to pre-train the feature extractor through self-supervised learning. During the fine-tuning stage of the downstream task, a dual-branch feature extraction structure incorporating texture features has been designed, and MS Triplet Attention proposed by us is used for the integration of important features. Additionally, to address the problem of sample imbalance and the existence of a large number of difficult samples in the Thangka cultural element dataset, the Gradient Harmonizing Mechanism Loss has been adopted, and it has been improved by introducing a self-designed adaptive mechanism. The experimental results on Thangka cultural elements dataset prove the superiority of the proposed method over the state-of-the-art methods. The source code of our proposed algorithm and the related datasets is available at https://github.com/WiniTang/MS-BiCLR.
引用
收藏
页码:3919 / 3935
页数:17
相关论文
共 50 条
  • [41] SSA-GAT: Graph-Based Self-supervised Learning for Network Intrusion Detection
    Liu, Qian
    Zhang, Hui
    Zhang, Youpeng
    Fan, Lin
    Jin, Xue
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT IX, 2024, 15024 : 476 - 491
  • [42] Multi-label modality enhanced attention based self-supervised deep cross-modal hashing
    Zou, Xitao
    Wu, Song
    Zhang, Nian
    Bakker, Erwin M.
    KNOWLEDGE-BASED SYSTEMS, 2022, 239
  • [43] Self-Supervised Monocular Depth Estimation for Traffic Scenes Based on Dual Attention Mechanism and Adaptive Cost Volume
    Wu G.
    Liu W.
    Hu J.
    Cheng S.
    Yang W.-X.
    Sun L.-K.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2024, 52 (05): : 1670 - 1678
  • [44] Attention-based label consistency for semi-supervised deep learning based image classification
    Chen, Jiaming
    Yang, Meng
    Ling, Jie
    NEUROCOMPUTING, 2021, 453 : 731 - 741
  • [45] Classification Model of Clock Drawing Test Based on Contrastive Learning Using Multi-Channel Features With Channel-Spatial Attention
    Kang, Changsu
    Wang, Bohyun
    Lim, J. S.
    IEEE ACCESS, 2024, 12 : 186466 - 186475
  • [46] Self-Supervised Real-World Image Denoising Based on Multi-Scale Feature Enhancement and Attention Fusion
    Tang, Hailiang
    Zhang, Wenxiao
    Zhu, Hailin
    Zhao, Ke
    IEEE ACCESS, 2024, 12 : 49720 - 49734
  • [47] Brain Tumor Classification Based on Attention Guided Deep Learning Model
    Wen Jun
    Zheng Liyuan
    International Journal of Computational Intelligence Systems, 15
  • [48] Brain Tumor Classification Based on Attention Guided Deep Learning Model
    Wen Jun
    Zheng Liyuan
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2022, 15 (01)
  • [49] GAF-MAE: A Self-Supervised Automatic Modulation Classification Method Based on Gramian Angular Field and Masked Autoencoder
    Shi, Yunhao
    Xu, Hua
    Zhang, Yue
    Qi, Zisen
    Wang, Dan
    IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2024, 10 (01) : 94 - 106
  • [50] A simple self-supervised learning framework with patch-based data augmentation in diagnosis of Alzheimer's disease
    Gong, Haoqiang
    Wang, Zhiwen
    Huang, Shuaihui
    Wang, Jinfeng
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 96